1
|
Guyeux C, Senelle G, Le Meur A, Supply P, Gaudin C, Phelan JE, Clark TG, Rigouts L, de Jong B, Sola C, Refrégier G. Newly Identified Mycobacterium africanum Lineage 10, Central Africa. Emerg Infect Dis 2024; 30:560-563. [PMID: 38407162 PMCID: PMC10902520 DOI: 10.3201/eid3003.231466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2024] Open
Abstract
Analysis of genome sequencing data from >100,000 genomes of Mycobacterium tuberculosis complex using TB-Annotator software revealed a previously unknown lineage, proposed name L10, in central Africa. Phylogenetic reconstruction suggests L10 could represent a missing link in the evolutionary and geographic migration histories of M. africanum.
Collapse
|
2
|
AlKendi W, Gechter F, Heyberger L, Guyeux C. Advancements and Challenges in Handwritten Text Recognition: A Comprehensive Survey. J Imaging 2024; 10:18. [PMID: 38249003 PMCID: PMC10817575 DOI: 10.3390/jimaging10010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 01/03/2024] [Indexed: 01/23/2024] Open
Abstract
Handwritten Text Recognition (HTR) is essential for digitizing historical documents in different kinds of archives. In this study, we introduce a hybrid form archive written in French: the Belfort civil registers of births. The digitization of these historical documents is challenging due to their unique characteristics such as writing style variations, overlapped characters and words, and marginal annotations. The objective of this survey paper is to summarize research on handwritten text documents and provide research directions toward effectively transcribing this French dataset. To achieve this goal, we presented a brief survey of several modern and historical HTR offline systems of different international languages, and the top state-of-the-art contributions reported of the French language specifically. The survey classifies the HTR systems based on techniques employed, datasets used, publication years, and the level of recognition. Furthermore, an analysis of the systems' accuracies is presented, highlighting the best-performing approach. We have also showcased the performance of some HTR commercial systems. In addition, this paper presents a summarization of the HTR datasets that publicly available, especially those identified as benchmark datasets in the International Conference on Document Analysis and Recognition (ICDAR) and the International Conference on Frontiers in Handwriting Recognition (ICFHR) competitions. This paper, therefore, presents updated state-of-the-art research in HTR and highlights new directions in the research field.
Collapse
Affiliation(s)
| | | | - Laurent Heyberger
- FEMTO-ST Institute/RECITS, UMR 6174 CNRS, UTBM, F-90010 Belfort, France;
| | - Christophe Guyeux
- FEMTO-ST Institute/DISC, UMR 6174 CNRS, Université de Franche-Comté, F-90016 Belfort, France;
| |
Collapse
|
3
|
Zein-Eddine R, Hak F, Le Meur A, Genestet C, Dumitrescu O, Guyeux C, Senelle G, Sola C, Refrégier G. The paradoxes of Mycobacterium tuberculosis molecular evolution and consequences for the inference of tuberculosis emergence date. Tuberculosis (Edinb) 2023; 143S:102378. [PMID: 38012921 DOI: 10.1016/j.tube.2023.102378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 07/10/2023] [Accepted: 07/17/2023] [Indexed: 11/29/2023]
Abstract
The date of Mycobacterium tuberculosis complex emergence has been the subject of long debates. New studies joining archaeological efforts with sequencing methods raise high hopes for solving whether this emergence is closer to 70,000 or to 6000 years before present. Inferring the date of emergence of this pathogen based on sequence data requires a molecular clock. Several clocks inferred from different types of loci and/or different samples, using both sound reasoning and reliable data, are actually very different, which we refer to as the paradoxes of M. tuberculosis molecular evolution. After having presented these paradoxes, we will remind the limits of the molecular clocks used in the different studies such as the assumption of homogeneous substitution rate. We will then review recent results that shed new light on the characteristics of M. tuberculosis molecular evolution: traces of diverse selection pressures, the impact of host dynamics, etc. We provide some ideas on what to do next to get nearer to a reliable dating of Mycobacterium tuberculosis complex emergence. Among them, the collection of additional remains from ancient tuberculosis seems still essential.
Collapse
Affiliation(s)
- R Zein-Eddine
- Laboratoire d'Optique et Biosciences, Ecole Polytechnique, Institut National de la Santé et de la Recherche Médicale: U1182, Centre National de la Recherche Scientifique: UMR7645, France
| | - F Hak
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Systématique et Evolution, 91190, Gif-sur-Yvette, France
| | - A Le Meur
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Systématique et Evolution, 91190, Gif-sur-Yvette, France
| | - C Genestet
- CIRI - Centre International de Recherche en Infectiologie, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon-1, Inserm U1111, CNRS UMR5308, Lyon, France; Hospices Civils de Lyon, Institut des Agents Infectieux, Laboratoire de bactériologie, Lyon, France
| | - O Dumitrescu
- CIRI - Centre International de Recherche en Infectiologie, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon-1, Inserm U1111, CNRS UMR5308, Lyon, France; Hospices Civils de Lyon, Institut des Agents Infectieux, Laboratoire de bactériologie, Lyon, France
| | - C Guyeux
- DISC Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000, Besançon, France
| | - G Senelle
- DISC Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000, Besançon, France
| | - C Sola
- Université de Paris, IAME, UMR1137, INSERM, Paris, France; AP-HP, GHU Nord, Service de mycobactériologie spécialisée et de référence, Paris, France; Université Paris-Saclay, Saint-Aubin, France
| | - G Refrégier
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Systématique et Evolution, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
4
|
Senelle G, Sahal MR, La K, Billard-Pomares T, Marin J, Mougari F, Bridier-Nahmias A, Carbonnelle E, Cambau E, Refrégier G, Guyeux C, Sola C. Towards the reconstruction of a global TB history using a new pipeline "TB-Annotator". Tuberculosis (Edinb) 2023; 143S:102376. [PMID: 38012933 DOI: 10.1016/j.tube.2023.102376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 07/03/2023] [Accepted: 07/10/2023] [Indexed: 11/29/2023]
Abstract
Mycobacterium tuberculosis complex (MTBC) has a population structure consisting of 9 human and animal lineages. The genomic diversity within these lineages is a pathogenesis factor that affects virulence, transmissibility, host response, and antibiotic resistance. Hence it is important to develop improved information systems for tracking and understanding the spreading and evolution of genomes. We present results obtained thanks to a new informatics platform for computational biology of MTBC, that uses a convenience sample from public/private SRAs, designated as TB-Annotator. Version 1 was a first interactive graphic-based web tool based on 15,901 representative genomes. Version 2, still interactive, is a more sophisticated database, developed using the Snakemake Workflow Management System (WMS) that allows an unsupervised global and scalable analysis of the content of the USA National Center for Biotechnology Information Short Read Archives database. This platform analyzes nucleotide variants, the presence/absence of genes, known regions of difference and detect new deletions, the insertion sites of mobile genetic elements, and allows phylogenetic trees to be built, imported in a graphical interface and interactively analyzed between the data and the tree. The objective of TB-Annotator is triple: detect recent epidemiological links, reconstruct distant phylogeographical histories as well as perform more complex phenotypic/genotypic Genome-Wide Association Studies (GWAS). In this paper, we compare the various taxonomic SNPs-based labels and hierarchies previously described in recent reference papers for L1, and present a comparative analysis that allows identification of alias and thus provides the basis of a future unifying naming scheme for L1 sublineages. We present a global phylogenetic tree built with RAxML-NG, and one on L2; at the time of writing, we characterized about 200 sublineages, with many new ones; a detail tree for Modern L2 and a hierarchical scheme allowing to facilitate L2 lineage assignment are also presented.
Collapse
Affiliation(s)
- Gaetan Senelle
- FEMTO-ST Institute, UMR 6174, CNRS-Université Bourgogne Franche-Comté (UBFC), France
| | - Muhammed Rabiu Sahal
- Université Paris-Saclay, 91190, Gif-sur-Yvette, France; Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France
| | - Kevin La
- Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France; AP-HP, GHU Nord site Bichat, Service de mycobactériologie spécialisée et de référence, Paris, France
| | - Typhaine Billard-Pomares
- Service de microbiologie clinique, Hôpital Avicenne, 93017, Bobigny, France; Université Paris 13, IAME, Inserm, 93017, Bobigny, France
| | - Julie Marin
- Service de microbiologie clinique, Hôpital Avicenne, 93017, Bobigny, France; Université Paris 13, IAME, Inserm, 93017, Bobigny, France
| | - Faiza Mougari
- Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France; AP-HP, GHU Nord site Bichat, Service de mycobactériologie spécialisée et de référence, Paris, France
| | | | - Etienne Carbonnelle
- Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France; Service de microbiologie clinique, Hôpital Avicenne, 93017, Bobigny, France; Université Paris 13, IAME, Inserm, 93017, Bobigny, France
| | - Emmanuelle Cambau
- Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France; AP-HP, GHU Nord site Bichat, Service de mycobactériologie spécialisée et de référence, Paris, France
| | - Guislaine Refrégier
- Université Paris-Saclay, 91190, Gif-sur-Yvette, France; Ecologie Systématique Evolution, Université Paris-Saclay, CNRS, AgroParisTech, UMR ESE, 91405, Orsay, France
| | - Christophe Guyeux
- FEMTO-ST Institute, UMR 6174, CNRS-Université Bourgogne Franche-Comté (UBFC), France
| | - Christophe Sola
- Université Paris-Saclay, 91190, Gif-sur-Yvette, France; Université Paris-Cité, IAME, UMR 1137, INSERM, Paris, France.
| |
Collapse
|
5
|
Senelle G, Guyeux C, Refrégier G, Sola C. Towards a better understanding of the long-lasting evolutionary history of Mycobacterium tuberculosis. Tuberculosis (Edinb) 2023; 143S:102374. [PMID: 38012920 DOI: 10.1016/j.tube.2023.102374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Revised: 06/30/2023] [Accepted: 07/03/2023] [Indexed: 11/29/2023]
Abstract
The daily increasing sequencing of Mycobacterium tuberculosis has made it possible to establish an advanced phylogeny of this bacterium. It currently includes 9 lineages mainly affecting humans, completed by animal lineages, which form the Mycobacterium tuberculosis complex. Inherited from various historical approaches, this phylogeny is now based on Single Nucleotide Polymorphisms (SNPs), of which updates are frequently proposed. We present here evidence that the task needs refinements: some lineages have currently suboptimal defining SNPs, and many sublineages still need to be named and characterized. These findings are based on a new tool specifically designed to index the entire existing sequencing data. In this article, we focus on lineages 4.5, 4.7, 6 and 7. We take the opportunity to present some evidence that TB-annotator shows strong relevance, identifying well supported sublineages, as well as good global agreement with previous findings.
Collapse
Affiliation(s)
- Gaetan Senelle
- Institut FEMTO-ST, UMR 6174 CNRS, Université de Franche-Comté, France
| | - Christophe Guyeux
- Institut FEMTO-ST, UMR 6174 CNRS, Université de Franche-Comté, France.
| | | | | |
Collapse
|
6
|
Sahal MR, Senelle G, La K, Panda TW, Taura DW, Guyeux C, Cambau E, Sola C. Mycobacterium tuberculosis complex drug-resistance, phylogenetics, and evolution in Nigeria: Comparison with Ghana and Cameroon. PLoS Negl Trop Dis 2023; 17:e0011619. [PMID: 37824575 PMCID: PMC10597489 DOI: 10.1371/journal.pntd.0011619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 10/24/2023] [Accepted: 08/24/2023] [Indexed: 10/14/2023] Open
Abstract
In this article, we provide an in-depth analysis on the drug-resistance phenotypic characteristics of a cohort of 325 tuberculosis and characterize by Whole Genome Sequencing 24 isolates from Nigeria belonging to L4, L5 and L6. Our results suggest an alarming rate of drug-resistance of the L4.6.2.2 Mycobacterium tuberculosis complex (MTBC) lineage and a high diversity of L5. We compiled these new Sequence Read Archives (SRAs) to previously published ones from available Bioprojects run in Nigeria. We performed RAxML phylogenetic reconstructions of larger samples that include public NCBI SRAs from some neighboring countries (Cameroon, Ghana). To confront phylogenetic reconstruction to metadata, we used a new proprietary database named TB-Annotator. We show that L5 genomes in Northern Nigeria belong to new clades as the ones described until now and allow an update of the taxonomy of L5. In addition, we describe the L4.6.2.2 lineage in Nigeria, Cameroon and Ghana. We provide computations on the likely divergence time of L4.6.2.2 and suggest a new hypothesis concerning its origin. Finally we provide a short overview on M. bovis diversity in Nigeria. This study constitutes a baseline knowledge on the global genomic diversity, phylogeography and phylodynamics of MTBC in Nigeria, as well as on the natural history of this largely ignored but densely populated country of Africa. These results highlight the need of sequencing additional MTBC genomes in Nigeria and more generally in West-Africa, both for public health and for academic reasons. The likelihood of replacement of L5-L6 by L4.6.2.2 isolates, leave potentially little time to gather historical knowledge informative on the ancient history of tuberculosis in West-Africa.
Collapse
Affiliation(s)
- Muhammed Rabiu Sahal
- Université Paris-Cité, IAME, Inserm, Paris, France
- Université Paris-Saclay, Gif-sur-Yvette, France
| | - Gaetan Senelle
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Franche-Comté (UFC), 16 Route de Gray, 25000 Besançon, France
| | - Kevin La
- Université Paris-Cité, IAME, Inserm, Paris, France
| | | | | | - Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Franche-Comté (UFC), 16 Route de Gray, 25000 Besançon, France
| | - Emmanuelle Cambau
- Université Paris-Cité, IAME, Inserm, Paris, France
- APHP, GHU Nord site Bichat, Service de Mycobactériologie Spécialisée et de Référence, Paris, France
| | - Christophe Sola
- Université Paris-Cité, IAME, Inserm, Paris, France
- Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
7
|
Hajj S, Azar J, Bou Abdo J, Demerjian J, Guyeux C, Makhoul A, Ginhac D. Cross-Layer Federated Learning for Lightweight IoT Intrusion Detection Systems. Sensors (Basel) 2023; 23:7038. [PMID: 37631575 PMCID: PMC10458682 DOI: 10.3390/s23167038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/03/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023]
Abstract
With the proliferation of IoT devices, ensuring the security and privacy of these devices and their associated data has become a critical challenge. In this paper, we propose a federated sampling and lightweight intrusion-detection system for IoT networks that use K-meansfor sampling network traffic and identifying anomalies in a semi-supervised way. The system is designed to preserve data privacy by performing local clustering on each device and sharing only summary statistics with a central aggregator. The proposed system is particularly suitable for resource-constrained IoT devices such as sensors with limited computational and storage capabilities. We evaluate the system's performance using the publicly available NSL-KDD dataset. Our experiments and simulations demonstrate the effectiveness and efficiency of the proposed intrusion-detection system, highlighting the trade-offs between precision and recall when sharing statistics between workers and the coordinator. Notably, our experiments show that the proposed federated IDS can increase the true-positive rate up to 10% when the workers and the coordinator collaborate.
Collapse
Affiliation(s)
- Suzan Hajj
- Imagerie et Vision Artificielle (ImVIA) Laboratory, Université de Bourgogne Franche-Comté, 21078 Dijon, France
| | - Joseph Azar
- Femto-St Institute, UMR 6174 CNRS, Université de Franche-Comté, 25030 Besançon, France
| | - Jacques Bou Abdo
- School of Information Technology, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Jacques Demerjian
- LaRRIS, Faculty of Sciences, Lebanese University, Fanar P.O. Box 90656, Lebanon
- Computer Science & IT Department, Faculty of Arts and Sciences, Holy Spirit University of Kaslik (USEK), Jounieh P.O. Box 446, Lebanon
| | - Christophe Guyeux
- Femto-St Institute, UMR 6174 CNRS, Université de Franche-Comté, 25030 Besançon, France
| | - Abdallah Makhoul
- Femto-St Institute, UMR 6174 CNRS, Université de Franche-Comté, 25030 Besançon, France
| | - Dominique Ginhac
- Imagerie et Vision Artificielle (ImVIA) Laboratory, Université de Bourgogne Franche-Comté, 21078 Dijon, France
| |
Collapse
|
8
|
Napier G, Couvin D, Refrégier G, Guyeux C, Meehan CJ, Sola C, Campino S, Phelan J, Clark TG. Comparison of in silico predicted Mycobacterium tuberculosis spoligotypes and lineages from whole genome sequencing data. Sci Rep 2023; 13:11368. [PMID: 37443186 PMCID: PMC10345134 DOI: 10.1038/s41598-023-38384-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 07/07/2023] [Indexed: 07/15/2023] Open
Abstract
Bacterial strain-types in the Mycobacterium tuberculosis complex underlie tuberculosis disease, and have been associated with drug resistance, transmissibility, virulence, and host-pathogen interactions. Spoligotyping was developed as a molecular genotyping technique used to determine strain-types, though recent advances in whole genome sequencing (WGS) technology have led to their characterization using SNP-based sub-lineage nomenclature. Notwithstanding, spoligotyping remains an important tool and there is a need to study the congruence between spoligotyping-based and SNP-based sub-lineage assignation. To achieve this, an in silico spoligotype prediction method ("Spolpred2") was developed and integrated into TB-Profiler. Lineage and spoligotype predictions were generated for > 28 k isolates and the overlap between strain-types was characterized. Major spoligotype families detected were Beijing (25.6%), T (18.6%), LAM (13.1%), CAS (9.4%), and EAI (8.3%), and these broadly followed known geographic distributions. Most spoligotypes were perfectly correlated with the main MTBC lineages (L1-L7, plus animal). Conversely, at lower levels of the sub-lineage system, the relationship breaks down, with only 65% of spoligotypes being perfectly associated with a sub-lineage at the second or subsequent levels of the hierarchy. Our work supports the use of spoligotyping (membrane or WGS-based) for low-resolution surveillance, and WGS or SNP-based systems for higher-resolution studies.
Collapse
Affiliation(s)
- Gary Napier
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK
| | - David Couvin
- Institut Pasteur de la Guadeloupe, Les Abymes, Guadeloupe
| | - Guislaine Refrégier
- Université Paris-Saclay, Saint-Aubin, France
- CNRS, UMR ESE, AgroParisTech, 91405, Orsay, France
| | - Christophe Guyeux
- DISC Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000, Besançon, France
| | | | - Christophe Sola
- Université Paris-Saclay, Saint-Aubin, France
- IAME, UMR1137, Université Paris-Cité, INSERM, Paris, France
| | - Susana Campino
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK
| | - Jody Phelan
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK.
| | - Taane G Clark
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK.
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK.
| |
Collapse
|
9
|
Couturier R, Dionis E, Guérin S, Guyeux C, Sugny D. Characterization of a Driven Two-Level Quantum System by Supervised Learning. Entropy (Basel) 2023; 25:446. [PMID: 36981334 PMCID: PMC10048282 DOI: 10.3390/e25030446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/26/2023] [Accepted: 02/27/2023] [Indexed: 06/18/2023]
Abstract
We investigate the extent to which a two-level quantum system subjected to an external time-dependent drive can be characterized by supervised learning. We apply this approach to the case of bang-bang control and the estimation of the offset and the final distance to a given target state. For any control protocol, the goal is to find the mapping between the offset and the distance. This mapping is interpolated using a neural network. The estimate is global in the sense that no a priori knowledge is required on the relation to be determined. Different neural network algorithms are tested on a series of data sets. We show that the mapping can be reproduced with very high precision in the direct case when the offset is known, while obstacles appear in the indirect case starting from the distance to the target. We point out the limits of the estimation procedure with respect to the properties of the mapping to be interpolated. We discuss the physical relevance of the different results.
Collapse
Affiliation(s)
- Raphaël Couturier
- Université de Franche-Comté, CNRS, Institut FEMTO-ST, F-90000 Belfort, France
| | - Etienne Dionis
- Laboratoire Interdisciplinaire Carnot de Bourgogne (ICB), UMR 6303 CNRS-Université de Bourgogne, 9 Av. A. Savary, BP 47 870, CEDEX, F-21078 Dijon, France
| | - Stéphane Guérin
- Laboratoire Interdisciplinaire Carnot de Bourgogne (ICB), UMR 6303 CNRS-Université de Bourgogne, 9 Av. A. Savary, BP 47 870, CEDEX, F-21078 Dijon, France
| | - Christophe Guyeux
- Université de Franche-Comté, CNRS, Institut FEMTO-ST, F-90000 Belfort, France
| | - Dominique Sugny
- Laboratoire Interdisciplinaire Carnot de Bourgogne (ICB), UMR 6303 CNRS-Université de Bourgogne, 9 Av. A. Savary, BP 47 870, CEDEX, F-21078 Dijon, France
| |
Collapse
|
10
|
Cerna S, Guyeux C, Laiymani D. The usefulness of NLP techniques for predicting peaks in firefighter interventions due to rare events. Neural Comput Appl 2022; 34:10117-10132. [PMID: 35250179 PMCID: PMC8881897 DOI: 10.1007/s00521-022-06996-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 01/30/2022] [Indexed: 11/26/2022]
Abstract
In some countries such as France, the number of operations assisted by firefighters has shown an almost linear increase over the years, contrary to their resource capacity. For this reason, predicting the number of interventions has become a necessity. Initially, time series models were developed with several types of qualitative and quantitative features, including the alert level of the bulletins, to predict the operational load. We realized that interventions related to human activities are quite predictable. However, the recognition of interventions due to rare events such as storms or floods needs more than quantitative meteorological data to be identified, since there are almost always zero cases. Thus, this work proposes the application of natural language processing techniques, namely long short-term memory, convolutional neural networks, FlauBERT, and CamemBERT to extract features from the texts of weather bulletins in order to recognize periods with peak interventions, where the intense workload of firefighters is caused by rare events. Four categories identified as Emergency Person Rescue, Total Person Rescue, interventions related to Heating, and Storm/Flood were our targets for the multilabel classification models developed. The results showed a remarkable accuracy of 80%, 86%, 92%, and 86% for Emergency Rescue People, Total Rescue People, Heating, and Storm/Flood, respectively.
Collapse
Affiliation(s)
- Selene Cerna
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| | - Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| | - David Laiymani
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| |
Collapse
|
11
|
Guyeux C, Senelle G, Refrégier G, Bretelle-Establet F, Cambau E, Sola C. Connection between two historical tuberculosis outbreak sites in Japan, Honshu, by a new ancestral Mycobacterium tuberculosis L2 sublineage. Epidemiol Infect 2022; 150:1-25. [PMID: 35042579 PMCID: PMC8931808 DOI: 10.1017/s0950268822000048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 12/24/2021] [Accepted: 01/03/2022] [Indexed: 11/07/2022] Open
Abstract
By gathering 680 publicly available Sequence Read Archives from isolates of Mycobacterium tuberculosis complex (MTBC) including 190 belonging to the lineage 2 Beijing , and using an in-house bioinformatical pipeline, the TB-Annotator , that analyses more than 50 000 characters, we describe herein a new L2 sublineage from 20 isolates found in the Tochigi province, (Japan), that we designate as asia ancestral 5 (AAnc5). These isolates harbour a number of specific criteria (42 SNPs) and their intra-cluster pairwise distance suggests historical and not epidemiological transmission. These isolates harbour a mutation in rpoC , and do not fulfil, any of the modern Beijing lineage criteria, nor any of the other ancestral Beijing lineages described so far. Asia ancestral 5 isolates do not possess mutT2 58 and ogt 12 characteristics of modern Beijing , but possess ancestral Beijing SNPs characteristics. By looking into the literature, we found a reference isolate ID381, described in Kobe and Osaka belonging to the ‘G3’ group, sharing 36 out of the 42 specific SNPs found in AAnc5. We also assessed the intermediate position of the asia ancestral 4 (AAnc4) sublineage recently described in Thailand and propose an improved classification of the L2 that now includes AAnc4 and AAnc5. By increasing the recruitment into TB-Annotator to around 3000 genomes (including 642 belonging to L2), we confirmed our results and discovered additional historical ancestral L2 branches that remain to be investigated in more detail. We also present, in addition, some anthropological and historical data from Chinese and Japan history of tuberculosis, as well as from Korea, that could support our results on L2 evolution. This study shows that the reconstruction of the early history of tuberculosis in Asia is likely to reveal complex patterns since its emergence.
Collapse
Affiliation(s)
- Christophe Guyeux
- DISC Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000Besançon, France
| | - Gaetan Senelle
- DISC Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000Besançon, France
| | - Guislaine Refrégier
- Université Paris-Saclay, Saint-Aubin, France
- Université Paris-Saclay, CNRS, AgroParisTech, UMR ESE, 91405, Orsay, France
| | | | - Emmanuelle Cambau
- Université de Paris, IAME, UMR1137, INSERM, Paris, France
- AP-HP, GHU Nord, service de mycobactériologie spécialisée et de référence, Laboratoire associé du Centre National de Référence des mycobactéries et résistance des mycobactéries aux antituberculeux (CNR-MyRMA), Paris, France
| | - Christophe Sola
- Université Paris-Saclay, Saint-Aubin, France
- Université de Paris, IAME, UMR1137, INSERM, Paris, France
| |
Collapse
|
12
|
Nerich V, Guyeux C, Henry-Amar M, Couturier R, Thieblemont C, Ribrag V, Tilly H, Haioun C, Casasnovas RO, Morschhauser F, Feugier P, Sibon D, Ysebaert L, Nicolas-Virelizier E, Broussais-Guillaumot F, Damaj GL, Jais JP, Salles G, Woronoff-Lemsi M, Mounier N. Economic burden in non-Hodgkin lymphoma survivors: The French Lymphoma Study Association SIMONAL cross-sectional study. Cancer 2021; 128:519-528. [PMID: 34605020 DOI: 10.1002/cncr.33938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 08/19/2021] [Accepted: 08/24/2021] [Indexed: 12/30/2022]
Abstract
BACKGROUND No study has focused on the economic burden in non-Hodgkin lymphoma (NHL) survivors, even though this knowledge is essential. This study reports on health care resource use and associated health care costs as well as related factors in a series of 1671 French long-term NHL survivors. METHODS Health care costs were measured from the payer perspective. Only direct medical costs (medical consultations, outpatient treatments, hospitalizations, and medical transport) in the past 12 months were included (reference year 2015). Multiple linear regression was used to search for explanatory factors of health care costs. RESULTS In total, 1100 survivors (66%) reported having used at least 1 health care resource, and 867 (52%) reported having used at least 1 outpatient treatment. After the authors accounted for missing data, the mean health care cost was estimated at €702 ± €2221. Hospitalizations and outpatient treatments were the main cost drivers. Sensitivity analyses confirmed the robustness of the results. For the 1100 survivors who reported using at least 1 health care resource, the mean health care cost was €1067 ± €2268. Several factors demonstrated statistically significant relationships with health care costs. For instance, cardiovascular disorders increased costs by 66% ± 16%. In contrast, rituximab or autologous stem cell transplantation as initial therapy had no effect on health care costs. CONCLUSIONS The consideration of economic constraints in health care is now a reality. This retrospective study reports on a better understanding of health care resource use and associated health care costs as well as related factors. It may help health care professionals in their ongoing efforts to design person-centered health care pathways.
Collapse
Affiliation(s)
- Virginie Nerich
- Department of Pharmacy, University Hospital, Besançon, France.,INSERM, EFS-BFC, UMR1098, University of Franche-Comté, Besançon, France
| | - Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, University of Bourgogne Franche-Comté, Besançon, France
| | - Michel Henry-Amar
- French Center on eHealth, North-West Region Data Processing Center and French National League Against Cancer Clinical Research Platform, CCC François Baclesse, Caen, France
| | - Raphaël Couturier
- Femto-ST Institute, UMR 6174 CNRS, University of Bourgogne Franche-Comté, Besançon, France
| | - Catherine Thieblemont
- Hemato-Oncology Unit, Saint-Louis University Hospital Center, Public Hospital Network of Paris, Paris, France
| | - Vincent Ribrag
- Hematology Unit, Gustave Roussy Cancer Campus, Villejuif, France
| | - Hervé Tilly
- Hematology Department and French Institute of Health and Medical Research Unit 1243, Henri Becquerel Center, Rouen, France
| | - Corinne Haioun
- Lymphoid Malignancies Unit, Henri Mondor University Hospital Center, Public Hospital Network of Paris, Créteil, France
| | - René-Olivier Casasnovas
- Hematology Unit and French Institute of Health and Medical Research Unit 1231, Bocage Hospital, Dijon Bourgogne Regional University Hospital Center, Dijon, France
| | - Franck Morschhauser
- Hematology Transfusion Institute, Claude Huriez Hospital, Lille Regional University Hospital Center, Lille, France
| | - Pierre Feugier
- Hematology Unit, Brabois Hospital, Nancy University Hospital Center, Vandoeuvre-lès-Nancy, France
| | - David Sibon
- Hematology Unit, Necker University Hospital for Sick Children, Public Hospital Network of Paris, Paris, France
| | - Loic Ysebaert
- Oncopole, Toulouse University Cancer Institute, Toulouse, France
| | | | | | - Gandhi L Damaj
- Basse-Normandy Hematology Institute, Côte de Nacre Regional University Hospital Center, Caen, France
| | - Jean-Philippe Jais
- Laboratory of Biostatistics, Paris V University-Descartes, Paris, France
| | - Gilles Salles
- Faculty of Medicine, Claude Bernard University, Lyon, France.,Hematology Department, Lyon South Hospital Center, Pierre-Bénite, France
| | - Macha Woronoff-Lemsi
- INSERM, EFS-BFC, UMR1098, University of Franche-Comté, Besançon, France.,Department of Clinical Research and Innovation, University Hospital, Besançon, France
| | - Nicolas Mounier
- Onco-Hematology Unit, l'Archet Hospital, Nice University Hospital Center, University of Côte d'Azur, Nice, France
| |
Collapse
|
13
|
Guyeux C, Sola C, Noûs C, Refrégier G. CRISPRbuilder-TB: "CRISPR-builder for tuberculosis". Exhaustive reconstruction of the CRISPR locus in mycobacterium tuberculosis complex using SRA. PLoS Comput Biol 2021; 17:e1008500. [PMID: 33667225 PMCID: PMC7968741 DOI: 10.1371/journal.pcbi.1008500] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 03/17/2021] [Accepted: 11/08/2020] [Indexed: 11/18/2022] Open
Abstract
Mycobacterium tuberculosis complex (MTC) CRISPR locus diversity has long been studied solely investigating the presence/absence of a known set of spacers. Unveiling the genetic mechanisms of its evolution requires a more exhaustive reconstruction in a large amount of representative strains. In this article, we point out and resolve, with a new pipeline, the problem of CRISPR reconstruction based directly on short read sequences in M. tuberculosis. We first show that the process we set up, that we coin as “CRISPRbuilder-TB” (https://github.com/cguyeux/CRISPRbuilder-TB), allows an efficient reconstruction of simulated or real CRISPRs, even when including complex evolutionary steps like the insertions of mobile elements. Compared to more generalist tools, the whole process is much more precise and robust, and requires only minimal manual investigation. Second, we show that more than 1/3 of the currently complete genomes available for this complex in the public databases contain largely erroneous CRISPR loci. Third, we highlight how both the classical experimental in vitro approach and the basic in silico spoligotyping provided by existing analytic tools miss a whole diversity of this locus in MTC, by not capturing duplications, spacer and direct repeats variants, and IS6110 insertion locations. This description is extended in a second article that describes MTC-CRISPR diversity and suggests general rules for its evolution. This work opens perspectives for an in-depth exploration of M. tuberculosis CRISPR loci diversity and of mechanisms involved in its evolution and its functionality, as well as its adaptation to other CRISPR locus-harboring bacterial species. In this article, we tackle the bioinformatical issue of the reconstruction of the Mycobacterium tuberculosis complex CRISPR locus using short read sequences without requiring genome assembly. We first show that many complete genomes, as found in public databases and often reconstructed by de novo assemblies, often contain errors on this locus as well as on other repeated sequences. We provide an in-depth description of our new method, designated as ‘CRISPRbuilder-TB’, and we show that our method provides much more exhaustive and reliable information (on DR variants, spacer diversity, global structure) than Crass and CRISPR_detector. The new and unsuspected genomic diversity we detected is described in a companion paper. Scripts are available to adapt the tool to other species.
Collapse
Affiliation(s)
- Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Department, Univ. Bourgogne Franche-Comté (UBFC), Besançon, France
- * E-mail:
| | - Christophe Sola
- IAME, UMR1137 INSERM, Université Paris, Université Paris Nord
- 3 Université Paris-Saclay, Saint-Aubin, France
| | - Camille Noûs
- IAME, UMR1137 INSERM, Université Paris, Université Paris Nord
| | - Guislaine Refrégier
- 4 Ecologie Systematique Evolution, Batiment 360, Université Paris-Saclay, CNRS, AgroParisTech,Orsay 91400, France
| |
Collapse
|
14
|
Valette N, Renou J, Boutilliat A, Fernández-González AJ, Gautier V, Silar P, Guyeux C, Charr JC, Cuenot S, Rose C, Gelhaye E, Morel-Rouhier M. OSIP1 is a self-assembling DUF3129 protein required to protect fungal cells from toxins and stressors. Environ Microbiol 2021; 23:1594-1607. [PMID: 33393164 DOI: 10.1111/1462-2920.15381] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Accepted: 12/28/2020] [Indexed: 11/28/2022]
Abstract
Secreted proteins are key players in fungal physiology and cell protection against external stressing agents and antifungals. Oak stress-induced protein 1 (OSIP1) is a fungal-specific protein with unknown function. By using Podospora anserina and Phanerochaete chrysosporium as models, we combined both in vivo functional approaches and biophysical characterization of OSIP1 recombinant protein. The P. anserina OSIP1Δ mutant showed an increased sensitivity to the antifungal caspofungin compared to the wild type. This correlated with the production of a weakened extracellular exopolysaccharide/protein matrix (ECM). Since the recombinant OSIP1 from P. chrysosporium self-assembled as fibers and was capable of gelation, it is likely that OSIP1 is linked to ECM formation that acts as a physical barrier preventing drug toxicity. Moreover, compared to the wild type, the OSIP1Δ mutant was more sensitive to oak extractives including chaotropic phenols and benzenes. It exhibited a strongly modified secretome pattern and an increased production of proteins associated to the cell-wall integrity signalling pathway, when grown on oak sawdust. This demonstrates that OSIP1 has also an important role in fungal resistance to extractive-induced stress.
Collapse
Affiliation(s)
- Nicolas Valette
- Université de Lorraine, INRAE, Interactions Arbres/Micro-organismes (IAM), UMR 1136, Nancy, 54000, France
| | - Julien Renou
- Université de Lorraine, INRAE, Interactions Arbres/Micro-organismes (IAM), UMR 1136, Nancy, 54000, France
| | - Alexis Boutilliat
- Université de Lorraine, INRAE, Interactions Arbres/Micro-organismes (IAM), UMR 1136, Nancy, 54000, France
| | | | - Valérie Gautier
- Université Paris Diderot, Sorbonne Paris Cité, Laboratoire Interdisciplinaire des Energies de Demain (LIED), Paris, 75205, France
| | - Philippe Silar
- Université Paris Diderot, Sorbonne Paris Cité, Laboratoire Interdisciplinaire des Energies de Demain (LIED), Paris, 75205, France
| | - Christophe Guyeux
- Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, 16 route de Gray, Besançon, 25030, France
| | - Jean-Claude Charr
- Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, 16 route de Gray, Besançon, 25030, France
| | - Stéphane Cuenot
- Institut des Matériaux Jean Rouxel, Université de Nantes, 2 rue de la Houssinière, Nantes Cedex 3, 44322, France
| | - Christophe Rose
- Université de Lorraine, AgroParisTech, INRAE, UMR Silva, Nancy, 54000, France
| | - Eric Gelhaye
- Université de Lorraine, INRAE, Interactions Arbres/Micro-organismes (IAM), UMR 1136, Nancy, 54000, France
| | - Mélanie Morel-Rouhier
- Université de Lorraine, INRAE, Interactions Arbres/Micro-organismes (IAM), UMR 1136, Nancy, 54000, France
| |
Collapse
|
15
|
Refrégier G, Sola C, Guyeux C. Unexpected diversity of CRISPR unveils some evolutionary patterns of repeated sequences in Mycobacterium tuberculosis. BMC Genomics 2020; 21:841. [PMID: 33256602 PMCID: PMC7708916 DOI: 10.1186/s12864-020-07178-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 10/22/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Diversity of the CRISPR locus of Mycobacterium tuberculosis complex has been studied since 1997 for molecular epidemiology purposes. By targeting solely the 43 spacers present in the two first sequenced genomes (H37Rv and BCG), it gave a biased idea of CRISPR diversity and ignored diversity in the neighbouring cas-genes. RESULTS We set up tailored pipelines to explore the diversity of CRISPR-cas locus in Short Reads. We analyzed data from a representative set of 198 clinical isolates as evidenced by well-characterized SNPs. We found a relatively low diversity in terms of spacers: we recovered only the 68 spacers that had been described in 2000. We found no partial or global inversions in the sequences, letting always the Direct Variant Repeats (DVR) in the same order. In contrast, we found an unexpected diversity in the form of: SNPs in spacers and in Direct Repeats, duplications of various length, and insertions at various locations of the IS6110 insertion sequence, as well as blocks of DVR deletions. The diversity was in part specific to lineages. When reconstructing evolutionary steps of the locus, we found no evidence for SNP reversal. DVR deletions were linked to recombination between IS6110 insertions or between Direct Repeats. CONCLUSION This work definitively shows that CRISPR locus of M. tuberculosis did not evolve by classical CRISPR adaptation (incorporation of new spacers) since the last most recent common ancestor of virulent lineages. The evolutionary mechanisms that we discovered could be involved in bacterial adaptation but in a way that remains to be identified.
Collapse
Affiliation(s)
- Guislaine Refrégier
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, cedex, 91198, Gif-sur-Yvette, France.
| | - Christophe Sola
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, cedex, 91198, Gif-sur-Yvette, France.
| | - Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, 25000, Besançon, France
| |
Collapse
|
16
|
Arcolezi HH, Couchot JF, Cerna S, Guyeux C, Royer G, Bouna BA, Xiao X. Forecasting the number of firefighter interventions per region with local-differential-privacy-based data. Comput Secur 2020. [DOI: 10.1016/j.cose.2020.101888] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
17
|
Charr JC, Garavito A, Guyeux C, Crouzillat D, Descombes P, Fournier C, Ly SN, Raharimalala EN, Rakotomalala JJ, Stoffelen P, Janssens S, Hamon P, Guyot R. Complex evolutionary history of coffees revealed by full plastid genomes and 28,800 nuclear SNP analyses, with particular emphasis on Coffea canephora (Robusta coffee). Mol Phylogenet Evol 2020; 151:106906. [PMID: 32653553 DOI: 10.1016/j.ympev.2020.106906] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 06/17/2020] [Accepted: 07/06/2020] [Indexed: 11/16/2022]
Abstract
For decades coffees were associated with the genus Coffea. In 2011, the closely related genus Psilanthus was subsumed into Coffea. However, results obtained in 2017-based on 28,800 nuclear SNPs-indicated that there is not substantial phylogenetic support for this incorporation. In addition, a recent study of 16 plastid full-genome sequences highlighted an incongruous placement of Coffea canephora (Robusta coffee) between maternal and nuclear trees. In this study, similar global features of the plastid genomes of Psilanthus and Coffea are observed. In agreement with morphological and physiological traits, the nuclear phylogenetic tree clearly separates Psilanthus from Coffea (with exception to C. rhamnifolia, closer to Psilanthus than to Coffea). In contrast, the maternal molecular tree was incongruent with both morphological and nuclear differentiation, with four main clades observed, two of which include both Psilanthus and Coffea species, and two with either Psilanthus or Coffea species. Interestingly, Coffea and Psilanthus taxa sampled in West and Central Africa are members of the same group. Several mechanisms such as the retention of ancestral polymorphisms due to incomplete lineage sorting, hybridization leading to homoploidy (without chromosome doubling) and alloploidy (for C. arabica) are involved in the evolutionary history of the coffee species. While sharing similar morphological characteristics, the genetic relationships within C. canephora have shown that some populations are well differentiated and genetically isolated. Given the position of its closely-related species, we may also consider C. canephora to be undergoing a long process of speciation with an intermediate step of (sub-)speciation.
Collapse
Affiliation(s)
- Jean-Claude Charr
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, France.
| | - Andrea Garavito
- Departamento de Ciencias biológicas, Facultad de Ciencias Exactas y Naturales, Universidad de Caldas, Manizales, Colombia
| | - Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, France.
| | | | | | | | - Serigne N Ly
- Institut de Recherche pour le Développement, UMR DIADE, CIRAD, Université de Montpellier, France.
| | | | | | - Piet Stoffelen
- Meise Botanic Garden, Nieuwelaan 38, BE-1860 Meise, Belgium.
| | - Steven Janssens
- Meise Botanic Garden, Nieuwelaan 38, BE-1860 Meise, Belgium.
| | - Perla Hamon
- Institut de Recherche pour le Développement, UMR DIADE, CIRAD, Université de Montpellier, France.
| | - Romain Guyot
- Institut de Recherche pour le Développement, UMR DIADE, CIRAD, Université de Montpellier, France; Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales, Colombia.
| |
Collapse
|
18
|
Ly SN, Garavito A, De Block P, Asselman P, Guyeux C, Charr JC, Janssens S, Mouly A, Hamon P, Guyot R. Chloroplast genomes of Rubiaceae: Comparative genomics and molecular phylogeny in subfamily Ixoroideae. PLoS One 2020; 15:e0232295. [PMID: 32353023 PMCID: PMC7192488 DOI: 10.1371/journal.pone.0232295] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 04/11/2020] [Indexed: 11/19/2022] Open
Abstract
In Rubiaceae phylogenetics, the number of markers often proved a limitation with authors failing to provide well-supported trees at tribal and generic levels. A robust phylogeny is a prerequisite to study the evolutionary patterns of traits at different taxonomic levels. Advances in next-generation sequencing technologies have revolutionized biology by providing, at reduced cost, huge amounts of data for an increased number of species. Due to their highly conserved structure, generally recombination-free, and mostly uniparental inheritance, chloroplast DNA sequences have long been used as choice markers for plant phylogeny reconstruction. The main objectives of this study are: 1) to gain insight in chloroplast genome evolution in the Rubiaceae (Ixoroideae) through efficient methodology for de novo assembly of plastid genomes; and, 2) to test the efficiency of mining SNPs in the nuclear genome of Ixoroideae based on the use of a coffee reference genome to produce well-supported nuclear trees. We assembled whole chloroplast genome sequences for 27 species of the Rubiaceae subfamily Ixoroideae using next-generation sequences. Analysis of the plastid genome structure reveals a relatively good conservation of gene content and order. Generally, low variation was observed between taxa in the boundary regions with the exception of the inverted repeat at both the large and short single copy junctions for some taxa. An average of 79% of the SNP determined in the Coffea genus are transferable to Ixoroideae, with variation ranging from 35% to 96%. In general, the plastid and the nuclear genome phylogenies are congruent with each other. They are well-resolved with well-supported branches. Generally, the tribes form well-identified clades but the tribe Sherbournieae is shown to be polyphyletic. The results are discussed relative to the methodology used and the chloroplast genome features in Rubiaceae and compared to previous Rubiaceae phylogenies.
Collapse
Affiliation(s)
- Serigne Ndiawar Ly
- Institut de Recherche pour le Développement, UMR DIADE, Université de Montpellier, Montpellier, France
| | - Andrea Garavito
- Departamento Ciencias Biológicas, Universidad de Caldas, Manizales, Colombia
| | | | - Pieter Asselman
- Meise Botanic Garden, Meise, Belgium
- University of Ghent, Ghent, Belgium
| | - Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | - Jean-Claude Charr
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | | | - Arnaud Mouly
- Laboratory Chrono-Environment, UMR CNRS 6249, Université de Bourgogne Franche-Comté, Besançon, France
- Besançon Botanic Garden, Université de Bourgogne Franche-Comté, Besançon, France
| | - Perla Hamon
- Institut de Recherche pour le Développement, UMR DIADE, Université de Montpellier, Montpellier, France
| | - Romain Guyot
- Institut de Recherche pour le Développement, UMR DIADE, Université de Montpellier, Montpellier, France
- Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales, Colombia
- * E-mail:
| |
Collapse
|
19
|
Guyeux C, Charr JC, Abdo JB, Demerjian J. Advances in the enumeration of foldable self-avoiding walks. IJCSE 2020. [DOI: 10.1504/ijcse.2020.109398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
20
|
Demerjian J, Guyeux C, Abdo JB, Charr JC. Advances in the enumeration of foldable self-avoiding walks. IJCSE 2020. [DOI: 10.1504/ijcse.2020.10031596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
21
|
Guyeux C, Salomon M, Al-Nuaimi B, AlKindy B, Couchot JF. Ancestral Reconstruction and Investigations of Genomic Recombination on some Pentapetalae Chloroplasts. J Integr Bioinform 2019; 16:/j/jib.ahead-of-print/jib-2018-0057/jib-2018-0057.xml. [PMID: 31860470 PMCID: PMC7074144 DOI: 10.1515/jib-2018-0057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 10/14/2019] [Indexed: 11/15/2022] Open
Abstract
In this article, we propose a semi-automated method to rebuild genome ancestors of chloroplasts by taking into account gene duplication. Two methods have been used in order to achieve this work: a naked eye investigation using homemade scripts, whose results are considered as a basis of knowledge, and a dynamic programming based approach similar to Needleman-Wunsch. The latter fundamentally uses the Gestalt pattern matching method of sequence matcher to evaluate the occurrences probability of each gene in the last common ancestor of two given genomes. The two approaches have been applied on chloroplastic genomes from Apiales, Asterales, and Fabids orders, the latter belonging to Pentapetalae group. We found that Apiales species do not undergo indels, while they occur in the Asterales and Fabids orders. A series of experiments was then carried out to extensively verify our findings by comparing the obtained ancestral reconstruction results with the latest released approach called MLGO (Maximum Likelihood for Gene-Order analysis).
Collapse
Affiliation(s)
- Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, Besançon, France
| | - Michel Salomon
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, Besançon, France
| | - Bashar Al-Nuaimi
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, Besançon, France.,Department of Computer Science, University of Diyala, Baqubah, Iraq
| | - Bassam AlKindy
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, Besançon, France.,Department of Computer Science, University of Mustansiriyah, Baghdad, Iraq
| | - Jean-François Couchot
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, Besançon, France
| |
Collapse
|
22
|
Treepong P, Guyeux C, Meunier A, Couchoud C, Hocquet D, Valot B. panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data. Bioinformatics 2019; 34:3795-3800. [PMID: 29931098 DOI: 10.1093/bioinformatics/bty479] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 06/13/2018] [Indexed: 11/14/2022] Open
Abstract
Motivation The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic species Pseudomonas aeruginosa, and report 43 insertions of five different ISs (among which three are new) and a burst of ISPa1635 in a hypermutator isolate. Availability and implementation PanISa is implemented in Python and released as an open source software (GPL3) at https://github.com/bvalot/panISa. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Panisa Treepong
- Département DISC, UMR CNRS 6174 Institut FEMTO-ST, Université de Bourgogne, Franche-Comté, Besançon, France.,Faculty of Technology and Environment, Prince of Songkla University, Phuket, Thailand
| | - Christophe Guyeux
- Département DISC, UMR CNRS 6174 Institut FEMTO-ST, Université de Bourgogne, Franche-Comté, Besançon, France
| | - Alexandre Meunier
- Laboratoire d'Hygiène Hospitalière, Centre Hospitalier Régional Universitaire, Besançon, France.,UMR CNRS 6249, Chrono-environnement, Université de Bourgogne Franche-Comté, Besançon, France
| | - Charlotte Couchoud
- Laboratoire d'Hygiène Hospitalière, Centre Hospitalier Régional Universitaire, Besançon, France
| | - Didier Hocquet
- Laboratoire d'Hygiène Hospitalière, Centre Hospitalier Régional Universitaire, Besançon, France.,UMR CNRS 6249, Chrono-environnement, Université de Bourgogne Franche-Comté, Besançon, France
| | - Benoit Valot
- UMR CNRS 6249, Chrono-environnement, Université de Bourgogne Franche-Comté, Besançon, France
| |
Collapse
|
23
|
Matar J, Khoury HE, Charr JC, Guyeux C, Chrétien S. SpCLUST: Towards a fast and reliable clustering for potentially divergent biological sequences. Comput Biol Med 2019; 114:103439. [PMID: 31550555 DOI: 10.1016/j.compbiomed.2019.103439] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Revised: 09/05/2019] [Accepted: 09/05/2019] [Indexed: 11/18/2022]
Abstract
This paper presents SpCLUST, a new C++ package that takes a list of sequences as input, aligns them with MUSCLE, computes their similarity matrix in parallel and then performs the clustering. SpCLUST extends a previously released software by integrating additional scoring matrices which enables it to cover the clustering of amino-acid sequences. The similarity matrix is now computed in parallel according to the master/slave distributed architecture, using MPI. Performance analysis, realized on two real datasets of 100 nucleotide sequences and 1049 amino-acids ones, show that the resulting library substantially outperforms the original Python package. The proposed package was also intensively evaluated on simulated and real genomic and protein data sets. The clustering results were compared to the most known traditional tools, such as UCLUST, CD-HIT and DNACLUST. The comparison showed that SpCLUST outperforms the other tools when clustering divergent sequences, and contrary to the others, it does not require any user intervention or prior knowledge about the input sequences.
Collapse
Affiliation(s)
- Johny Matar
- Université de Bourgogne Franche-Comté, UMR 6174 CNRS, 16 route de Gray, Besançon, France; LaRRIS, Faculty of Science, Lebanese University, Fanar, Lebanon
| | | | - Jean-Claude Charr
- Université de Bourgogne Franche-Comté, UMR 6174 CNRS, 16 route de Gray, Besançon, France.
| | - Christophe Guyeux
- Université de Bourgogne Franche-Comté, UMR 6174 CNRS, 16 route de Gray, Besançon, France
| | - Stéphane Chrétien
- National Physical Laboratory, Hampton Road, Teddington, United Kingdom
| |
Collapse
|
24
|
Guyeux C, Charr JC, Tran HTM, Furtado A, Henry RJ, Crouzillat D, Guyot R, Hamon P. Evaluation of chloroplast genome annotation tools and application to analysis of the evolution of coffee species. PLoS One 2019; 14:e0216347. [PMID: 31188829 PMCID: PMC6561552 DOI: 10.1371/journal.pone.0216347] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Accepted: 04/18/2019] [Indexed: 12/13/2022] Open
Abstract
Chloroplast sequences are widely used for phylogenetic analysis due to their high degree of conservation in plants. Whole chloroplast genomes can now be readily obtained for plant species using new sequencing methods, giving invaluable data for plant evolution However new annotation methods are required for the efficient analysis of this data to deliver high quality phylogenetic analyses. In this study, the two main tools for chloroplast genome annotation were compared. More consistent detection and annotation of genes were produced with GeSeq when compared to the currently used Dogma. This suggests that the annotation of most of the previously annotated chloroplast genomes should now be updated. GeSeq was applied to species related to coffee, including 16 species of the Coffea and Psilanthus genera to reconstruct the ancestral chloroplast genomes and to evaluate their phylogenetic relationships. Eight genes in the plant chloroplast pan genome (consisting of 92 genes) were always absent in the coffee species analyzed. Notably, the two main cultivated coffee species (i.e. Arabica and Robusta) did not group into the same clade and differ in their pattern of gene evolution. While Arabica coffee (Coffea arabica) belongs to the Coffea genus, Robusta coffee (Coffea canephora) is associated with the Psilanthus genus. A more extensive survey of related species is required to determine if this is a unique attribute of Robusta coffee or a more widespread feature of coffee tree species.
Collapse
Affiliation(s)
- Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | - Jean-Claude Charr
- Femto-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | - Hue T. M. Tran
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD, Australia
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD, Australia
| | - Robert J. Henry
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD, Australia
| | | | - Romain Guyot
- Institut de Recherche pour le Développement, UMR IPME, CIRAD, Université de Montpellier, Montpellier, France
- Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Perla Hamon
- Institut de Recherche pour le Développement, UMR DIADE, Université de Montpellier, Montpellier, France
| |
Collapse
|
25
|
|
26
|
Abstract
BACKGROUND To reconstruct the evolution history of DNA sequences, novel models of increasing complexity regarding the number of free parameters taken into account in the sequence evolution, as well as faster and more accurate algorithms, and statistical and computational methods, are needed. More particularly, as the principal forces that have led to major structural changes are genome rearrangements (such as translocations, fusions, and so on), understanding their underlying mechanisms, among other things via the ancestral genome reconstruction, are essential. In this problem, since finding the ancestral genomes that minimize the number of rearrangements in a phylogenetic tree is known to be NP-hard for three or more genomes, heuristics are commonly chosen to obtain approximations of the exact solution. The aim of this work is to show that another path is possible. RESULTS Various algorithms and software already deal with the difficult nature of the problem of reconstruction of the ancestral genome, but they do not function with precision, in particular when indels or single nucleotide polymorphisms fall into repeated sequences. In this article, and despite the theoretical NP-hardness of the ancestral reconstruction problem, we show that an exact solution can be found in practice in various cases, encompassing organelles and some bacteria. A practical example proves that an accurate reconstruction, which also allows to highlight homoplasic events, can be obtained. This is illustrated by the reconstruction of ancestral genomes of two bacterial pathogens, belonging in Mycobacterium and Brucella genera. CONCLUSIONS By putting together automatically reconstructed ancestral regions with handmade ones for problematic cases, we show that an accurate reconstruction of ancestors of the Brucella genus and of the Mycobacterium tuberculosis complex is possible. By doing so, we are able to investigate the evolutionary history of each pathogen by computing their common ancestors. They can be investigated extensively, by studying the gene content evolution over time, the resistance acquisition, and the impacts of mobile elements on genome plasticity.
Collapse
Affiliation(s)
- Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000 France
| | - Bashar Al-Nuaimi
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000 France
- Department of Computer Science, Diyala University, Diyala, 32001 Iraq
| | - Bassam AlKindy
- Department of Computer Science, Al-Mustansiriyah University, Baghdad, 10052 Iraq
| | - Jean-François Couchot
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000 France
| | - Michel Salomon
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000 France
| |
Collapse
|
27
|
Zappelini C, Alvarez-Lopez V, Capelli N, Guyeux C, Chalot M. Streptomyces Dominate the Soil Under Betula Trees That Have Naturally Colonized a Red Gypsum Landfill. Front Microbiol 2018; 9:1772. [PMID: 30123206 PMCID: PMC6085467 DOI: 10.3389/fmicb.2018.01772] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 07/16/2018] [Indexed: 11/13/2022] Open
Abstract
The successful restoration of well-engineered tailings storage facilities is needed to avoid mine tailings problems. This study characterized the bacterial communities from vegetated and non-vegetated soils from a red gypsum landfill resulting from the industrial extraction of titanium. A set of 275 bacteria was isolated from vegetated soil and non-vegetated soil areas and taxonomically characterized using BOX-PCR. The study also evaluated the ability of a subset of 88 isolated bacteria on their ability to produce plant growth promoting (PGP) traits [indoleacetic acid (IAA) production, phosphate solubilization, and siderophore production] and their tolerance to potentially toxic elements (PTEs). Twenty strains were chosen for further analysis to produce inoculum for birch-challenging experiments. Principal component analysis (PCA) showed that the set of pedological parameters (pH, granulometry, carbon, organic matter, and Mg content) alone explained approximately 40% of the differences between the two soils. The highest density of total culturable bacteria was found in the vegetated soil, and it was much higher than that in the non-vegetated soil. The Actinobacteria phyla dominated the culturable soil community (70% in vegetated soil and 95% in non-vegetated soil), while the phyla Firmicutes (including the genus Bacillus) and Bacteroides (including the genera Pedobacter and Olivibacter) were found only in the vegetated soil fraction. Additional genera (Rhizobium, Variovorax, and Ensifer) were found solely in the vegetated soil. The vegetated soil bacteria harbored the most beneficial PGP bacteria with 12% of the isolates showing three or more PGP traits. The strains with higher metal tolerances in our study were Phyllobacterium sp. WR140 (RO1.15), Phyllobacterium sp. WR140 (R01.34), and Streptomyces sp. (R04.15), all isolated from the vegetated soil. Among the isolates tested in challenging experiments, Phyllobacterium (R01.34) and Streptomyces sp. (R05.33) have the greatest potential to act as PGP rhizobacteria and therefore to be used in the biological restoration of tailings dumps.
Collapse
Affiliation(s)
- Cyril Zappelini
- Laboratoire Chrono-Environnement, UMR CNRS 6249, Université Bourgogne Franche-Comté, Montbéliard, France
| | - Vanessa Alvarez-Lopez
- Laboratoire Chrono-Environnement, UMR CNRS 6249, Université Bourgogne Franche-Comté, Montbéliard, France
| | - Nicolas Capelli
- Laboratoire Chrono-Environnement, UMR CNRS 6249, Université Bourgogne Franche-Comté, Montbéliard, France
| | - Christophe Guyeux
- Département d'Informatique des Systèmes Complexes, Institut FEMTO-ST (UMR 6174 CNRS), Université Bourgogne Franche-Comté, Besançon, France
| | - Michel Chalot
- Laboratoire Chrono-Environnement, UMR CNRS 6249, Université Bourgogne Franche-Comté, Montbéliard, France.,Faculté des Sciences et Technologies, Université de Lorraine, Nancy, France
| |
Collapse
|
28
|
Garnier R, Guyeux C, Couchot JF, Salomon M, Al-Nuaimi B, AlKindy B. Comparison of metaheuristics to measure gene effects on phylogenetic supports and topologies. BMC Bioinformatics 2018; 19:218. [PMID: 30066663 PMCID: PMC6069296 DOI: 10.1186/s12859-018-2172-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Background A huge and continuous increase in the number of completely sequenced chloroplast genomes, available for evolutionary and functional studies in plants, has been observed during the past years. Consequently, it appears possible to build large-scale phylogenetic trees of plant species. However, building such a tree that is well-supported can be a difficult task, even when a subset of close plant species is considered. Usually, the difficulty raises from a few core genes disturbing the phylogenetic information, due for example from problems of homoplasy. Fortunately, a reliable phylogenetic tree can be obtained once these problematic genes are identified and removed from the analysis.Therefore, in this paper we address the problem of finding the largest subset of core genomes which allows to build the best supported tree. Results As an exhaustive study of all core genes combination is untractable in practice, since the combinatorics of the situation made it computationally infeasible, we investigate three well-known metaheuristics to solve this optimization problem. More precisely, we design and compare distributed approaches using genetic algorithm, particle swarm optimization, and simulated annealing. The latter approach is a new contribution and therefore is described in details, whereas the two former ones have been already studied in previous works. They have been designed de novo in a new platform, and new experiments have been achieved on a larger set of chloroplasts, to compare together these three metaheuristics. Conclusions The ways genes affect both tree topology and supports are assessed using statistical tools like Lasso or dummy logistic regression, in an hybrid approach of the genetic algorithm. By doing so, we are able to provide the most supported trees based on the largest subsets of core genes.
Collapse
Affiliation(s)
- Régis Garnier
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000, France
| | - Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000, France.
| | - Jean-François Couchot
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000, France
| | - Michel Salomon
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000, France
| | - Bashar Al-Nuaimi
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Univ. Bourgogne Franche-Comté (UBFC), 16 Route de Gray, Besançon, 25000, France.,Department of Computer Science, Diyala University, Diyala, 32001, Iraq
| | - Bassam AlKindy
- Department of Computer Science, Mustansiriyah University, Baghdad, 10052, Iraq
| |
Collapse
|
29
|
Treepong P, Kos V, Guyeux C, Blanc D, Bertrand X, Valot B, Hocquet D. Global emergence of the widespread Pseudomonas aeruginosa ST235 clone. Clin Microbiol Infect 2018. [DOI: 10.1016/j.cmi.2017.06.018] [Citation(s) in RCA: 102] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
30
|
Bruneau M, Mottet T, Moulin S, Kerbiriou M, Chouly F, Chretien S, Guyeux C. A clustering package for nucleotide sequences using Laplacian Eigenmaps and Gaussian Mixture Model. Comput Biol Med 2017; 93:66-74. [PMID: 29288886 DOI: 10.1016/j.compbiomed.2017.12.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Revised: 12/08/2017] [Accepted: 12/09/2017] [Indexed: 11/25/2022]
Abstract
In this article, a new Python package for nucleotide sequences clustering is proposed. This package, freely available on-line, implements a Laplacian eigenmap embedding and a Gaussian Mixture Model for DNA clustering. It takes nucleotide sequences as input, and produces the optimal number of clusters along with a relevant visualization. Despite the fact that we did not optimise the computational speed, our method still performs reasonably well in practice. Our focus was mainly on data analytics and accuracy and as a result, our approach outperforms the state of the art, even in the case of divergent sequences. Furthermore, an a priori knowledge on the number of clusters is not required here. For the sake of illustration, this method is applied on a set of 100 DNA sequences taken from the mitochondrially encoded NADH dehydrogenase 3 (ND3) gene, extracted from a collection of Platyhelminthes and Nematoda species. The resulting clusters are tightly consistent with the phylogenetic tree computed using a maximum likelihood approach on gene alignment. They are coherent too with the NCBI taxonomy. Further test results based on synthesized data are then provided, showing that the proposed approach is better able to recover the clusters than the most widely used software, namely Cd-hit-est and BLASTClust.
Collapse
Affiliation(s)
- Marine Bruneau
- Laboratoire de Mathématiques de Besançon, UMR 6623 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France
| | - Thierry Mottet
- Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France
| | - Serge Moulin
- Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France.
| | - Maël Kerbiriou
- Laboratoire de Mathématiques de Besançon, UMR 6623 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France
| | - Franz Chouly
- Laboratoire de Mathématiques de Besançon, UMR 6623 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France
| | - Stéphane Chretien
- National Physical Laboratory, Hampton Road, Teddington, United Kingdom
| | - Christophe Guyeux
- Computer Science Department, FEMTO-ST Institute, UMR 6174 CNRS, France; Université de Bourgogne Franche-Comté, 16 route de Gray, 25030 Besançon, France
| |
Collapse
|
31
|
Guyeux C, Chrétien S, Côté NML, Bahi JM. Systematic investigations of gene effects on both topologies and supports: An Echinococcus illustration. J Bioinform Comput Biol 2017; 15:1750019. [PMID: 28874087 DOI: 10.1142/s0219720017500196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this paper, we propose a high performance computing toolbox implementing efficient statistical methods for the study of phylogenies. This toolbox, which implements logit models and LASSO-type penalties, gives a way to better understand, measure, and compare the impact of each gene on a global phylogeny. As an application, we study the Echinococcus phylogeny, which is often considered as a particularly difficult example. Mitochondrial and nuclear genomes (19 coding sequences) of nine Echinococcus species are considered in order to investigate the molecular phylogeny of this genus. First, we check that the 19 gene trees lead to 19 totally different unsupported topologies (a topology is the sister relationship when both branch lengths and supports are ignored in a phylogenetic tree), while using the 19 genes as a whole are not sufficient for estimating the phylogeny. In order to circumvent this issue and understand the impact of the genes, we computed 43,796 trees using combinations ranging from 13 to 19 genes. By doing so, 15 topologies are obtained. Four particular topologies, appearing more robust and frequent, are then selected for more precise investigation. Refining further our statistical analysis, a particularly robust topology is extracted. We also carefully demonstrate the influence of nuclear genes on the likelihood of the phylogeny.
Collapse
Affiliation(s)
- Christophe Guyeux
- * FEMTO-ST Institute, UMR 6174 CNRS, France.,† University of Bourgogne Franche-Comté, Besançon, France
| | - Stéphane Chrétien
- † University of Bourgogne Franche-Comté, Besançon, France.,‡ Laboratory of Mathematics, UMR 6623 CNRS, France
| | - Nathalie M-L Côté
- § Département d'Obstétrique & Gynécologie et Département, de Microbiologie et Infectiologie, Faculté de Médecine et, des Sciences de la Santé, Université de Sherbrooke et, Centre de Recherche du CHUS, Sherbrooke, QC, Canada
| | - Jacques M Bahi
- * FEMTO-ST Institute, UMR 6174 CNRS, France.,† University of Bourgogne Franche-Comté, Besançon, France
| |
Collapse
|
32
|
Moulin S, Seux N, Chrétien S, Guyeux C, Lerat E. Simulation-based estimation of branching models for LTR retrotransposons. Bioinformatics 2017; 33:320-326. [PMID: 28011770 DOI: 10.1093/bioinformatics/btw622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 09/23/2016] [Indexed: 11/12/2022] Open
Abstract
Motivation LTR retrotransposons are mobile elements that are able, like retroviruses, to copy and move inside eukaryotic genomes. In the present work, we propose a branching model for studying the propagation of LTR retrotransposons in these genomes. This model allows us to take into account both the positions and the degradation level of LTR retrotransposons copies. In our model, the duplication rate is also allowed to vary with the degradation level. Results Various functions have been implemented in order to simulate their spread and visualization tools are proposed. Based on these simulation tools, we have developed a first method to evaluate the parameters of this propagation model. We applied this method to the study of the spread of the transposable elements ROO, GYPSY and DM412 on a chromosome of Drosophila melanogaster . Availability and Implementation Our proposal has been implemented using Python software. Source code is freely available on the web at https://github.com/SergeMOULIN/retrotransposons-spread . Contact serge.moulin@univ-fcomte.fr. Supplementary information are available at Bioinformatics online.
Collapse
Affiliation(s)
- Serge Moulin
- Département d'Informatique des Systèmes Complexes, FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | - Nicolas Seux
- Laboratoire de Mathématiques de Besançon, UMR 6623 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | | | - Christophe Guyeux
- Département d'Informatique des Systèmes Complexes, FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne Franche-Comté, Besançon, France
| | - Emmanuelle Lerat
- Université de Lyon, Université Claude Bernard Lyon 1, CNRS, UMR 5558, Laboratoire Biométrie et Biologie Evolutive, Villeurbanne, France
| |
Collapse
|
33
|
Chrétien S, Guyeux C, Conesa B, Delage-Mouroux R, Jouvenot M, Huetz P, Descôtes F. A Bregman-proximal point algorithm for robust non-negative matrix factorization with possible missing values and outliers - application to gene expression analysis. BMC Bioinformatics 2016; 17 Suppl 8:284. [PMID: 27585655 PMCID: PMC5009666 DOI: 10.1186/s12859-016-1120-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Background Non-Negative Matrix factorization has become an essential tool for feature extraction in a wide spectrum of applications. In the present work, our objective is to extend the applicability of the method to the case of missing and/or corrupted data due to outliers. Results An essential property for missing data imputation and detection of outliers is that the uncorrupted data matrix is low rank, i.e. has only a small number of degrees of freedom. We devise a new version of the Bregman proximal idea which preserves nonnegativity and mix it with the Augmented Lagrangian approach for simultaneous reconstruction of the features of interest and detection of the outliers using a sparsity promoting ℓ1 penality. Conclusions An application to the analysis of gene expression data of patients with bladder cancer is finally proposed.
Collapse
Affiliation(s)
- Stéphane Chrétien
- National Physical Laboratory, Hampton Road, Teddington, Middlesex, UK.
| | - Christophe Guyeux
- FEMTO-ST Institute, UMR 6174 CNRS, DISC Computer Science Department, Université de Bourgogne Franche-Comté, 16, route de Gray, Besançon, 25000, France
| | - Bastien Conesa
- ISIFC, Université de Bourgogne Franche-Comté, 23, rue Alain Savary, Besançon, 25000, France
| | - Régis Delage-Mouroux
- EA 3922/IFR133, UFR Sciences et Techniques, Université de Bourgogne Franche-Comté, 16, route de Gray, Besançon, 25000, France
| | - Michèle Jouvenot
- EA 3922/IFR133, UFR Sciences et Techniques, Université de Bourgogne Franche-Comté, 16, route de Gray, Besançon, 25000, France
| | | | - Françoise Descôtes
- Service de Biochimie et Biologie Moléculaire Sud, Pavillon 3D, Centre Hospitalier Lyon Sud, Pierre Bénite, Lyon, Cedex 69495, France
| |
Collapse
|
34
|
Maillard F, Girardclos O, Assad M, Zappelini C, Pérez Mena JM, Yung L, Guyeux C, Chrétien S, Bigham G, Cosio C, Chalot M. Dendrochemical assessment of mercury releases from a pond and dredged-sediment landfill impacted by a chlor-alkali plant. Environ Res 2016; 148:122-126. [PMID: 27038833 DOI: 10.1016/j.envres.2016.03.034] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Revised: 03/09/2016] [Accepted: 03/24/2016] [Indexed: 05/15/2023]
Abstract
Although current Hg emissions from industrial activities may be accurately monitored, evidence of past releases to the atmosphere must rely on one or more environmental proxies. We used Hg concentrations in tree cores collected from poplars and willows to investigate the historical changes of Hg emissions from a dredged sediment landfill and compared them to a nearby control location. Our results demonstrated the potential value of using dendrochemistry to record historical Hg emissions from past industrial activities.
Collapse
Affiliation(s)
- François Maillard
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Olivier Girardclos
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Mohamad Assad
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Cyril Zappelini
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Julia Maria Pérez Mena
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Loïc Yung
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France
| | - Christophe Guyeux
- Département d'Informatique des Systèmes Complexes, Institut FEMTO-ST (UMR 6174 CNRS), Université de Bourgogne Franche-Comté, 16 Route de Gray, 25030 Besançon Cedex, France
| | - Stéphane Chrétien
- Laboratoire de Mathématiques de Besancon, Université de Bourgogne Franche-Comté, 16 Route de Gray, 25030 Besançon Cedex, France
| | - Gary Bigham
- Exponent, Inc., 15375 SE 30th Place, Suite 250, Bellevue, WA 98007, USA
| | - Claudia Cosio
- Institute F.-A. Forel, University of Geneva, 10 Route de Suisse, CP 416, CH-1290 Versoix, Switzerland
| | - Michel Chalot
- Laboratoire Chrono-Environnement (UMR 6249), Université de Bourgogne Franche-Comté, Pôle Universitaire du Pays de Montbéliard, 4 Place Tharradin, BP 71427, 25211 Montbéliard, France; Université de Lorraine, Faculté des Sciences et Technologies, BP 70239, 54506 Vandoeuvre-les-Nancy, France.
| |
Collapse
|
35
|
Abstract
In this paper, we explain why the chaotic mutation (CM) model of J. M. Bahi and C. Michel (2008) simulates the genes mutations over time with good accuracy. It is firstly shown that the CM model is a truly chaotic one, as it is defined by Devaney. Then, it is established that mutations occurring in genes mutations have indeed a same chaotic dynamic, thus making relevant the use of chaotic models for genomes evolution. Transposition and inversion dynamics are finally investigated.
Collapse
Affiliation(s)
- Jacques M. Bahi
- Institut FEMTO-ST, Université Bourgogne Franche-Comté, 25000 Besançon, France
| | - Christophe Guyeux
- Institut FEMTO-ST, Université Bourgogne Franche-Comté, 25000 Besançon, France
| | - Antoine Perasso
- UMR Chrono-Environnement, Université Bourgogne Franche-Comté, 25000 Besançon, France
| |
Collapse
|
36
|
Guillard T, Cholley P, Limelette A, Hocquet D, Matton L, Guyeux C, Lebreil AL, Bajolet O, Brasme L, Madoux J, Vernet-Garnier V, Barbe C, Bertrand X, de Champs On Behalf Of CarbaFrEst Group C. Fluoroquinolone Resistance Mechanisms and population structure of Enterobacter cloacae non-susceptible to Ertapenem in North-Eastern France. Front Microbiol 2015; 6:1186. [PMID: 26557115 PMCID: PMC4616961 DOI: 10.3389/fmicb.2015.01186] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 10/12/2015] [Indexed: 12/21/2022] Open
Abstract
Fluoroquinolone (FQ) agents are a potential resort to treat infection due to Enterobacteriaceae producing extended spectrum β-lactamase and susceptible to FQ. In a context of increase of non-susceptibility to carbapenems among Enterobacteriaceae, we characterized FQ resistance mechanisms in 75 Enterobacter cloacae isolates non-susceptible to ertapenem in North-Eastern France in 2012 and describe the population structure by pulsed field gel electrophoresis (PFGE) and multi-locus sequence typing (MLST). Among them, 14.7% (12/75) carried a carbapenemase-encoding gene. Except one isolate producing VIM-1, the carbapenemase-producing isolates carried the well-known IncL/M pOXA48a plasmid. Most of the isolates (59/75) harbored at least a FQ-R determinant. qnr genes were predominant (40%, 30/75). The MLST study revealed that E. cloacae isolates’ clonality was wide [24 different sequence types (STs)]. The more widespread STs were ST74, ST101, ST110, ST114, and ST133. Carbapenem MICs were higher for E. cloacae ST74 than for other E. cloacae isolates. Plasmid-mediated quinolone resistance determinants were more often observed in E. cloacae ST74 isolates. These findings showed that (i) pOXA-48a is spreading in North-Eastern France, (ii) qnr is preponderant in E. cloacae, (iii) E. cloacae comprised a large amount of lineages spreading in North-Eastern France, and (iv) FQ as an alternative to β-lactams to treat ertapenem non-susceptible Enterobacteriaceae are compromised.
Collapse
Affiliation(s)
- Thomas Guillard
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Pascal Cholley
- Service d'Hygiène Hospitalière, CHRU Besançon Besançon, France ; UMR 6249 Chrono-environnement, Université de Franche-Comté Besançon, France
| | - Anne Limelette
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Didier Hocquet
- Service d'Hygiène Hospitalière, CHRU Besançon Besançon, France ; UMR 6249 Chrono-environnement, Université de Franche-Comté Besançon, France
| | - Lucie Matton
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Christophe Guyeux
- UMR 6174 CNRS, Département d'Informatique des Systèmes Complexes, Université de Franche-Comté Belfort, France
| | - Anne-Laure Lebreil
- Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Odile Bajolet
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Lucien Brasme
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Janick Madoux
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France
| | - Véronique Vernet-Garnier
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| | - Coralie Barbe
- Centre de Recherche et d'Investigation Clinique, Hôpital Robert Debré, CHU Reims Reims, France
| | - Xavier Bertrand
- Service d'Hygiène Hospitalière, CHRU Besançon Besançon, France ; UMR 6249 Chrono-environnement, Université de Franche-Comté Besançon, France
| | - Christophe de Champs On Behalf Of CarbaFrEst Group
- Laboratoire de Bactériologie-Virologie-Hygiène, Hôpital Robert Debré, CHU Reims Reims, France ; Structure Fédérative de Recherche CAP-Santé, UFR Médecine, Université de Reims Champagne-Ardenne Reims, France
| |
Collapse
|
37
|
Guyeux C, Nicod JM, Philippe L, Bahi JM. The study of unfoldable self-avoiding walks — Application to protein structure prediction software. J Bioinform Comput Biol 2015; 13:1550009. [DOI: 10.1142/s0219720015500092] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Self-avoiding walks (SAWs) are the source of very difficult problems in probability and enumerative combinatorics. They are of great interest as, for example, they are the basis of protein structure prediction (PSP) in bioinformatics. The authors of this paper have previously shown that, depending on the prediction algorithm, the sets of obtained walk conformations differ: For example, all the SAWs can be generated using stretching-based algorithms whereas only the unfoldable SAWs can be obtained with methods that iteratively fold the straight line. A deeper study of (non-)unfoldable SAWs is presented in this paper. The contribution is first a survey of what is currently known about these sets. In particular, we provide clear definitions of various subsets of SAWs related to pivot moves (unfoldable and non-unfoldable SAWs, etc.) and the first results that we have obtained, theoretically or computationally, on these sets. Then a new theorem on the number of non-unfoldable SAWs is demonstrated. Finally, a list of open questions is provided and the consequences on the PSP problem is proposed.
Collapse
Affiliation(s)
- Christophe Guyeux
- FEMTO-ST Institute, Université de Franche-Comté/CNRS/ENSMM/UTBM, Besançon, France
| | - Jean-Marc Nicod
- FEMTO-ST Institute, Université de Franche-Comté/CNRS/ENSMM/UTBM, Besançon, France
| | - Laurent Philippe
- FEMTO-ST Institute, Université de Franche-Comté/CNRS/ENSMM/UTBM, Besançon, France
| | - Jacques M. Bahi
- FEMTO-ST Institute, Université de Franche-Comté/CNRS/ENSMM/UTBM, Besançon, France
| |
Collapse
|
38
|
Valot B, Guyeux C, Rolland JY, Mazouzi K, Bertrand X, Hocquet D. What It Takes to Be a Pseudomonas aeruginosa? The Core Genome of the Opportunistic Pathogen Updated. PLoS One 2015; 10:e0126468. [PMID: 25961859 PMCID: PMC4427113 DOI: 10.1371/journal.pone.0126468] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 04/02/2015] [Indexed: 11/19/2022] Open
Abstract
Pseudomonas aeruginosa is an opportunistic bacterial pathogen able to thrive in highly diverse ecological niches and to infect compromised patients. Its genome exhibits a mosaic structure composed of a core genome into which accessory genes are inserted en bloc at specific sites. The size and the content of the core genome are open for debate as their estimation depends on the set of genomes considered and the pipeline of gene detection and clustering. Here, we redefined the size and the content of the core genome of P. aeruginosa from fully re-analyzed genomes of 17 reference strains. After the optimization of gene detection and clustering parameters, the core genome was defined at 5,233 orthologs, which represented ~ 88% of the average genome. Extrapolation indicated that our panel was suitable to estimate the core genome that will remain constant even if new genomes are added. The core genome contained resistance determinants to the major antibiotic families as well as most metabolic, respiratory, and virulence genes. Although some virulence genes were accessory, they often related to conserved biological functions. Long-standing prophage elements were subjected to a genetic drift to eventually display a G+C content as higher as that of the core genome. This contrasts with the low G+C content of highly conserved ribosomal genes. The conservation of metabolic and respiratory genes could guarantee the ability of the species to thrive on a variety of carbon sources for energy in aerobiosis and anaerobiosis. Virtually all the strains, of environmental or clinical origin, have the complete toolkit to become resistant to the major antipseudomonal compounds and possess basic pathogenic mechanisms to infect humans. The knowledge of the genes shared by the majority of the P. aeruginosa isolates is a prerequisite for designing effective therapeutics to combat the wide variety of human infections.
Collapse
Affiliation(s)
- Benoît Valot
- UMR CNRS 6249, Chrono-environnement, Université de Franche-Comté, Besançon, France
| | - Christophe Guyeux
- UMR CNRS 6174, Institut FEMTO-ST, Département DISC, Université de Franche-Comté, Belfort, France
| | - Julien Yves Rolland
- UMR CNRS 6623, Laboratoire de Mathématiques de Besançon, Université de Franche-Comté, Besançon, France
| | - Kamel Mazouzi
- Mésocentre de calculs, Université de Franche-Comté, Besançon, France
| | - Xavier Bertrand
- UMR CNRS 6249, Chrono-environnement, Université de Franche-Comté, Besançon, France
- Laboratoire d’Hygiène Hospitalière, Centre Hospitalier Régional Universitaire, Besançon, France
| | - Didier Hocquet
- UMR CNRS 6249, Chrono-environnement, Université de Franche-Comté, Besançon, France
- Laboratoire d’Hygiène Hospitalière, Centre Hospitalier Régional Universitaire, Besançon, France
- * E-mail:
| |
Collapse
|
39
|
Elghazel W, Bahi J, Guyeux C, Hakem M, Medjaher K, Zerhouni N. Dependability of wireless sensor networks for industrial prognostics and health management. COMPUT IND 2015. [DOI: 10.1016/j.compind.2014.10.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
40
|
Haenni M, Hocquet D, Ponsin C, Cholley P, Guyeux C, Madec JY, Bertrand X. Population structure and antimicrobial susceptibility of Pseudomonas aeruginosa from animal infections in France. BMC Vet Res 2015; 11:9. [PMID: 25604773 PMCID: PMC4307146 DOI: 10.1186/s12917-015-0324-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Accepted: 01/12/2015] [Indexed: 12/03/2022] Open
Abstract
Background Pseudomonas aeruginosa is a major human pathogen, which also affects animals. It is thought that P. aeruginosa has a non-clonal epidemic population structure, with distinct isolates found in humans, animals or the environment. However, very little is known about the structure of the P. aeruginosa population from diseased animals. Data on antimicrobial resistance are also scarce. Results Thirty-four already registered and 19 new MLST profiles were identified. Interestingly, a few clones were more prevalent, and clones associated to human outbreaks were also detected. Multidrug resistance phenotypes were overall rare. Conclusion We highlight the non clonal structure of the population and show a higher prevalence of specific clones, possibly correlating with higher pathogenicity. The low proportion of antimicrobial resistance contrasts with the high resistance rate of human isolates. Electronic supplementary material The online version of this article (doi:10.1186/s12917-015-0324-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marisa Haenni
- Agence Nationale de Sécurité Sanitaire (Anses), Unité Antibiorésistance et Virulence Bactériennes, Lyon, France.
| | - Didier Hocquet
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France.
| | - Cécile Ponsin
- Agence Nationale de Sécurité Sanitaire (Anses), Unité Antibiorésistance et Virulence Bactériennes, Lyon, France.
| | - Pascal Cholley
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France.
| | - Christophe Guyeux
- Département DISC, Institut FEMTO-ST, UMR 6174 CNRS, Université de Franche-Comté, Belfort Cedex, France.
| | - Jean-Yves Madec
- Agence Nationale de Sécurité Sanitaire (Anses), Unité Antibiorésistance et Virulence Bactériennes, Lyon, France.
| | - Xavier Bertrand
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France.
| |
Collapse
|
41
|
Guyeux C, Makhoul A, Atoui I, Tawbi S, Bahi JM. A Complete Security Framework for Wireless Sensor Networks. International Journal of Information Technology and Web Engineering 2015. [DOI: 10.4018/ijitwe.2015010103] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Wireless sensor networks are often deployed in public or otherwise untrusted and even hostile environments, which prompt a number of security issues. Although security is a necessity in other types of networks, it is much more so in sensor networks due to the resource-constraint, susceptibility to physical capture, and wireless nature. Till now, most of the security approaches proposed for sensor networks present single solution for particular and single problem. Therefore, to address the special security needs of sensor networks as a whole we introduce a security framework. In their framework, the authors emphasize the following areas: (1) secure communication infrastructure, (2) secure scheduling, and (3) a secure data aggregation algorithm. Due to resource constraints, specific strategies are often necessary to preserve the network's lifetime and its quality of service. For instance, to reduce communication costs, data can be aggregated through the network, or nodes can go to sleep mode periodically (nodes scheduling). These strategies must be proven as secure, but protocols used to guarantee this security must be compatible with the resource preservation requirement. To achieve this goal, secure communications in such networks will be defined, together with the notions of secure scheduling and secure aggregation. The concepts of indistinguability, nonmalleability, and message detection resistance will thus be adapted to communications in wireless sensor networks. Finally, some of these security properties will be evaluated in concrete case studies.
Collapse
Affiliation(s)
| | | | - Ibrahim Atoui
- Faculty of Sciences, Lebanese University, Hadath, Lebanon
| | - Samar Tawbi
- Faculty of Sciences, Lebanese University, Hadath, Lebanon
| | - Jacques M. Bahi
- University of Franche-Comté (FEMTO-ST), Belfort Cedex, France
| |
Collapse
|
42
|
Cholley P, Ka R, Guyeux C, Thouverez M, Guessennd N, Ghebremedhin B, Frank T, Bertrand X, Hocquet D. Population structure of clinical Pseudomonas aeruginosa from West and Central African countries. PLoS One 2014; 9:e107008. [PMID: 25187957 PMCID: PMC4154784 DOI: 10.1371/journal.pone.0107008] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Accepted: 08/06/2014] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Pseudomonas aeruginosa (PA) has a non-clonal, epidemic population with a few widely distributed and frequently encountered sequence types (STs) called 'high-risk clusters'. Clinical P. aeruginosa (clinPA) has been studied in all inhabited continents excepted in Africa, where a very few isolates have been analyzed. Here, we characterized a collection of clinPA isolates from four countries of West and Central Africa. METHODOLOGY 184 non-redundant isolates of clinPA from hospitals of Senegal, Ivory Coast, Nigeria, and Central African Republic were genotyped by MLST. We assessed their resistance level to antibiotics by agar diffusion and identified the extended-spectrum β-lactamases (ESBLs) and metallo-β-lactamases (MBLs) by sequencing. The population structure of the species was determined by a nucleotide-based analysis of the entire PA MLST database and further localized on the phylogenetic tree (i) the sequence types (STs) of the present collection, (ii) the STs by continents, (iii) ESBL- and MBL-producing STs from the MLST database. PRINCIPAL FINDINGS We found 80 distinct STs, of which 24 had no relationship with any known STs. 'High-risk' international clonal complexes (CC155, CC244, CC235) were frequently found in West and Central Africa. The five VIM-2-producing isolates belonged to CC233 and CC244. GES-1 and GES-9 enzymes were produced by one CC235 and one ST1469 isolate, respectively. We showed the spread of 'high-risk' international clonal complexes, often described as multidrug-resistant on other continents, with a fully susceptible phenotype. The MBL- and ESBL-producing STs were scattered throughout the phylogenetic tree and our data suggest a poor association between a continent and a specific phylogroup. CONCLUSIONS ESBL- and MBL-encoding genes are borne by both successful international clonal complexes and distinct local STs in clinPA of West and Central Africa. Furthermore, our data suggest that the spread of a ST could be either due to its antibiotic resistance or to features independent from the resistance to antibiotics.
Collapse
Affiliation(s)
- Pascal Cholley
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France
| | - Roughyatou Ka
- Laboratoire de Bactériologie, Centre Hospitalier National Universitaire de Fann, Dakar, Senegal
| | - Christophe Guyeux
- Département d'Informatique des Systèmes Complexes, UMR 6174 CNRS, Université de Franche-Comté, Belfort, France
| | - Michelle Thouverez
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France
| | | | | | | | - Xavier Bertrand
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France
| | - Didier Hocquet
- Laboratoire d'Hygiène Hospitalière, UMR 6249 CNRS Chrono-environnement, Université de Franche-Comté, Centre Hospitalier Universitaire, Besançon, France
- Centre de Ressources Biologiques Ferdinand Cabanne – Filière microbiologie, Centre Hospitalier Régional Universitaire, Besançon, France
- * E-mail:
| |
Collapse
|
43
|
Liapis E, Pantel A, Robert J, Nicolas-Chanoine MH, Cavalié L, van der Mee-Marquet N, de Champs C, Aissa N, Eloy C, Blanc V, Guyeux C, Hocquet D, Lavigne JP, Bertrand X. Molecular epidemiology of OXA-48-producing Klebsiella pneumoniae in France. Clin Microbiol Infect 2014; 20:O1121-3. [PMID: 24942039 DOI: 10.1111/1469-0691.12727] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Revised: 06/11/2014] [Accepted: 06/15/2014] [Indexed: 11/27/2022]
Abstract
We characterized 53 OXA-48-producing Klebsiella pneumoniae (OXA-48-Kp) isolated between 2011 and 2013 in 21 French hospitals. All the isolates were genotyped using MLST and PFGE and the population structure of the species was determined by a nucleotide-based analysis of the entire K. pneumoniae MLST database. Most of the OXA-48-Kp isolates also produced CTX-M-15 and remained susceptible to imipenem and meropenem. The isolates were distributed into 20 STs, of which five were dominant (ST15, ST101, ST147, ST395 and ST405). All the OXA-48-Kp clustered in the major clade of K. pneumoniae KpI.
Collapse
Affiliation(s)
- E Liapis
- Service d'Hygiène Hospitalière, UMR 6249 Chrono-environnement, Centre Hospitalier Régional Universitaire, Université de Franche-Comté, Besançon, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Bréchet C, Plantin J, Sauget M, Thouverez M, Talon D, Cholley P, Guyeux C, Hocquet D, Bertrand X. Wastewater Treatment Plants Release Large Amounts of Extended-Spectrum β-Lactamase–Producing Escherichia coli Into the Environment. Clin Infect Dis 2014; 58:1658-65. [DOI: 10.1093/cid/ciu190] [Citation(s) in RCA: 116] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
|
45
|
Fang X, Wang Q, Guyeux C, Bahi JM. FPGA acceleration of a pseudorandom number generator based on chaotic iterations. Journal of Information Security and Applications 2014. [DOI: 10.1016/j.jisa.2014.02.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
46
|
Bahi JM, Guyeux C, Mazouzi K, Philippe L. Computational investigations of folded self-avoiding walks related to protein folding. Comput Biol Chem 2013; 47:246-56. [DOI: 10.1016/j.compbiolchem.2013.10.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 10/01/2013] [Indexed: 11/25/2022]
|
47
|
Bahi JM, Fang X, Guyeux C, Larger L. FPGA Design for Pseudorandom Number Generator Based on Chaotic Iteration used in Information Hiding Application. ACTA ACUST UNITED AC 2013. [DOI: 10.12785/amis/070607] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
48
|
Bahi JM, Couchot JF, Guyeux C, Salomon M. Neural networks and chaos: construction, evaluation of chaotic networks, and prediction of chaos with multilayer feedforward networks. Chaos 2012; 22:013122. [PMID: 22462998 DOI: 10.1063/1.3685524] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Many research works deal with chaotic neural networks for various fields of application. Unfortunately, up to now, these networks are usually claimed to be chaotic without any mathematical proof. The purpose of this paper is to establish, based on a rigorous theoretical framework, an equivalence between chaotic iterations according to Devaney and a particular class of neural networks. On the one hand, we show how to build such a network, on the other hand, we provide a method to check if a neural network is a chaotic one. Finally, the ability of classical feedforward multilayer perceptrons to learn sets of data obtained from a dynamical system is regarded. Various boolean functions are iterated on finite states. Iterations of some of them are proven to be chaotic as it is defined by Devaney. In that context, important differences occur in the training process, establishing with various neural networks that chaotic behaviors are far more difficult to learn.
Collapse
Affiliation(s)
- Jacques M Bahi
- FEMTO-ST Institute, UMR 6174 CNRS, University of Franche-Comté, IUT de Belfort-Montbéliard, BP 527, 90016 Belfort Cedex, France
| | | | | | | |
Collapse
|