1
|
Buton N, Coste F, Le Cunff Y. Predicting enzymatic function of protein sequences with attention. Bioinformatics 2023; 39:btad620. [PMID: 37874958 PMCID: PMC10612403 DOI: 10.1093/bioinformatics/btad620] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 09/11/2023] [Accepted: 10/22/2023] [Indexed: 10/26/2023] Open
Abstract
MOTIVATION There is a growing number of available protein sequences, but only a limited amount has been manually annotated. For example, only 0.25% of all entries of UniProtKB are reviewed by human annotators. Further developing automatic tools to infer protein function from sequence alone can alleviate part of this gap. In this article, we investigate the potential of Transformer deep neural networks on a specific case of functional sequence annotation: the prediction of enzymatic classes. RESULTS We show that our EnzBert transformer models, trained to predict Enzyme Commission (EC) numbers by specialization of a protein language model, outperforms state-of-the-art tools for monofunctional enzyme class prediction based on sequences only. Accuracy is improved from 84% to 95% on the prediction of EC numbers at level two on the EC40 benchmark. To evaluate the prediction quality at level four, the most detailed level of EC numbers, we built two new time-based benchmarks for comparison with state-of-the-art methods ECPred and DeepEC: the macro-F1 score is respectively improved from 41% to 54% and from 20% to 26%. Finally, we also show that using a simple combination of attention maps is on par with, or better than, other classical interpretability methods on the EC prediction task. More specifically, important residues identified by attention maps tend to correspond to known catalytic sites. Quantitatively, we report a max F-Gain score of 96.05%, while classical interpretability methods reach 91.44% at best. AVAILABILITY AND IMPLEMENTATION Source code and datasets are respectively available at https://gitlab.inria.fr/nbuton/tfpc and https://doi.org/10.5281/zenodo.7253910.
Collapse
Affiliation(s)
- Nicolas Buton
- Univ Rennes, Inria, CNRS, IRISA—UMR 6074, Rennes 35000, France
| | - François Coste
- Univ Rennes, Inria, CNRS, IRISA—UMR 6074, Rennes 35000, France
| | - Yann Le Cunff
- Univ Rennes, Inria, CNRS, IRISA—UMR 6074, Rennes 35000, France
| |
Collapse
|
2
|
Caranfil A, Le Cunff Y, Kervrann C. BayesTICS: Local temporal image correlation spectroscopy and Bayesian simulation technique for sparse estimation of diffusion in fluorescence imaging. Biol Imaging 2023; 3:e5. [PMID: 38487689 PMCID: PMC10936362 DOI: 10.1017/s2633903x23000041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 10/29/2022] [Accepted: 01/28/2023] [Indexed: 03/17/2024]
Abstract
The dynamics and fusion of vesicles during the last steps of exocytosis are not well established yet in cell biology. An open issue is the characterization of the diffusion process at the plasma membrane. Total internal reflection fluorescence microscopy (TIRFM) has been successfully used to analyze the coordination of proteins involved in this mechanism. It enables to capture dynamics of proteins with high frame rate and reasonable signal-to-noise values. Nevertheless, methodological approaches that can analyze and estimate diffusion in local small areas at the scale of a single diffusing spot within cells, are still lacking. To address this issue, we propose a novel correlation-based method for local diffusion estimation. As a starting point, we consider Fick's second law of diffusion that relates the diffusive flux to the gradient of the concentration. Then, we derive an explicit parametric model which is further fitted to time-correlation signals computed from regions of interest (ROI) containing individual spots. Our modeling and Bayesian estimation framework are well appropriate to represent isolated diffusion events and are robust to noise, ROI sizes, and localization of spots in ROIs. The performance of BayesTICS is shown on both synthetic and real TIRFM images depicting Transferrin Receptor proteins.
Collapse
Affiliation(s)
- Anca Caranfil
- SERPICO Project-Team, INRIA Rennes, UMR144 CNRS Institut Curie, PSL Research, Sorbonne Université, Campus universitaire de Beaulieu, Rennes, France
- CeDRE Team, GDR UMR6290-CNRS, Faculty of Medicine, University of Rennes 1, Rennes, France
| | - Yann Le Cunff
- CeDRE Team, GDR UMR6290-CNRS, Faculty of Medicine, University of Rennes 1, Rennes, France
- Dyliss Team, Univ Rennes, CNRS, Inria, IRISA, UMR 6074, Campus de Beaulieu, Rennes, France
| | - Charles Kervrann
- SERPICO Project-Team, INRIA Rennes, UMR144 CNRS Institut Curie, PSL Research, Sorbonne Université, Campus universitaire de Beaulieu, Rennes, France
| |
Collapse
|
3
|
Buffet-Bataillon S, Bouguen G, Fleury F, Cattoir V, Le Cunff Y. Gut microbiota analysis for prediction of clinical relapse in Crohn's disease. Sci Rep 2022; 12:19929. [PMID: 36402792 PMCID: PMC9675750 DOI: 10.1038/s41598-022-23757-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 11/04/2022] [Indexed: 11/20/2022] Open
Abstract
The role of intestinal bacterial microbiota has been described as key in the pathophysiology of Crohn's disease (CD). CD is characterized by frequent relapses after periods of remission which are not entirely understood. In this paper, we investigate whether the heterogeneity in microbiota profiles in CD patients could be a suitable predictor for these relapses. This prospective observational study involved 259 CD patients, in which 41 provided an additional total of 62 consecutive fecal samples, with an average interval of 25 weeks in between each of these samples. Fecal microbiota was analyzed by massive genomic sequencing through 16 S rRNA amplicon sampling. We found that our 259 CD patients could be split into three distinct subgroups of microbiota (G1, G2, G3). From G1 to G3, we noticed a progressive decrease in alpha diversity (p ≤ 0.0001) but no change in the fecal calprotectin (FC) level. Focusing on the 103 consecutive samples from 41 CD patients, we showed that the patients microbiota profiles were remarkably stable over time and associated with increasing symptom severity. Investigating further this microbiota/severity association revealed that the first signs of aggravation are (1) a loss of the main anti-inflammatory Short-Chain Fatty Acids (SCFAs) Roseburia, Eubacterium, Subdoligranumum, Ruminococcus (P < 0.05), (2) an increase in pro-inflammatory pathogens Proteus, Finegoldia (P < 0.05) while (3) an increase of other minor SCFA producers such as Ezakiella, Anaerococcus, Megasphaera, Anaeroglobus, Fenollaria (P < 0.05). Further aggravation of clinical signs is significantly linked to the subsequent loss of these minor SCFAs species and to an increase in other proinflammatory Proteobacteria such as Klebsiella, Pseudomonas, Salmonella, Acinetobacter, Hafnia and proinflammatory Firmicutes such as Staphylococcus, Enterococcus, Streptococcus. (P < 0.05). To our knowledge, this is the first study (1) specifically identifying subgroups of microbiota profiles in CD patients, (2) relating these groups to the evolution of symptoms over time and (3) showing a two-step process in CD symptoms' worsening. This paves the way towards a better understanding of patient-to-patient heterogeneity, as well as providing early warning signals of future aggravation of the symptoms and eventually adapting empirically treatments.
Collapse
Affiliation(s)
- Sylvie Buffet-Bataillon
- grid.410368.80000 0001 2191 9284INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Guillaume Bouguen
- grid.410368.80000 0001 2191 9284CIC 1414, INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - François Fleury
- grid.410368.80000 0001 2191 9284INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Vincent Cattoir
- grid.410368.80000 0001 2191 9284U1230, INSERM, CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Yann Le Cunff
- grid.410368.80000 0001 2191 9284Dyliss - Dynamics, Logics and Inference for biological Systems and Sequences, Inria Rennes – Bretagne Atlantique, Université Rennes 1, Rennes, France
| |
Collapse
|
4
|
Bouvrais H, Chesneau L, Le Cunff Y, Fairbrass D, Soler N, Pastezeur S, Pécot T, Kervrann C, Pécréaux J. The coordination of spindle-positioning forces during the asymmetric division of the Caenorhabditis elegans zygote. EMBO Rep 2021; 22:e50770. [PMID: 33900015 DOI: 10.15252/embr.202050770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 02/22/2021] [Accepted: 03/03/2021] [Indexed: 12/28/2022] Open
Abstract
In Caenorhabditis elegans zygote, astral microtubules generate forces essential to position the mitotic spindle, by pushing against and pulling from the cortex. Measuring microtubule dynamics there, we revealed the presence of two populations, corresponding to pulling and pushing events. It offers a unique opportunity to study, under physiological conditions, the variations of both spindle-positioning forces along space and time. We propose a threefold control of pulling force, by polarity, spindle position and mitotic progression. We showed that the sole anteroposterior asymmetry in dynein on-rate, encoding pulling force imbalance, is sufficient to cause posterior spindle displacement. The positional regulation, reflecting the number of microtubule contacts in the posterior-most region, reinforces this imbalance only in late anaphase. Furthermore, we exhibited the first direct proof that dynein processivity increases along mitosis. It reflects the temporal control of pulling forces, which strengthens at anaphase onset following mitotic progression and independently from chromatid separation. In contrast, the pushing force remains constant and symmetric and contributes to maintaining the spindle at the cell centre during metaphase.
Collapse
Affiliation(s)
| | | | - Yann Le Cunff
- CNRS, IGDR - UMR 6290, University of Rennes, Rennes, France
| | | | - Nina Soler
- CNRS, IGDR - UMR 6290, University of Rennes, Rennes, France
| | | | - Thierry Pécot
- INRIA, Centre Rennes - Bretagne Atlantique, Rennes, France
| | | | | |
Collapse
|
5
|
Kusters R, Misevic D, Berry H, Cully A, Le Cunff Y, Dandoy L, Díaz-Rodríguez N, Ficher M, Grizou J, Othmani A, Palpanas T, Komorowski M, Loiseau P, Moulin Frier C, Nanini S, Quercia D, Sebag M, Soulié Fogelman F, Taleb S, Tupikina L, Sahu V, Vie JJ, Wehbi F. Interdisciplinary Research in Artificial Intelligence: Challenges and Opportunities. Front Big Data 2020; 3:577974. [PMID: 33693418 PMCID: PMC7931862 DOI: 10.3389/fdata.2020.577974] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 10/28/2020] [Indexed: 11/25/2022] Open
Abstract
The use of artificial intelligence (AI) in a variety of research fields is speeding up multiple digital revolutions, from shifting paradigms in healthcare, precision medicine and wearable sensing, to public services and education offered to the masses around the world, to future cities made optimally efficient by autonomous driving. When a revolution happens, the consequences are not obvious straight away, and to date, there is no uniformly adapted framework to guide AI research to ensure a sustainable societal transition. To answer this need, here we analyze three key challenges to interdisciplinary AI research, and deliver three broad conclusions: 1) future development of AI should not only impact other scientific domains but should also take inspiration and benefit from other fields of science, 2) AI research must be accompanied by decision explainability, dataset bias transparency as well as development of evaluation methodologies and creation of regulatory agencies to ensure responsibility, and 3) AI education should receive more attention, efforts and innovation from the educational and scientific communities. Our analysis is of interest not only to AI practitioners but also to other researchers and the general public as it offers ways to guide the emerging collaborations and interactions toward the most fruitful outcomes.
Collapse
Affiliation(s)
- Remy Kusters
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Dusan Misevic
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | | | | | | | - Loic Dandoy
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Natalia Díaz-Rodríguez
- Inria Flowers, Paris and Bordeaux, France
- ENSTA Paris, Institut Polytechnique Paris, Paris, France
| | - Marion Ficher
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Jonathan Grizou
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Alice Othmani
- Université Paris-Est, LISSI, Vitry sur Seine, France
| | - Themis Palpanas
- Université de Paris, France and French University Institute (IUF), Paris, France
| | | | - Patrick Loiseau
- Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG, Grenoble, France
| | | | - Santino Nanini
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | | | - Michele Sebag
- TAU, LRI-CNRS–INRIA, Universite Paris-Saclay, France
| | | | - Sofiane Taleb
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | - Liubov Tupikina
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
- Nokia Bell Labs, Paris, France
| | - Vaibhav Sahu
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| | | | - Fatima Wehbi
- INSERM U1284, Université de Paris, Center for Research and Interdisciplinarity (CRI), Paris, France
| |
Collapse
|
6
|
Bidaud-Meynard A, Nicolle O, Heck M, Le Cunff Y, Michaux G. A V0-ATPase-dependent apical trafficking pathway maintains the polarity of the intestinal absorptive membrane. Development 2019; 146:dev.174508. [PMID: 31110027 DOI: 10.1242/dev.174508] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 05/07/2019] [Indexed: 12/18/2022]
Abstract
Intestine function relies on the strong polarity of intestinal epithelial cells and the array of microvilli forming a brush border at their luminal pole. Combining a genetic RNA interference (RNAi) screen with in vivo super-resolution imaging in the Caenorhabditis elegans intestine, we found that the V0 sector of the vacuolar ATPase (V0-ATPase) controls a late apical trafficking step, involving Ras-related protein 11 (RAB-11)+ endosomes and the N-ethylmaleimide-sensitive factor-attachment protein receptor (SNARE) synaptosome-associated protein 29 (SNAP-29), and is necessary to maintain the polarized localization of both apical polarity modules and brush border proteins. We show that the V0-ATPase pathway also genetically interacts with glycosphingolipids and clathrin in enterocyte polarity maintenance. Finally, we demonstrate that silencing of the V0-ATPase fully recapitulates the severe structural, polarity and trafficking defects observed in enterocytes from individuals with microvillus inclusion disease (MVID) and use this new in vivo MVID model to follow the dynamics of microvillus inclusions. Thus, we describe a new function for V0-ATPase in apical trafficking and epithelial polarity maintenance and the promising use of the C. elegans intestine as an in vivo model to better understand the molecular mechanisms of rare genetic enteropathies.
Collapse
Affiliation(s)
- Aurélien Bidaud-Meynard
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, F-35000 Rennes, France
| | - Ophélie Nicolle
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, F-35000 Rennes, France
| | - Markus Heck
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, F-35000 Rennes, France
| | - Yann Le Cunff
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, F-35000 Rennes, France
| | - Grégoire Michaux
- Univ Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes) - UMR 6290, F-35000 Rennes, France
| |
Collapse
|
7
|
Le Cunff Y, Pakdaman K. Reproduction cost reduces demographic stochasticity and enhances inter-individual compatibility. J Theor Biol 2014; 360:263-270. [PMID: 25036438 DOI: 10.1016/j.jtbi.2014.07.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Revised: 07/03/2014] [Accepted: 07/07/2014] [Indexed: 10/25/2022]
Abstract
A population׳s survival depends on its ability to adapt to constraints impinging upon it. As such, adaptation is at the heart of an increasing number of theoretical models. In this paper, we propose a bottom-up evolutionary model to explore the relationship between individual evolutionary dynamics and population-level survival. To do so, we extend a well-established model of gene network evolution by introducing a cost for reproduction. As a result population sizes fluctuate and populations can even go extinct. We find that if a population survives a small and critical number of generations, it will reach a quasi-stationary state which ensures long-term survival. In a constant environment, individual adaptation occurs in response to changes in a populations genetic composition. We show that genetic compatibility increases over generations as a by-product of selection for robustness, thus preventing extinction. We also demonstrate that the number of reproductive opportunities per individual, initial population size, and mutation rates all influence population survival. Finally, mixing different populations reveals that individual properties of gene networks co-evolve with the genetic composition of the population in order to maximize an individuals reproductive success.
Collapse
Affiliation(s)
- Yann Le Cunff
- Institut Jacques Monod, CNRS UMR 7592, Univ Paris Diderot, Paris Cité Sorbonne, F-750205 Paris, France; Max Planck Research Group on 'Modeling the Evolution of Aging', 18057 Rostock, Germany.
| | - Khashayar Pakdaman
- Institut Jacques Monod, CNRS UMR 7592, Univ Paris Diderot, Paris Cité Sorbonne, F-750205 Paris, France
| |
Collapse
|
8
|
Le Cunff Y, Baudisch A, Pakdaman K. How evolving heterogeneity distributions of resource allocation strategies shape mortality patterns. PLoS Comput Biol 2013; 9:e1002825. [PMID: 23341758 PMCID: PMC3547821 DOI: 10.1371/journal.pcbi.1002825] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2012] [Accepted: 10/19/2012] [Indexed: 11/19/2022] Open
Abstract
It is well established that individuals age differently. Yet the nature of these inter-individual differences is still largely unknown. For humans, two main hypotheses have been recently formulated: individuals may experience differences in aging rate or aging timing. This issue is central because it directly influences predictions for human lifespan and provides strong insights into the biological determinants of aging. In this article, we propose a model which lets population heterogeneity emerge from an evolutionary algorithm. We find that whether individuals differ in (i) aging rate or (ii) timing leads to different emerging population heterogeneity. Yet, in both cases, the same mortality patterns are observed at the population level. These patterns qualitatively reproduce those of yeasts, flies, worms and humans. Such findings, supported by an extensive parameter exploration, suggest that mortality patterns across species and their potential shapes belong to a limited and robust set of possible curves. In addition, we use our model to shed light on the notion of subpopulations, link population heterogeneity with the experimental results of stress induction experiments and provide predictions about the expected mortality patterns. As biology is moving towards the study of the distribution of individual-based measures, the model and framework we propose here paves the way for evolutionary interpretations of empirical and experimental data linking the individual level to the population level.
Collapse
Affiliation(s)
- Yann Le Cunff
- Institut Jacques Monod, CNRS UMR 7592, Univ Paris Diderot, Paris Cité Sorbonne, Paris, France.
| | | | | |
Collapse
|
9
|
Le Cunff Y, Pakdaman K. Phenotype–genotype relation in Wagner's canalization model. J Theor Biol 2012; 314:69-83. [DOI: 10.1016/j.jtbi.2012.08.020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2011] [Revised: 06/22/2012] [Accepted: 08/20/2012] [Indexed: 02/04/2023]
|