1
|
Ana DP, O SJ, Flavia T, Zhang Y, Jorge FL. Longitudinal host-microbiome dynamics of metatranscription identify hallmarks of progression in periodontitis. MICROBIOME 2025; 13:119. [PMID: 40369640 PMCID: PMC12077055 DOI: 10.1186/s40168-025-02108-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2025] [Accepted: 04/08/2025] [Indexed: 05/16/2025]
Abstract
BACKGROUND In periodontitis, the interplay between the host and microbiome generates a self-perpetuating cycle of inflammation of tooth-supporting tissues, potentially leading to tooth loss. Despite increasing knowledge of the phylogenetic compositional changes of the periodontal microbiome, the current understanding of in situ activities of the oral microbiome and the interactions among community members and with the host is still limited. Prior studies on the subgingival plaque metatranscriptome have been cross-sectional, allowing for only a snapshot of a highly variable microbiome, and do not include the transcriptome profiles from the host, a critical element in the progression of the disease. RESULTS To identify the host-microbiome interactions in the subgingival milieu that lead to periodontitis progression, we conducted a longitudinal analysis of the host-microbiome metatranscriptome from clinically stable and progressing sites in 15 participants over 1 year. Our research uncovered a distinct timeline of activities of microbial and host responses linked to disease progression, revealing a significant clinical and metabolic change point (the moment in time when the statistical properties of a time series change) at the 6-month mark of the study, with 1722 genes differentially expressed (DE) in the host and 111,705 in the subgingival microbiome. Genes associated with immune response, especially antigen presentation genes, were highly up-regulated in stable sites before the 6-month change point but not in the progressing sites. Activation of cobalamin, porphyrin, and motility in the microbiome contribute to the progression of the disease. Conversely, inhibition of lipopolysaccharide and glycosphingolipid biosynthesis in stable sites coincided with increased immune response. Correlation delay analysis revealed that the positive feedback loop of activities leading to progression consists of immune regulation and response activation in the host that leads to an increase in potassium ion transport and cobalamin biosynthesis in the microbiome, which in turn induces the immune response. Causality analysis identified two clusters of microbiome genes whose progression can accurately predict the outcomes at specific sites with high confidence (AUC = 0.98095 and 0.97619). CONCLUSIONS A specific timeline of host-microbiome activities characterizes the progression of the disease. The metabolic activities of the dysbiotic microbiome and the host are responsible for the positive feedback loop of reciprocally reinforced interactions leading to progression and tissue destruction. Video Abstract.
Collapse
Affiliation(s)
- Duran-Pinedo Ana
- Department of Oral Biology, University of Florida, College of Dentistry, 1395 Center Drive Gainesville, Gainesville, FL, 32610 - 0424, USA
| | - Solbiati Jose O
- Department of Oral Biology, University of Florida, College of Dentistry, 1395 Center Drive Gainesville, Gainesville, FL, 32610 - 0424, USA
| | - Teles Flavia
- Department of Basic & Translational Sciences, University of Pennsylvania, School of Dental Medicine, 240 South 40 Street, Philadelphia, PA, 19104 - 6030, USA
- Center for Innovation and Precision Dentistry (CiPD), University of Pennsylvania, School of Dental Medicine, 240 South 40 Street, Philadelphia, PA, 19104 - 6030, USA
| | - Yanping Zhang
- Gene Expression & Genotyping Core, Interdisciplinary Center for Biotechnology Research, University of Florida, 178 B CGRC, 2033 Mowry Road, Gainesville, FL, 32610, USA
| | - Frias-Lopez Jorge
- Department of Oral Biology, University of Florida, College of Dentistry, 1395 Center Drive Gainesville, Gainesville, FL, 32610 - 0424, USA.
| |
Collapse
|
2
|
Deka H, Pawar A, Battula M, Ghfar AA, Assal ME, Chikhale RV. Identification and Design of Novel Potential Antimicrobial Peptides Targeting Mycobacterial Protein Kinase PknB. Protein J 2024; 43:858-868. [PMID: 39014259 PMCID: PMC11345320 DOI: 10.1007/s10930-024-10218-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/21/2024] [Indexed: 07/18/2024]
Abstract
Antimicrobial peptides have gradually gained advantages over small molecule inhibitors for their multifunctional effects, synthesising accessibility and target specificity. The current study aims to determine an antimicrobial peptide to inhibit PknB, a serine/threonine protein kinase (STPK), by binding efficiently at the helically oriented hinge region. A library of 5626 antimicrobial peptides from publicly available repositories has been prepared and categorised based on the length. Molecular docking using ADCP helped to find the multiple conformations of the subjected peptides. For each peptide served as input the tool outputs 100 poses of the subjected peptide. To maintain an efficient binding for relatively a longer duration, only those peptides were chosen which were seen to bind constantly to the active site of the receptor protein over all the poses observed. Each peptide had different number of constituent amino acid residues; the peptides were classified based on the length into five groups. In each group the peptide length incremented upto four residues from the initial length form. Five peptides were selected for Molecular Dynamic simulation in Gromacs based on higher binding affinity. Post-dynamic analysis and the frame comparison inferred that neither the shorter nor the longer peptide but an intermediate length of 15 mer peptide bound well to the receptor. Residual substitution to the selected peptides was performed to enhance the targeted interaction. The new complexes considered were further analysed using the Elastic Network Model (ENM) for the functional site's intrinsic dynamic movement to estimate the new peptide's role. The study sheds light on prospects that besides the length of peptides, the combination of constituent residues equally plays a pivotal role in peptide-based inhibitor generation. The study envisages the challenges of fine-tuned peptide recovery and the scope of Machine Learning (ML) and Deep Learning (DL) algorithm development. As the study was primarily meant for generation of therapeutics for Tuberculosis (TB), the peptide proposed by this study demands meticulous invitro analysis prior to clinical applications.
Collapse
Affiliation(s)
- Hemchandra Deka
- SilicoScientia Private Limited, Nagananda Commercial Complex, No. 07/3, 15/1, 18th Main Road, Jayanagar 9th Block, Bengaluru, 5600413, India
| | - Atul Pawar
- SilicoScientia Private Limited, Nagananda Commercial Complex, No. 07/3, 15/1, 18th Main Road, Jayanagar 9th Block, Bengaluru, 5600413, India
| | - Monishka Battula
- Department of Bioinformatics, Rajiv Gandhi Institute of IT and Biotechnology, Bharati Vidyapeeth Deemed to be University, Pune-Satara Road, Pune, India
| | - Ayman A Ghfar
- Chemistry Department, College of Science, King Saud University, Riyadh, 11451, Saudi Arabia
| | - Mohamed E Assal
- Chemistry Department, College of Science, King Saud University, Riyadh, 11451, Saudi Arabia
| | - Rupesh V Chikhale
- Department of Pharmaceutical and Biological Chemistry, School of Pharmacy, University College London, Brunswick Square, London, UK.
| |
Collapse
|
3
|
Abadeh R, Aminafshar M, Ghaderi-Zefrehei M, Chamani M. A new gene tree algorithm employing DNA sequences of bovine genome using discrete Fourier transformation. PLoS One 2023; 18:e0277480. [PMID: 36893167 PMCID: PMC9997877 DOI: 10.1371/journal.pone.0277480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 10/28/2022] [Indexed: 03/10/2023] Open
Abstract
Within the realms of human thoughts on nature, Fourier analysis is considered as one of the greatest ideas currently put forwarded. The Fourier transform shows that any periodic function can be rewritten as the sum of sinusoidal functions. Having a Fourier transform view on real-world problems like the DNA sequence of genes, would make things intuitively simple to understand in comparison with their initial formal domain view. In this study we used discrete Fourier transform (DFT) on DNA sequences of a set of genes in the bovine genome known to govern milk production, in order to develop a new gene clustering algorithm. The implementation of this algorithm is very user-friendly and requires only simple routine mathematical operations. By transforming the configuration of gene sequences into frequency domain, we sought to elucidate important features and reveal hidden gene properties. This is biologically appealing since no information is lost via this transformation and we are therefore not reducing the number of degrees of freedom. The results from different clustering methods were integrated using evidence accumulation algorithms to provide in insilico validation of our results. We propose using candidate gene sequences accompanied by other genes of biologically unknown function. These will then be assigned some degree of relevant annotation by using our proposed algorithm. Current knowledge in biological gene clustering investigation is also lacking, and so DFT-based methods will help shine a light on use of these algorithms for biological insight.
Collapse
Affiliation(s)
- Roxana Abadeh
- Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Mehdi Aminafshar
- Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | | | - Mohammad Chamani
- Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran
| |
Collapse
|
4
|
Rewiring of the Liver Transcriptome across Multiple Time-Scales Is Associated with the Weight Loss-Independent Resolution of NAFLD Following RYGB. Metabolites 2022; 12:metabo12040318. [DOI: 10.3390/metabo12040318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/28/2022] [Accepted: 03/29/2022] [Indexed: 02/06/2023] Open
Abstract
Roux-en-Y gastric bypass (RYGB) surgery potently improves obesity and a myriad of obesity-associated co-morbidities including type 2 diabetes and non-alcoholic fatty liver disease (NAFLD). Time-series omics data are increasingly being utilized to provide insight into the mechanistic underpinnings that correspond to metabolic adaptations in RYGB. However, the conventional computational biology methods used to interpret these temporal multi-dimensional datasets have been generally limited to pathway enrichment analysis (PEA) of isolated pair-wise comparisons based on either experimental condition or time point, neither of which adequately capture responses to perturbations that span multiple time scales. To address this, we have developed a novel graph network-based analysis workflow designed to identify modules enriched with biomolecules that share common dynamic profiles, where the network is constructed from all known biological interactions available through the Kyoto Encyclopedia of Genes and Genomes (KEGG) resource. This methodology was applied to time-series RNAseq transcriptomics data collected on rodent liver samples following RYGB, and those of sham-operated and weight-matched control groups, to elucidate the molecular pathways involved in the improvement of as NAFLD. We report several network modules exhibiting a statistically significant enrichment of genes whose expression trends capture acute-phase as well as long term physiological responses to RYGB in a single analysis. Of note, we found the HIF1 and P53 signaling cascades to be associated with the immediate and the long-term response to RYGB, respectively. The discovery of less intuitive network modules that may have gone overlooked with conventional PEA techniques provides a framework for identifying novel drug targets for NAFLD and other metabolic syndrome co-morbidities.
Collapse
|
5
|
Zimmermann MT. Molecular Modeling is an Enabling Approach to Complement and Enhance Channelopathy Research. Compr Physiol 2022; 12:3141-3166. [PMID: 35578963 DOI: 10.1002/cphy.c190047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Hundreds of human membrane proteins form channels that transport necessary ions and compounds, including drugs and metabolites, yet details of their normal function or how function is altered by genetic variants to cause diseases are often unknown. Without this knowledge, researchers are less equipped to develop approaches to diagnose and treat channelopathies. High-resolution computational approaches such as molecular modeling enable researchers to investigate channelopathy protein function, facilitate detailed hypothesis generation, and produce data that is difficult to gather experimentally. Molecular modeling can be tailored to each physiologic context that a protein may act within, some of which may currently be difficult or impossible to assay experimentally. Because many genomic variants are observed in channelopathy proteins from high-throughput sequencing studies, methods with mechanistic value are needed to interpret their effects. The eminent field of structural bioinformatics integrates techniques from multiple disciplines including molecular modeling, computational chemistry, biophysics, and biochemistry, to develop mechanistic hypotheses and enhance the information available for understanding function. Molecular modeling and simulation access 3D and time-dependent information, not currently predictable from sequence. Thus, molecular modeling is valuable for increasing the resolution with which the natural function of protein channels can be investigated, and for interpreting how genomic variants alter them to produce physiologic changes that manifest as channelopathies. © 2022 American Physiological Society. Compr Physiol 12:3141-3166, 2022.
Collapse
Affiliation(s)
- Michael T Zimmermann
- Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, Wisconsin, USA.,Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, Wisconsin, USA.,Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
6
|
Identification of volatile producing enzymes in higher fungi: Combining analytical and bioinformatic methods. Methods Enzymol 2022; 664:221-242. [PMID: 35331375 DOI: 10.1016/bs.mie.2021.12.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Filamentous fungi harbor the genetic potential for the biosynthesis of several secondary metabolites including various volatile organic compounds (VOCs). Nonetheless, under standard laboratory conditions, many of these VOCs are not formed. Furthermore, little is known about enzymes involved in the production of fungal VOCs. To tap these interesting topics, we developed an approach to identify enzymes putatively involved in the fungal VOC biosynthesis. In this chapter, we highlight different fungal cultivation methods and techniques for the extraction of VOCs, including a method that allows the noninvasive analysis of VOCs. In addition using terpene synthases as an example, it is depicted how enzymes putatively involved in VOC synthesis can be identified by means of bioinformatic approaches. Transcriptomic data of chosen genes combined with volatilome data obtained during different developmental stages is demonstrated as a powerful tool to identify enzymes putatively involved in fungal VOC biosynthesis. Especially with regard to subsequent enzyme characterization, this procedure is a target-oriented way to save time and efforts by considering only the most important enzymes.
Collapse
|
7
|
Bodein A, Scott-Boyer MP, Perin O, Lê Cao KA, Droit A. timeOmics: an R package for longitudinal multi-omics data integration. Bioinformatics 2022; 38:577-579. [PMID: 34554215 DOI: 10.1093/bioinformatics/btab664] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 08/17/2021] [Accepted: 09/15/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Multi-omics data integration enables the global analysis of biological systems and discovery of new biological insights. Multi-omics experimental designs have been further extended with a longitudinal dimension to study dynamic relationships between molecules. However, methods that integrate longitudinal multi-omics data are still in their infancy. RESULTS We introduce the R package timeOmics, a generic analytical framework for the integration of longitudinal multi-omics data. The framework includes pre-processing, modeling and clustering to identify molecular features strongly associated with time. We illustrate this framework in a case study to detect seasonal patterns of mRNA, metabolites, gut taxa and clinical variables in patients with diabetes mellitus from the integrative Human Microbiome Project. AVAILABILITYAND IMPLEMENTATION timeOmics is available on Bioconductor and github.com/abodein/timeOmics. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC G1V 0A6, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC G1V 0A6, Canada
| | - Olivier Perin
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois 93600, France
| | - Kim-Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, University of Melbourne, Melbourne, VIC 3010, Australia
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC G1V 0A6, Canada
| |
Collapse
|
8
|
Sharma A, Johnson KB, Bie B, Rhoades EE, Sen A, Kida Y, Hockings J, Gatta A, Davenport J, Arcangelini C, Ritzu J, DeVecchio J, Hughen R, Wei M, Thomas Budd G, Lynn Henry N, Eng C, Foss J, Rotroff DM. A Multimodal Approach to Discover Biomarkers for Taxane-Induced Peripheral Neuropathy (TIPN): A Study Protocol. Technol Cancer Res Treat 2022; 21:15330338221127169. [PMID: 36172750 PMCID: PMC9523841 DOI: 10.1177/15330338221127169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Introduction: Taxanes are a class of chemotherapeutics commonly used to treat various solid tumors, including breast and ovarian cancers. Taxane-induced peripheral neuropathy (TIPN) occurs in up to 70% of patients, impacting quality of life both during and after treatment. TIPN typically manifests as tingling and numbness in the hands and feet and can cause irreversible loss of function of peripheral nerves. TIPN can be dose-limiting, potentially impacting clinical outcomes. The mechanisms underlying TIPN are poorly understood. As such, there are limited treatment options and no tools to provide early detection of those who will develop TIPN. Although some patients may have a genetic predisposition, genetic biomarkers have been inconsistent in predicting chemotherapy-induced peripheral neuropathy (CIPN). Moreover, other molecular markers (eg, metabolites, mRNA, miRNA, proteins) may be informative for predicting CIPN, but remain largely unexplored. We anticipate that combinations of multiple biomarkers will be required to consistently predict those who will develop TIPN. Methods: To address this clinical gap of identifying patients at risk of TIPN, we initiated the Genetics and Inflammatory Markers for CIPN (GENIE) study. This longitudinal multicenter observational study uses a novel, multimodal approach to evaluate genomic variation, metabolites, DNA methylation, gene expression, and circulating cytokines/chemokines prior to, during, and after taxane treatment in 400 patients with breast cancer. Molecular and patient reported data will be collected prior to, during, and after taxane therapy. Multi-modal data will be used to develop a set of comprehensive predictive biomarker signatures of TIPN. Conclusion: The goal of this study is to enable early detection of patients at risk of developing TIPN, provide a tool to modify taxane treatment to minimize morbidity from TIPN, and improved patient quality of life. Here we provide a brief review of the current state of research into CIPN and TIPN and introduce the GENIE study design.
Collapse
Affiliation(s)
- Anukriti Sharma
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, OH, USA
| | - Ken B. Johnson
- Department of Anesthesiology, University of Utah, UT, USA
| | - Bihua Bie
- Department of Anesthesiology, Cleveland Clinic, OH, USA
| | | | - Alper Sen
- Department of Anesthesiology, University of Utah, UT, USA
| | - Yuri Kida
- Department of Anesthesiology, University of Utah, UT, USA
| | - Jennifer Hockings
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
- Department of Pharmacy, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Alycia Gatta
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
| | | | | | | | - Jennifer DeVecchio
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
| | - Ron Hughen
- Department of Anesthesiology, University of Utah, UT, USA
| | - Mei Wei
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT
| | - G. Thomas Budd
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - N. Lynn Henry
- University of Michigan Rogel Cancer Center, Ann Arbor, MI, USA
| | - Charis Eng
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Department of Genetics and Genome Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Joseph Foss
- Department of Anesthesiology, Cleveland Clinic, OH, USA
| | - Daniel M. Rotroff
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Endocrinology and Metabolism Institute, Cleveland Clinic, Cleveland, OH, USA
| |
Collapse
|
9
|
Cardona L, Cao KAL, Puig-Castellví F, Bureau C, Madigou C, Mazéas L, Chapleur O. Integrative Analyses to Investigate the Link between Microbial Activity and Metabolite Degradation during Anaerobic Digestion. J Proteome Res 2020; 19:3981-3992. [DOI: 10.1021/acs.jproteome.0c00251] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Laëtitia Cardona
- Université Paris-Saclay, INRAE, PROSE, 1 rue Pierre-Gilles de Gennes, CS 10030, 92761 Antony Cedex, France
| | - Kim Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Francesc Puig-Castellví
- Université Paris-Saclay, INRAE, PROSE, 1 rue Pierre-Gilles de Gennes, CS 10030, 92761 Antony Cedex, France
- Université Paris-Saclay, INRAE, AgroParisTech, UMR SayFood, 75005 Paris, France
| | - Chrystelle Bureau
- Université Paris-Saclay, INRAE, PROSE, 1 rue Pierre-Gilles de Gennes, CS 10030, 92761 Antony Cedex, France
| | - Céline Madigou
- Acquisitions et Analyses de Données pour l’Histoire naturelle, 2AD—UMS 2700 CNRS MNHN, Muséum national d’Histoire naturelle, CP26, 57 rue Cuvier, 75231 Paris Cedex 05, France
| | - Laurent Mazéas
- Université Paris-Saclay, INRAE, PROSE, 1 rue Pierre-Gilles de Gennes, CS 10030, 92761 Antony Cedex, France
| | - Olivier Chapleur
- Université Paris-Saclay, INRAE, PROSE, 1 rue Pierre-Gilles de Gennes, CS 10030, 92761 Antony Cedex, France
| |
Collapse
|
10
|
Chandereng T, Gitter A. Lag penalized weighted correlation for time series clustering. BMC Bioinformatics 2020; 21:21. [PMID: 31948388 PMCID: PMC6966853 DOI: 10.1186/s12859-019-3324-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2019] [Accepted: 12/16/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND The similarity or distance measure used for clustering can generate intuitive and interpretable clusters when it is tailored to the unique characteristics of the data. In time series datasets generated with high-throughput biological assays, measurements such as gene expression levels or protein phosphorylation intensities are collected sequentially over time, and the similarity score should capture this special temporal structure. RESULTS We propose a clustering similarity measure called Lag Penalized Weighted Correlation (LPWC) to group pairs of time series that exhibit closely-related behaviors over time, even if the timing is not perfectly synchronized. LPWC aligns time series profiles to identify common temporal patterns. It down-weights aligned profiles based on the length of the temporal lags that are introduced. We demonstrate the advantages of LPWC versus existing time series and general clustering algorithms. In a simulated dataset based on the biologically-motivated impulse model, LPWC is the only method to recover the true clusters for almost all simulated genes. LPWC also identifies clusters with distinct temporal patterns in our yeast osmotic stress response and axolotl limb regeneration case studies. CONCLUSIONS LPWC achieves both of its time series clustering goals. It groups time series with correlated changes over time, even if those patterns occur earlier or later in some of the time series. In addition, it refrains from introducing large shifts in time when searching for temporal patterns by applying a lag penalty. The LPWC R package is available at https://github.com/gitter-lab/LPWC and CRAN under a MIT license.
Collapse
Affiliation(s)
- Thevaa Chandereng
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI USA
- Morgridge Institute of Research, Madison, WI USA
- Department of Statistics, University of Wisconsin-Madison, Madison, WI USA
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI USA
- Morgridge Institute of Research, Madison, WI USA
| |
Collapse
|
11
|
Bodein A, Chapleur O, Droit A, Lê Cao KA. A Generic Multivariate Framework for the Integration of Microbiome Longitudinal Studies With Other Data Types. Front Genet 2019; 10:963. [PMID: 31803221 PMCID: PMC6875829 DOI: 10.3389/fgene.2019.00963] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Accepted: 09/10/2019] [Indexed: 12/12/2022] Open
Abstract
Simultaneous profiling of biospecimens using different technological platforms enables the study of many data types, encompassing microbial communities, omics, and meta-omics as well as clinical or chemistry variables. Reduction in costs now enables longitudinal or time course studies on the same biological material or system. The overall aim of such studies is to investigate relationships between these longitudinal measures in a holistic manner to further decipher the link between molecular mechanisms and microbial community structures, or host-microbiota interactions. However, analytical frameworks enabling an integrated analysis between microbial communities and other types of biological, clinical, or phenotypic data are still in their infancy. The challenges include few time points that may be unevenly spaced and unmatched between different data types, a small number of unique individual biospecimens, and high individual variability. Those challenges are further exacerbated by the inherent characteristics of microbial communities-derived data (e.g., sparse, compositional). We propose a generic data-driven framework to integrate different types of longitudinal data measured on the same biological specimens with microbial community data and select key temporal features with strong associations within the same sample group. The framework ranges from filtering and modeling to integration using smoothing splines and multivariate dimension reduction methods to address some of the analytical challenges of microbiome-derived data. We illustrate our framework on different types of multi-omics case studies in bioreactor experiments as well as human studies.
Collapse
Affiliation(s)
- Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Chapleur
- Hydrosystems and Biopresses Research Unit, Irstea, Antony, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Kim-Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
12
|
Computational Analysis of the Molecular Mechanism of RamR Mutations Contributing to Antimicrobial Resistance in Salmonella enterica. Sci Rep 2017; 7:13418. [PMID: 29042652 PMCID: PMC5645378 DOI: 10.1038/s41598-017-14008-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Accepted: 10/04/2017] [Indexed: 11/09/2022] Open
Abstract
Antimicrobial resistance (AMR) in pathogenic microorganisms with multidrug resistance (MDR) constitutes a severe threat to human health. A major causative mechanism of AMR is mediated through the multidrug efflux pump (MEP). The resistance-nodulation-division superfamily (RND family) of Gram-negative bacteria is usually the major cause of MDR in clinical studies. In Salmonella enterica, the RND pump is translated from the acrAB gene, which is regulated by the activator RamA. Many MEP-caused AMR strains have high ramA gene expression due to mutations in RamR, which has a homodimeric structure comprising the dimerization domain and DNA-binding domain (DBD). Three mutations on the dimerization domain, namely Y59H, M84I, and E160D, are far from the DBD; the molecular mechanism through which they influence RamR’s binding affinity to the ramA gene promoter and consequently disrupt RamA remains unclear. The present study conducted molecular dynamics simulations, binding free energy calculations, and normal mode analysis to investigate the mechanism through which Y59H, M84I, and E160D mutations on the dimerization domain influence the binding affinity of RamR to the ramA promoter. The present results suggest that the three mutations alter the RamR structure, resulting in decreased DNA-binding affinity.
Collapse
|