1
|
Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes. Database (Oxford) 2024; 2024:baae031. [PMID: 38713862 DOI: 10.1093/database/baae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 02/23/2024] [Accepted: 04/01/2024] [Indexed: 05/09/2024]
Abstract
Germline and somatic mutations can give rise to proteins with altered activity, including both gain and loss-of-function. The effects of these variants can be captured in disease-specific reactions and pathways that highlight the resulting changes to normal biology. A disease reaction is defined as an aberrant reaction in which a variant protein participates. A disease pathway is defined as a pathway that contains a disease reaction. Annotation of disease variants as participants of disease reactions and disease pathways can provide a standardized overview of molecular phenotypes of pathogenic variants that is amenable to computational mining and mathematical modeling. Reactome (https://reactome.org/), an open source, manually curated, peer-reviewed database of human biological pathways, in addition to providing annotations for >11 000 unique human proteins in the context of ∼15 000 wild-type reactions within more than 2000 wild-type pathways, also provides annotations for >4000 disease variants of close to 400 genes as participants of ∼800 disease reactions in the context of ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, described in wild-type reactions and pathways, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Reactome's data model enables mapping of disease variant datasets to specific disease reactions within disease pathways, providing a platform to infer pathway output impacts of numerous human disease variants and model organism orthologs, complementing computational predictions of variant pathogenicity. Database URL: https://reactome.org/.
Collapse
|
2
|
Fecal microbiota transplantation stimulates type 2 and tolerogenic immune responses in a mouse model. Anaerobe 2024; 86:102841. [PMID: 38521227 DOI: 10.1016/j.anaerobe.2024.102841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/03/2024] [Accepted: 03/17/2024] [Indexed: 03/25/2024]
Abstract
OBJECTIVES Clostridioides difficile infection (CDI) is the leading hospital-acquired infection in North America. While previous work on fecal microbiota transplantation (FMT), a highly effective treatment for CDI, has focused on colonization resistance mounted against C. difficile by FMT-delivered commensals, the effects of FMT on host gene expression are relatively unexplored. This study aims to identify transcriptional changes associated with FMT, particularly changes associated with protective immune responses. METHODS Gene expression was assessed on day 2 and day 7 after FMT in mice after antibiotic-induced dysbiosis. Flow cytometry was also performed on colon and mesenteric lymph nodes at day 7 to investigate changes in immune cell populations. RESULTS FMT administration after antibiotic-induced dysbiosis successfully restored microbial alpha diversity to levels of donor mice by day 7 post-FMT. Bulk RNA sequencing of cecal tissue at day 2 identified immune genes, including both pro-inflammatory and Type 2 immune pathways as upregulated after FMT. RNA sequencing was repeated on day 7 post-FMT, and expression of these immune genes was decreased along with upregulation of genes associated with restoration of intestinal homeostasis. Immunoprofiling on day 7 identified increased colonic CD45+ immune cells that exhibited dampened Type 1 and heightened regulatory and Type 2 responses. These include an increased abundance of eosinophils, alternatively activated macrophages, Th2, and T regulatory cell populations. CONCLUSION These results highlight the impact of FMT on host gene expression, providing evidence that FMT restores intestinal homeostasis after antibiotic treatment and facilitates tolerogenic and Type 2 immune responses.
Collapse
|
3
|
Association of neurotransmitter pathway polygenic risk with specific symptom profiles in psychosis. Mol Psychiatry 2024:10.1038/s41380-024-02457-0. [PMID: 38491343 DOI: 10.1038/s41380-024-02457-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 01/19/2024] [Accepted: 01/24/2024] [Indexed: 03/18/2024]
Abstract
A primary goal of psychiatry is to better understand the pathways that link genetic risk to psychiatric symptoms. Here, we tested association of diagnosis and endophenotypes with overall and neurotransmitter pathway-specific polygenic risk in patients with early-stage psychosis. Subjects included 205 demographically diverse cases with a psychotic disorder who underwent comprehensive psychiatric and neurological phenotyping and 115 matched controls. Following genotyping, we calculated polygenic scores (PGSs) for schizophrenia (SZ) and bipolar disorder (BP) using Psychiatric Genomics Consortium GWAS summary statistics. To test if overall genetic risk can be partitioned into affected neurotransmitter pathways, we calculated pathway PGSs (pPGSs) for SZ risk affecting each of four major neurotransmitter systems: glutamate, GABA, dopamine, and serotonin. Psychosis subjects had elevated SZ PGS versus controls; cases with SZ or BP diagnoses had stronger SZ or BP risk, respectively. There was no significant association within psychosis cases between individual symptom measures and overall PGS. However, neurotransmitter-specific pPGSs were moderately associated with specific endophenotypes; notably, glutamate was associated with SZ diagnosis and with deficits in cognitive control during task-based fMRI, while dopamine was associated with global functioning. Finally, unbiased endophenotype-driven clustering identified three diagnostically mixed case groups that separated on primary deficits of positive symptoms, negative symptoms, global functioning, and cognitive control. All clusters showed strong genome-wide risk. Cluster 2, characterized by deficits in cognitive control and negative symptoms, additionally showed specific risk concentrated in glutamatergic and GABAergic pathways. Due to the intensive characterization of our subjects, the present study was limited to a relatively small cohort. As such, results should be followed up with additional research at the population and mechanism level. Our study suggests pathway-based PGS analysis may be a powerful path forward to study genetic mechanisms driving psychiatric endophenotypes.
Collapse
|
4
|
Neuromodulator regulation and emotions: insights from the crosstalk of cell signaling. Front Mol Neurosci 2024; 17:1376762. [PMID: 38516040 PMCID: PMC10954900 DOI: 10.3389/fnmol.2024.1376762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 02/26/2024] [Indexed: 03/23/2024] Open
Abstract
The unraveling of the regulatory mechanisms that govern neuronal excitability is a major challenge for neuroscientists worldwide. Neurotransmitters play a critical role in maintaining the balance between excitatory and inhibitory activity in the brain. The balance controls cognitive functions and emotional responses. Glutamate and γ-aminobutyric acid (GABA) are the primary excitatory and inhibitory neurotransmitters of the brain, respectively. Disruptions in the balance between excitatory and inhibitory transmission are implicated in several psychiatric disorders, including anxiety disorders, depression, and schizophrenia. Neuromodulators such as dopamine and acetylcholine control cognition and emotion by regulating the excitatory/inhibitory balance initiated by glutamate and GABA. Dopamine is closely associated with reward-related behaviors, while acetylcholine plays a role in aversive and attentional behaviors. Although the physiological roles of neuromodulators have been extensively studied neuroanatomically and electrophysiologically, few researchers have explored the interplay between neuronal excitability and cell signaling and the resulting impact on emotion regulation. This review provides an in-depth understanding of "cell signaling crosstalk" in the context of neuronal excitability and emotion regulation. It also anticipates that the next generation of neurochemical analyses, facilitated by integrated phosphorylation studies, will shed more light on this topic.
Collapse
|
5
|
Identification of diagnostic signatures for ischemic stroke by machine learning algorithm. J Stroke Cerebrovasc Dis 2024; 33:107564. [PMID: 38215553 DOI: 10.1016/j.jstrokecerebrovasdis.2024.107564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/25/2023] [Accepted: 01/07/2024] [Indexed: 01/14/2024] Open
Abstract
OBJECTIVE Ischemic stroke (IS) is one of the major diseases threatening human health and survival and a leading cause of acquired mortality and disability in adults. The aim of this study was to screen diagnostic features of IS and to explore the characteristics of immune cell infiltration in IS pathogenesis. METHODS The microarray data of IS (GSE16561, GSE58294, GSE37587, and GSE124026) in the GEO database were merged after removing the batch effect. Then integrated bioinformatic analysis and machine-learning strategies were adopted to analyze the functional correlation and select diagnostic signatures. The WGCNA was used to identify the co-expression modules related to IS. The CIBERSORT algorithm was performed to assess the inflammatory state of IS and to investigate the correlation between diagnostic signatures and infiltrating immune cells. RESULTS Functional analysis of dysregulated genes showed that immune response-regulating signaling pathway and pattern recognition receptor activity were enriched in the pathophysiology of IS. The turquoise module was identified as the significant module with IS. By using Lasso and SVM-RFE learning methods, we finally obtained four diagnostic genes, including LAMP2, CR1, CLEC4E, and F5. The corresponding results of AUC of ROC prediction model in training and validation cohort were 0.954 and 0.862, respectively. The immune cell infiltration analysis suggested that plasma cells, resting and activated NK cells, activated dendritic cells, memory B cells, CD8+ T cells, naïve CD4+ T cells, and resting mast cells may be involved in the development of IS. Additionally, these diagnostic signatures might be correlated with multiple immune cells in varying degrees. CONCLUSION We identified four biologically relevant genes (LAMP2, CR1, CLEC4E, and F5) with diagnostic effects for IS, our results further provide novel insights regarding molecular mechanisms associated with various immune cells that related to IS for future investigations.
Collapse
|
6
|
Early IGF-1 receptor inhibition in mice mimics preterm human brain disorders and reveals a therapeutic target. SCIENCE ADVANCES 2024; 10:eadk8123. [PMID: 38427732 PMCID: PMC10906931 DOI: 10.1126/sciadv.adk8123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 01/29/2024] [Indexed: 03/03/2024]
Abstract
Besides recent advances in neonatal care, preterm newborns still develop sex-biased behavioral alterations. Preterms fail to receive placental insulin-like growth factor-1 (IGF-1), a major fetal growth hormone in utero, and low IGF-1 serum levels correlate with preterm poor neurodevelopmental outcomes. Here, we mimicked IGF-1 deficiency of preterm newborns in mice by perinatal administration of an IGF-1 receptor antagonist. This resulted in sex-biased brain microstructural, functional, and behavioral alterations, resembling those of ex-preterm children, which we characterized performing parallel mouse/human behavioral tests. Pharmacological enhancement of GABAergic tonic inhibition by the U.S. Food and Drug Administration-approved drug ganaxolone rescued functional/behavioral alterations in mice. Establishing an unprecedented mouse model of prematurity, our work dissects the mechanisms at the core of abnormal behaviors and identifies a readily translatable therapeutic strategy for preterm brain disorders.
Collapse
|
7
|
tRFUniverse: A comprehensive resource for the interactive analyses of tRNA-derived ncRNAs in human cancer. iScience 2024; 27:108810. [PMID: 38303722 PMCID: PMC10831894 DOI: 10.1016/j.isci.2024.108810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 08/02/2023] [Accepted: 01/02/2024] [Indexed: 02/03/2024] Open
Abstract
tRNA-derived ncRNAs are a heterogeneous class of non-coding RNAs recently proposed to be active regulators of gene expression and be involved in many diseases, including cancer. Consequently, several online resources on tRNA-derived ncRNAs have been released. Although interesting, such resources present only basic features and do not adequately exploit the wealth of knowledge available about tRNA-derived ncRNAs. Therefore, we introduce tRFUniverse, a novel online resource for the analysis of tRNA-derived ncRNAs in human cancer. tRFUniverse presents an extensive collection of classes of tRNA-derived ncRNAs analyzed across all the TCGA and TARGET tumor cohorts, NCI-60 cell lines, and biological fluids. Moreover, public AGO CLASH/CLIP-Seq data were analyzed to identify the molecular interactions between tRNA-derived ncRNAs and other transcripts. Importantly, tRFUniverse combines in a single resource a comprehensive set of features that we believe may be helpful to investigate the involvement of tRNA-derived ncRNAs in cancer biology.
Collapse
|
8
|
Single-cell RNA sequencing reveals the epithelial cell, fibroblast, and key gene alterations in chronic rhinosinusitis with nasal polyps. Sci Rep 2024; 14:2270. [PMID: 38280891 PMCID: PMC10821928 DOI: 10.1038/s41598-024-52341-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 01/17/2024] [Indexed: 01/29/2024] Open
Abstract
Chronic rhinosinusitis with nasal polyps (CRSwNP) is a chronic inflammatory disease of the nasal mucosa, and epithelial-mesenchymal transition (EMT) is thought to be an essential process in the pathogenesis of CRSwNP. However, the mechanisms of epithelial and fibroblastic changes at the single-cell level are unclear. In this study, we investigated the epithelial cell, fibroblast, and key gene alterations in the development of CRSwNP. We revealed major cell types involved in CRSwNP and nasal mucosal inflammation formation, then mapped epithelial and fibroblast subpopulations. We showed that the apical and glandular epithelial cells and the ADGRB3+ and POSTN+ fibroblasts were the key cell subtypes in the progression of CRSwNP. Pseudotime and cell cycle analysis identified dynamic changes between epithelial cells and fibroblasts during its development. WFDC2 and CCL26 were identified as the key marker genes involved in the development of CRSwNP and were validated by IHC staining, which may provide a potential novel target for future CRSwNP therapy. ScRNA-seq data provided insights into the cellular landscape and the relationship between epithelial cells and fibroblasts in the progression of CRSwNP. WFDC2 and CCL26 were identified as the key genes involved in the development of CRSwNP and may be the potential markers for gene therapy.
Collapse
|
9
|
The Reactome Pathway Knowledgebase 2024. Nucleic Acids Res 2024; 52:D672-D678. [PMID: 37941124 PMCID: PMC10767911 DOI: 10.1093/nar/gkad1025] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 10/14/2023] [Accepted: 10/20/2023] [Indexed: 11/10/2023] Open
Abstract
The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.
Collapse
|
10
|
Patterns of change in regulatory modules of chemical reaction systems induced by network modification. PNAS NEXUS 2024; 3:pgad441. [PMID: 38292559 PMCID: PMC10825507 DOI: 10.1093/pnasnexus/pgad441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 12/04/2023] [Indexed: 02/01/2024]
Abstract
Cellular functions are realized through the dynamics of chemical reaction networks formed by thousands of chemical reactions. Numerical studies have empirically demonstrated that small differences in network structures among species or tissues can cause substantial changes in dynamics. However, a general principle for behavior changes in response to network structure modifications is not known. The chemical reaction system possesses substructures called buffering structures, which are characterized by a certain topological index being zero. It was proven that the steady-state response to modulation of reaction parameters inside a buffering structure is localized in the buffering structure. In this study, we developed a method to systematically identify the loss or creation of buffering structures induced by the addition of a single degradation reaction from network structure alone. This makes it possible to predict the qualitative and macroscopic changes in regulation that will be caused by the network modification. This method was applied to two reaction systems: the central metabolic system and the mitogen-activated protein kinases signal transduction system. Our method enables identification of reactions that are important for biological functions in living systems.
Collapse
|
11
|
Vitamin D analog calcitriol for breast cancer therapy; an integrated drug discovery approach. J Biomol Struct Dyn 2023; 41:11017-11043. [PMID: 37054526 DOI: 10.1080/07391102.2023.2199866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 12/11/2022] [Indexed: 04/15/2023]
Abstract
As breast cancer remains leading cause of cancer death globally, it is essential to develop an affordable breast cancer therapy in underdeveloped countries. Drug repurposing offers potential to address gaps in breast cancer treatment. Molecular networking studies were performed for drug repurposing approach by using heterogeneous data. The PPI networks were built to select the target genes from the EGFR overexpression signaling pathway and its associated family members. The selected genes EGFR, ErbB2, ErbB4 and ErbB3 were allowed to interact with 2637 drugs, leads to PDI network construction of 78, 61, 15 and 19 drugs, respectively. As drugs approved for treating non cancer-related diseases or disorders are clinically safe, effective, and affordable, these drugs were given considerable attention. Calcitriol had shown significant binding affinities with all four receptors than standard neratinib. The RMSD, RMSF, and H-bond analysis of protein-ligand complexes from molecular dynamics simulation (100 ns), confirmed the stable binding of calcitriol with ErbB2 and EGFR receptors. In addition, MMGBSA and MMP BSA also affirmed the docking results. These in-silico results were validated with in-vitro cytotoxicity studies in SK-BR-3 and Vero cells. The IC50 value of calcitriol (43.07 mg/ml) was found to be lower than neratinib (61.50 mg/ml) in SK-BR-3 cells. In Vero cells the IC50 value of calcitriol (431.05 mg/ml) was higher than neratinib (404.95 mg/ml). It demonstrates that calcitriol suggestively downregulated the SK-BR-3 cell viability in a dose-dependent manner. These implications revealed calcitriol has shown better cytotoxicity and decreased the proliferation rate of breast cancer cells than neratinib.Communicated by Ramaswamy H. Sarma.
Collapse
|
12
|
Over-representation analysis of angiogenic factors in immunosuppressive mechanisms in neoplasms and neurological conditions during COVID-19. Microb Pathog 2023; 185:106386. [PMID: 37865274 DOI: 10.1016/j.micpath.2023.106386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/27/2023] [Accepted: 10/09/2023] [Indexed: 10/23/2023]
Abstract
BACKGROUND Recent studies emphasized the necessity to identify key (human) biological processes and pathways targeted by the Coronaviridae family of viruses, especially Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Coronavirus Disease (COVID-19) caused up to 33-55 % death rates in COVID-19 patients with malignant neoplasms and Alzheimer's disease. Given this scenario, we identified biological processes and pathways involved in various diseases which are most likely affected by COVID-19. METHODS The COVID-19 DisGeNET data set (v4.0) contains the associations between various diseases and human genes known to interact with viruses from Coronaviridae family and were obtained from the IntAct Coronavirus data set annotated with DisGeNET data. We constructed the disease-gene network to identify genes that are involved in various comorbid diseased states. Communities from the disease-gene network were identified using Louvain method and functional enrichment through over-representation analysis methodology was used to discover significant biological processes and pathways shared between COVID-19 and other diseases. RESULT The COVID-19 DisGeNET data set (v4.0) comprised of 828 human genes and 10,473 diseases (including various phenotypes) that together constituted nodes in the disease-gene network. Each of the 70,210 edges connects a human gene with an associated disease. The top 10 genes linked to most number of diseases were VEGFA, BCL2, CTNNB1, ALB, COX2, AGT, HLA-A, HMOX1, FGF2 and COMT. The most vulnerable group of patients thus discovered had comorbid conditions such as carcinomas, malignant neoplasms and Alzheimer's disease. Finally, we identified 15 potentially useful biological processes and pathways for improved therapies. Vascular endothelial growth factor (VEGF) is the key mediator of angiogenesis in cancer. It is widely distributed in the brain and plays a crucial role in brain inflammation regulating the level of angiopoietins. With a degree of 1899, VEGFA was associated with maximum number of diseases in the disease-gene network. Previous studies have indicated that increased levels of VEGFA in the blood results in dyspnea, Pulmonary Edema (PE), Acute Lung Injury (ALI) and Acute Respiratory Distress Syndrome (ARDS). In case of COVID-19 patients with neoplasms and other neurological symptoms, our results indicate VEGFA as a therapeutic target for inflammation suppression. As VEGFs are known to disproportionately affect cancer patients, improving endothelial permeability and vasodilation with anti-VEGF therapy could lead to suppression of inflammation and also improve oxygenation. As an outcome of our study, we make case for clinical investigations towards anti-VEGF therapies for such comorbid conditions affected by COVID-19 for better therapeutic outcomes.
Collapse
|
13
|
CellTICS: an explainable neural network for cell-type identification and interpretation based on single-cell RNA-seq data. Brief Bioinform 2023; 25:bbad449. [PMID: 38061196 PMCID: PMC10703497 DOI: 10.1093/bib/bbad449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 10/30/2023] [Accepted: 11/14/2023] [Indexed: 12/18/2023] Open
Abstract
Identifying cell types is crucial for understanding the functional units of an organism. Machine learning has shown promising performance in identifying cell types, but many existing methods lack biological significance due to poor interpretability. However, it is of the utmost importance to understand what makes cells share the same function and form a specific cell type, motivating us to propose a biologically interpretable method. CellTICS prioritizes marker genes with cell-type-specific expression, using a hierarchy of biological pathways for neural network construction, and applying a multi-predictive-layer strategy to predict cell and sub-cell types. CellTICS usually outperforms existing methods in prediction accuracy. Moreover, CellTICS can reveal pathways that define a cell type or a cell type under specific physiological conditions, such as disease or aging. The nonlinear nature of neural networks enables us to identify many novel pathways. Interestingly, some of the pathways identified by CellTICS exhibit differential expression "variability" rather than differential expression across cell types, indicating that expression stochasticity within a pathway could be an important feature characteristic of a cell type. Overall, CellTICS provides a biologically interpretable method for identifying and characterizing cell types, shedding light on the underlying pathways that define cellular heterogeneity and its role in organismal function. CellTICS is available at https://github.com/qyyin0516/CellTICS.
Collapse
|
14
|
CrossFuse-XGBoost: accurate prediction of the maximum recommended daily dose through multi-feature fusion, cross-validation screening and extreme gradient boosting. Brief Bioinform 2023; 25:bbad511. [PMID: 38216539 PMCID: PMC10786712 DOI: 10.1093/bib/bbad511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 12/04/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024] Open
Abstract
In the drug development process, approximately 30% of failures are attributed to drug safety issues. In particular, the first-in-human (FIH) trial of a new drug represents one of the highest safety risks, and initial dose selection is crucial for ensuring safety in clinical trials. With traditional dose estimation methods, which extrapolate data from animals to humans, catastrophic events have occurred during Phase I clinical trials due to interspecies differences in compound sensitivity and unknown molecular mechanisms. To address this issue, this study proposes a CrossFuse-extreme gradient boosting (XGBoost) method that can directly predict the maximum recommended daily dose of a compound based on existing human research data, providing a reference for FIH dose selection. This method not only integrates multiple features, including molecular representations, physicochemical properties and compound-protein interactions, but also improves feature selection based on cross-validation. The results demonstrate that the CrossFuse-XGBoost method not only improves prediction accuracy compared to that of existing local weighted methods [k-nearest neighbor (k-NN) and variable k-NN (v-NN)] but also solves the low prediction coverage issue of v-NN, achieving full coverage of the external validation set and enabling more reliable predictions. Furthermore, this study offers a high level of interpretability by identifying the importance of different features in model construction. The 241 features with the most significant impact on the maximum recommended daily dose were selected, providing references for optimizing the structure of new compounds and guiding experimental research. The datasets and source code are freely available at https://github.com/cqmu-lq/CrossFuse-XGBoost.
Collapse
|
15
|
PRANA: an R package for differential co-expression network analysis with the presence of additional covariates. BMC Genomics 2023; 24:687. [PMID: 37974076 PMCID: PMC10652545 DOI: 10.1186/s12864-023-09787-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023] Open
Abstract
BACKGROUND Advances in sequencing technology and cost reduction have enabled an emergence of various statistical methods used in RNA-sequencing data, including the differential co-expression network analysis (or differential network analysis). A key benefit of this method is that it takes into consideration the interactions between or among genes and do not require an established knowledge in biological pathways. As of now, none of existing softwares can incorporate covariates that should be adjusted if they are confounding factors while performing the differential network analysis. RESULTS We develop an R package PRANA which a user can easily include multiple covariates. The main R function in this package leverages a novel pseudo-value regression approach for a differential network analysis in RNA-sequencing data. This software is also enclosed with complementary R functions for extracting adjusted p-values and coefficient estimates of all or specific variable for each gene, as well as for identifying the names of genes that are differentially connected (DC, hereafter) between subjects under biologically different conditions from the output. CONCLUSION Herewith, we demonstrate the application of this package in a real data on chronic obstructive pulmonary disease. PRANA is available through the CRAN repositories under the GPL-3 license: https://cran.r-project.org/web/packages/PRANA/index.html .
Collapse
|
16
|
Identification of Cellular Interactions in the Tumor Immune Microenvironment Underlying CD8 T Cell Exhaustion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.566384. [PMID: 38014233 PMCID: PMC10680664 DOI: 10.1101/2023.11.09.566384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
While immune checkpoint inhibitors show success in treating a subset of patients with certain late-stage cancers, these treatments fail in many other patients as a result of mechanisms that have yet to be fully characterized. The process of CD8 T cell exhaustion, by which T cells become dysfunctional in response to prolonged antigen exposure, has been implicated in immunotherapy resistance. Single-cell RNA sequencing (scRNA-seq) produces an abundance of data to analyze this process; however, due to the complexity of the process, contributions of other cell types to a process within a single cell type cannot be simply inferred. We constructed an analysis framework to first rank human skin tumor samples by degree of exhaustion in tumor-infiltrating CD8 T cells and then identify immune cell type-specific gene-regulatory network patterns significantly associated with T cell exhaustion. Using this framework, we further analyzed scRNA-seq data from human tumor and chronic viral infection samples to compare the T cell exhaustion process between these two contexts. In doing so, we identified transcription factor activity in the macrophages of both tissue types associated with this process. Our framework can be applied beyond the tumor immune microenvironment to any system involving cell-cell communication, facilitating insights into key biological processes that underpin the effective treatment of cancer and other complicated diseases.
Collapse
|
17
|
Shortest Hyperpaths in Directed Hypergraphs for Reaction Pathway Inference. J Comput Biol 2023; 30:1198-1225. [PMID: 37906100 DOI: 10.1089/cmb.2023.0242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023] Open
Abstract
Signaling and metabolic pathways, which consist of chains of reactions that produce target molecules from source compounds, are cornerstones of cellular biology. Properly modeling the reaction networks that represent such pathways requires directed hypergraphs, where each molecule or compound maps to a vertex, and each reaction maps to a hyperedge directed from its set of input reactants to its set of output products. Inferring the most likely series of reactions that produces a given set of targets from a given set of sources, where for each reaction its reactants are produced by prior reactions in the series, corresponds to finding a shortest hyperpath in a directed hypergraph, which is NP-complete. We give the first exact algorithm for general shortest hyperpaths that can find provably optimal solutions for large, real-world, reaction networks. In particular, we derive a novel graph-theoretic characterization of hyperpaths, which we leverage in a new integer linear programming formulation of shortest hyperpaths that for the first time handles cycles, and develop a cutting-plane algorithm that can solve this integer linear program to optimality in practice. Through comprehensive experiments over all of the thousands of instances from the standard Reactome and NCI-PID reaction databases, we demonstrate that our cutting-plane algorithm quickly finds an optimal hyperpath-inferring the most likely pathway-with a median running time of under 10 seconds, and a maximum time of less than 30 minutes, even on instances with thousands of reactions. We also explore for the first time how well hyperpaths infer true pathways, and show that shortest hyperpaths accurately recover known pathways, typically with very high precision and recall. Source code implementing our cutting-plane algorithm for shortest hyperpaths is available free for research use in a new tool called Mmunin.
Collapse
|
18
|
Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.18.562964. [PMID: 37904913 PMCID: PMC10614924 DOI: 10.1101/2023.10.18.562964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
Disease variant annotation in the context of biological reactions and pathways can provide a standardized overview of molecular phenotypes of pathogenic mutations that is amenable to computational mining and mathematical modeling. Reactome, an open source, manually curated, peer-reviewed database of human biological pathways, provides annotations for over 4000 disease variants of close to 400 genes in the context of ∼800 disease reactions constituting ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics (ACMG). Reactome's pathway-based, reaction-specific disease variant dataset and data model provide a platform to infer pathway output impacts of numerous human disease variants and model organism orthologs, complementing computational predictions of variant pathogenicity.
Collapse
|
19
|
A data integration approach unveils a transcriptional signature of type 2 diabetes progression in rat and human islets. PLoS One 2023; 18:e0292579. [PMID: 37816033 PMCID: PMC10564241 DOI: 10.1371/journal.pone.0292579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 09/22/2023] [Indexed: 10/12/2023] Open
Abstract
Pancreatic islet failure is a key characteristic of type 2 diabetes besides insulin resistance. To get molecular insights into the pathology of islets in type 2 diabetes, we developed a computational approach to integrating expression profiles of Goto-Kakizaki and Wistar rat islets from a designed experiment with those of the human islets from an observational study. A principal gene-eigenvector in the expression profiles characterized by up-regulated angiogenesis and down-regulated oxidative phosphorylation was identified conserved across the two species. In the case of Goto-Kakizaki versus Wistar islets, such alteration in gene expression can be verified directly by the treatment-control tests over time, and corresponds to the alteration of α/β-cell distribution obtained by quantifying the islet micrographs. Furthermore, the correspondence between the dual sample- and gene-eigenvectors unveils more delicate structures. In the case of rats, the up- and down-trend of insulin mRNA levels before and after week 8 correspond respectively to the top two principal eigenvectors. In the case of human, the top two principal eigenvectors correspond respectively to the late and early stages of diabetes. According to the aggregated expression signature, a large portion of genes involved in the hypoxia-inducible factor signaling pathway, which activates transcription of angiogenesis, were significantly up-regulated. Furthermore, top-ranked anti-angiogenic genes THBS1 and PEDF indicate the existence of a counteractive mechanism that is in line with thickened and fragmented capillaries found in the deteriorated islets. Overall, the integrative analysis unravels the principal transcriptional alterations underlying the islet deterioration of morphology and insulin secretion along type 2 diabetes progression.
Collapse
|
20
|
High genetic heterogeneity of leukodystrophies in Iranian children: the first report of Iranian Leukodystrophy Registry. Neurogenetics 2023; 24:279-289. [PMID: 37597066 DOI: 10.1007/s10048-023-00730-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 08/09/2023] [Indexed: 08/21/2023]
Abstract
Leukodystrophies (LDs) are a heterogeneous group of progressive neurological disorders and characterized by primary involvement of white matter of the central nervous system (CNS). This is the first report of the Iranian LD Registry database to describe the clinical, radiological, and genomic data of Persian patients with leukodystrophies. From 2016 to 2019, patients suspicious of LDs were examined followed by a brain magnetic resonance imaging (MRI). A single gene testing or whole-exome sequencing (WES) was used depending on the neuroradiologic phenotypes. In a few cases, the diagnosis was made by metabolic studies. Based on the MRI pattern, diagnosed patients were divided into cohorts A (hypomyelinating LDs) versus cohort B (Other LDs). The most recent LD classification was utilized for classification of diagnosed patients. For novel variants, in silico analyses were performed to verify their pathogenicity. Out of 680 registered patients, 342 completed the diagnostic evaluations. In total, 245 patients met a diagnosis which in turn 24.5% were categorized in cohort A and the remaining in cohort B. Genetic tests revealed causal variants in 228 patients consisting of 213 variants in 110 genes with 78 novel variants. WES and single gene testing identified a causal variant in 65.5% and 34.5% cases, respectively. The total diagnostic rate of WES was 60.7%. Lysosomal disorders (27.3%; GM2-gangliosidosis-9.8%, MLD-6.1%, KD-4.5%), amino and organic acid disorders (17.15%; Canavan disease-4.5%, L-2-HGA-3.6%), mitochondrial leukodystrophies (12.6%), ion and water homeostasis disorders (7.3%; MLC-4.5%), peroxisomal disorders (6.5%; X-ALD-3.6%), and myelin protein disorders (3.6%; PMLD-3.6%) were the most commonly diagnosed disorders. Thirty-seven percent of cases had a pathogenic variant in nine genes (ARSA, HEXA, ASPA, MLC1, GALC, GJC2, ABCD1, L2HGDH, GCDH). This study highlights the most common types as well as the genetic heterogeneity of LDs in Iranian children.
Collapse
|
21
|
An R package for Survival-based Gene Set Enrichment Analysis. RESEARCH SQUARE 2023:rs.3.rs-3367968. [PMID: 37841872 PMCID: PMC10571627 DOI: 10.21203/rs.3.rs-3367968/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Functional enrichment analysis is usually used to assess the effects of experimental differences. However, researchers sometimes want to understand the relationship between transcriptomic variation and health outcomes like survival. Therefore, we suggest the use of Survival-based Gene Set Enrichment Analysis (SGSEA) to help determine biological functions associated with a disease's survival. We developed an R package and corresponding Shiny App called SGSEA for this analysis and presented a study of kidney renal clear cell carcinoma (KIRC) to demonstrate the approach. In Gene Set Enrichment Analysis (GSEA), the log-fold change in expression between treatments is used to rank genes, to determine if a biological function has a non-random distribution of altered gene expression. SGSEA is a variation of GSEA using the hazard ratio instead of a log fold change. Our study shows that pathways enriched with genes whose increased transcription is associated with mortality (NES > 0, adjusted p-value < 0.15) have previously been linked to KIRC survival, helping to demonstrate the value of this approach. This approach allows researchers to quickly identify disease variant pathways for further research and provides supplementary information to standard GSEA, all within a single R package or through using the convenient app.
Collapse
|
22
|
Identifying mitophagy-related genes as prognostic biomarkers and therapeutic targets of gastric carcinoma by integrated analysis of single-cell and bulk-RNA sequencing data. Comput Biol Med 2023; 163:107227. [PMID: 37413850 DOI: 10.1016/j.compbiomed.2023.107227] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 06/01/2023] [Accepted: 06/30/2023] [Indexed: 07/08/2023]
Abstract
Gastric carcinoma (GC) is the fourth leading cause of cancer-related mortality worldwide. Patients with advanced GC tend to have poor prognoses and shortened survival. Finding novel predictive biomarkers for GC prognosis is an urgent need. Mitophagy is the selection degradation of damaged mitochondria to maintain cellular homeostasis, which has been shown to play both pro- and anti-tumor effects. This study combined single-cell sequencing data and transcriptomics to screen mitophagy-related genes (MRGs) associated with GC progression and analyze their clinical values. Reverse transcription-quantitative PCR (RT-qPCR) and immunochemistry (IHC) further verified gene expression profiles. A total of 18 DE-MRGs were identified after taking an intersection of single-cell sequencing data and MRGs. Cells with a higher MRG score were mainly distributed in the epithelial cell cluster. Cell-to-cell communications among epithelial cells with other cell types were significantly upregulated. We established and validated a reliable nomogram model based on DE-MRGs (GABARAPL2 and CDC37) and traditional clinicopathological parameters. GABARAPL2 and CDC37 displayed different immune infiltration states. Given the significant correlation between hub genes and immune checkpoints, targeting MRGs in GC may supplement more benefits to patients who received immunotherapy. In conclusion, GABARAPL2 and CDC37 may be prognostic biomarkers and candidate therapeutic targets of GC.
Collapse
|
23
|
Heterogeneous message passing for heterogeneous networks. Phys Rev E 2023; 108:034310. [PMID: 37849099 DOI: 10.1103/physreve.108.034310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 09/01/2023] [Indexed: 10/19/2023]
Abstract
Message passing (MP) is a computational technique used to find approximate solutions to a variety of problems defined on networks. MP approximations are generally accurate in locally treelike networks but require corrections to maintain their accuracy level in networks rich with short cycles. However, MP may already be computationally challenging on very large networks and additional costs incurred by correcting for cycles could be prohibitive. We show how the issue can be addressed. By allowing each node in the network to have its own level of approximation, one can focus on improving the accuracy of MP approaches in a targeted manner. We perform a systematic analysis of 109 real-world networks and show that our node-based MP approximation is able to increase both the accuracy and speed of traditional MP approaches. We find that, compared to conventional MP, a heterogeneous approach based on a simple heuristic is more accurate in 81% of tested networks, faster in 64% of cases, and both more accurate and faster in 49% of cases.
Collapse
|
24
|
Ancestry-related differences in chromatin accessibility and gene expression of APOE ε4 are associated with Alzheimer's disease risk. Alzheimers Dement 2023; 19:3902-3915. [PMID: 37037656 PMCID: PMC10529851 DOI: 10.1002/alz.13075] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 03/03/2023] [Accepted: 03/08/2023] [Indexed: 04/12/2023]
Abstract
INTRODUCTION European local ancestry (ELA) surrounding apolipoprotein E (APOE) ε4 confers higher risk for Alzheimer's disease (AD) compared to African local ancestry (ALA). We demonstrated significantly higher APOE ε4 expression in ELA versus ALA in AD brains from APOE ε4/ε4 carriers. Chromatin accessibility differences could contribute to these expression changes. METHODS We performed single nuclei assays for transposase accessible chromatin sequencing from the frontal cortex of six ALA and six ELA AD brains, homozygous for local ancestry and APOE ε4. RESULTS Our results showed an increased chromatin accessibility at the APOE ε4 promoter area in ELA versus ALA astrocytes. This increased accessibility in ELA astrocytes extended genome wide. Genes with increased accessibility in ELA in astrocytes were enriched for synapsis, cholesterol processing, and astrocyte reactivity. DISCUSSION Our results suggest that increased chromatin accessibility of APOE ε4 in ELA astrocytes contributes to the observed elevated APOE ε4 expression, corresponding to the increased AD risk in ELA versus ALA APOE ε4/ε4 carriers.
Collapse
|
25
|
Cancer-keeper genes as therapeutic targets. iScience 2023; 26:107296. [PMID: 37520717 PMCID: PMC10382876 DOI: 10.1016/j.isci.2023.107296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 05/18/2023] [Accepted: 07/03/2023] [Indexed: 08/01/2023] Open
Abstract
Finding cancer-driver genes has been a central theme of cancer research. We took a different perspective; instead of considering normal cells, we focused on cancerous cells and genes that maintained abnormal cell growth, which we named cancer-keeper genes (CKGs). Intervening CKGs may rectify aberrant cell growth, making them potential cancer therapeutic targets. We introduced control-hub genes and developed an efficient algorithm by extending network controllability theory. Control hub are essential for maintaining cancerous states and thus can be taken as CKGs. We applied our CKG-based approach to bladder cancer (BLCA). All genes on the cell-cycle and p53 pathways in BLCA were identified as CKGs, showing their importance in cancer. We discovered that sensitive CKGs - genes easily altered by structural perturbation - were particularly suitable therapeutic targets. Experiments on cell lines and a mouse model confirmed that six sensitive CKGs effectively suppressed cancer cell growth, demonstrating the immense therapeutic potential of CKGs.
Collapse
|
26
|
Coherent pathway enrichment estimation by modeling inter-pathway dependencies using regularized regression. Bioinformatics 2023; 39:btad522. [PMID: 37610338 PMCID: PMC10471899 DOI: 10.1093/bioinformatics/btad522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 07/04/2023] [Accepted: 08/22/2023] [Indexed: 08/24/2023] Open
Abstract
MOTIVATION Gene set enrichment methods are a common tool to improve the interpretability of gene lists as obtained, for example, from differential gene expression analyses. They are based on computing whether dysregulated genes are located in certain biological pathways more often than expected by chance. Gene set enrichment tools rely on pre-existing pathway databases such as KEGG, Reactome, or the Gene Ontology. These databases are increasing in size and in the number of redundancies between pathways, which complicates the statistical enrichment computation. RESULTS We address this problem and develop a novel gene set enrichment method, called pareg, which is based on a regularized generalized linear model and directly incorporates dependencies between gene sets related to certain biological functions, for example, due to shared genes, in the enrichment computation. We show that pareg is more robust to noise than competing methods. Additionally, we demonstrate the ability of our method to recover known pathways as well as to suggest novel treatment targets in an exploratory analysis using breast cancer samples from TCGA. AVAILABILITY AND IMPLEMENTATION pareg is freely available as an R package on Bioconductor (https://bioconductor.org/packages/release/bioc/html/pareg.html) as well as on https://github.com/cbg-ethz/pareg. The GitHub repository also contains the Snakemake workflows needed to reproduce all results presented here.
Collapse
|
27
|
PATH-SURVEYOR: pathway level survival enquiry for immuno-oncology and drug repurposing. BMC Bioinformatics 2023; 24:266. [PMID: 37380943 PMCID: PMC10303868 DOI: 10.1186/s12859-023-05393-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 06/19/2023] [Indexed: 06/30/2023] Open
Abstract
Pathway-level survival analysis offers the opportunity to examine molecular pathways and immune signatures that influence patient outcomes. However, available survival analysis algorithms are limited in pathway-level function and lack a streamlined analytical process. Here we present a comprehensive pathway-level survival analysis suite, PATH-SURVEYOR, which includes a Shiny user interface with extensive features for systematic exploration of pathways and covariates in a Cox proportional-hazard model. Moreover, our framework offers an integrative strategy for performing Hazard Ratio ranked Gene Set Enrichment Analysis and pathway clustering. As an example, we applied our tool in a combined cohort of melanoma patients treated with checkpoint inhibition (ICI) and identified several immune populations and biomarkers predictive of ICI efficacy. We also analyzed gene expression data of pediatric acute myeloid leukemia (AML) and performed an inverse association of drug targets with the patient's clinical endpoint. Our analysis derived several drug targets in high-risk KMT2A-fusion-positive patients, which were then validated in AML cell lines in the Genomics of Drug Sensitivity database. Altogether, the tool offers a comprehensive suite for pathway-level survival analysis and a user interface for exploring drug targets, molecular features, and immune populations at different resolutions.
Collapse
|
28
|
Signaling mechanisms in renal compensatory hypertrophy revealed by multi-omics. Nat Commun 2023; 14:3481. [PMID: 37328470 PMCID: PMC10276015 DOI: 10.1038/s41467-023-38958-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 05/24/2023] [Indexed: 06/18/2023] Open
Abstract
Loss of a kidney results in compensatory growth of the remaining kidney, a phenomenon of considerable clinical importance. However, the mechanisms involved are largely unknown. Here, we use a multi-omic approach in a unilateral nephrectomy model in male mice to identify signaling processes associated with renal compensatory hypertrophy, demonstrating that the lipid-activated transcription factor peroxisome proliferator-activated receptor alpha (PPARα) is an important determinant of proximal tubule cell size and is a likely mediator of compensatory proximal tubule hypertrophy.
Collapse
|
29
|
Hyperconnectivity of Two Separate Long-Range Cholinergic Systems Contributes to the Reorganization of the Brain Functional Connectivity during Nicotine Withdrawal in Male Mice. eNeuro 2023; 10:ENEURO.0019-23.2023. [PMID: 37295945 PMCID: PMC10306126 DOI: 10.1523/eneuro.0019-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 04/13/2023] [Accepted: 04/25/2023] [Indexed: 06/12/2023] Open
Abstract
Chronic nicotine results in dependence with withdrawal symptoms on discontinuation of use, through desensitization of nicotinic acetylcholine receptors and altered cholinergic neurotransmission. Nicotine withdrawal is associated with increased whole-brain functional connectivity and decreased network modularity; however, the role of cholinergic neurons in those changes is unknown. To identify the contribution of nicotinic receptors and cholinergic regions to changes in the functional network, we analyzed the contribution of the main cholinergic regions to brain-wide activation of the immediate early-gene Fos during withdrawal in male mice and correlated these changes with the expression of nicotinic receptor mRNA throughout the brain. We show that the main functional connectivity modules included the main long-range cholinergic regions, which were highly synchronized with the rest of the brain. However, despite this hyperconnectivity, they were organized into two anticorrelated networks that were separated into basal forebrain-projecting and brainstem-thalamic-projecting cholinergic regions, validating a long-standing hypothesis of the organization of the brain cholinergic systems. Moreover, baseline (without nicotine) expression of Chrna2, Chrna3, Chrna10, and Chrnd mRNA of each brain region correlated with withdrawal-induced changes in Fos expression. Finally, by mining the Allen Brain mRNA expression database, we were able to identify 1755 gene candidates and three pathways (Sox2-Oct4-Nanog, JAK-STAT, and MeCP2-GABA) that may contribute to nicotine withdrawal-induced Fos expression. These results identify the dual contribution of the basal forebrain and brainstem-thalamic cholinergic systems to whole-brain functional connectivity during withdrawal; and identify nicotinic receptors and novel cellular pathways that may be critical for the transition to nicotine dependence.
Collapse
|
30
|
Bioinformatics and network biology approach to identifying type 2 diabetes genes and pathways that influence the progression of breast cancer. Heliyon 2023; 9:e16151. [PMID: 37234659 PMCID: PMC10205526 DOI: 10.1016/j.heliyon.2023.e16151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 04/28/2023] [Accepted: 05/07/2023] [Indexed: 05/28/2023] Open
Abstract
Breast cancer is the second most prevalent malignancy affecting women. Postmenopausal women breast tumor is one of the top causes of death in women, accounting for 23% of cancer cases. Type 2 diabetes, a worldwide pandemic, has been connected to a heightened risk of several malignancies, although its association with breast cancer is still uncertain. In comparison to non-diabetic women, women with T2DM had a 23% elevated likelihood of developing breast cancer. It is difficult to determine causative or genetic susceptibility that connect T2DM and breast cancer. We created a large-scale network-based quantitative approach employing unbiased methods to discover abnormally amplified genes in both T2DM and breast cancer, to solve these issues. We performed transcriptome analysis to uncover identical genetic biomarkers and pathways to clarify the connection between T2DM and breast cancer patients. In this study, two RNA-seq datasets (GSE103001 and GSE86468) from the Gene Expression Omnibus (GEO) are used to identify mutually differentially expressed genes (DEGs) for breast cancer and T2DM, as well as common pathways and prospective medicines. Firstly, 45 shared genes (30 upregulated and 15 downregulated) between T2D and breast cancer were detected. We employed gene ontology and pathway enrichment to characterize prevalent DEGs' molecular processes and signal transduction pathways and observed that T2DM has certain connections to the progression of breast cancer. Using several computational and statistical approaches, we created a protein-protein interactions (PPI) network and revealed hub genes. These hub genes can be potential biomarkers, which may also lead to new therapeutic strategies for investigated diseases. We conducted TF-gene interactions, gene-microRNA interactions, protein-drug interactions, and gene-disease associations to find potential connections between T2DM and breast cancer pathologies. We assume that the potential drugs that emerged from this study could be useful therapeutic values. Researchers, doctors, biotechnologists, and many others may benefit from this research.
Collapse
|
31
|
Influence maximization: Divide and conquer. Phys Rev E 2023; 107:054306. [PMID: 37329077 DOI: 10.1103/physreve.107.054306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 05/03/2023] [Indexed: 06/18/2023]
Abstract
The problem of influence maximization, i.e., finding the set of nodes having maximal influence on a network, is of great importance for several applications. In the past two decades, many heuristic metrics to spot influencers have been proposed. Here, we introduce a framework to boost the performance of such metrics. The framework consists in dividing the network into sectors of influence, and then selecting the most influential nodes within these sectors. We explore three different methodologies to find sectors in a network: graph partitioning, graph hyperbolic embedding, and community structure. The framework is validated with a systematic analysis of real and synthetic networks. We show that the gain in performance generated by dividing a network into sectors before selecting the influential spreaders increases as the modularity and heterogeneity of the network increase. Also, we show that the division of the network into sectors can be efficiently performed in a time that scales linearly with the network size, thus making the framework applicable to large-scale influence maximization problems.
Collapse
|
32
|
Metaboverse enables automated discovery and visualization of diverse metabolic regulatory patterns. Nat Cell Biol 2023; 25:616-625. [PMID: 37012464 PMCID: PMC10104781 DOI: 10.1038/s41556-023-01117-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 02/24/2023] [Indexed: 04/05/2023]
Abstract
Metabolism is intertwined with various cellular processes, including controlling cell fate, influencing tumorigenesis, participating in stress responses and more. Metabolism is a complex, interdependent network, and local perturbations can have indirect effects that are pervasive across the metabolic network. Current analytical and technical limitations have long created a bottleneck in metabolic data interpretation. To address these shortcomings, we developed Metaboverse, a user-friendly tool to facilitate data exploration and hypothesis generation. Here we introduce algorithms that leverage the metabolic network to extract complex reaction patterns from data. To minimize the impact of missing measurements within the network, we introduce methods that enable pattern recognition across multiple reactions. Using Metaboverse, we identify a previously undescribed metabolite signature that correlated with survival outcomes in early stage lung adenocarcinoma patients. Using a yeast model, we identify metabolic responses suggesting an adaptive role of citrate homeostasis during mitochondrial dysfunction facilitated by the citrate transporter, Ctp1. We demonstrate that Metaboverse augments the user's ability to extract meaningful patterns from multi-omics datasets to develop actionable hypotheses.
Collapse
|
33
|
Hyperconnectivity of two separate long-range cholinergic systems contributes to the reorganization of the brain functional connectivity during nicotine withdrawal in male mice. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.29.534836. [PMID: 37034602 PMCID: PMC10081261 DOI: 10.1101/2023.03.29.534836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
Chronic nicotine results in dependence with withdrawal symptoms upon discontinuation of use, through desensitization of nicotinic acetylcholine receptors and altered cholinergic neurotransmission. Nicotine withdrawal is associated with increased whole-brain functional connectivity and decreased network modularity, however, the role of cholinergic neurons in those changes is unknown. To identify the contribution of nicotinic receptors and cholinergic regions to changes in the functional network, we analyzed the contribution of the main cholinergic regions to brain-wide activation of the immediate early-gene FOS during withdrawal in male mice and correlated these changes with the expression of nicotinic receptor mRNA throughout the brain. We show that the main functional connectivity modules included the main long-range cholinergic regions, which were highly synchronized with the rest of the brain. However, despite this hyperconnectivity they were organized into two anticorrelated networks that were separated into basal forebrain projecting and brainstem-thalamic projecting cholinergic regions, validating a long-standing hypothesis of the organization of the brain cholinergic systems. Moreover, baseline (without nicotine) expression of Chrna2 , Chrna3 , Chrna10 , and Chrnd mRNA of each brain region correlated with withdrawal-induced changes in FOS expression. Finally, by mining the Allen Brain mRNA expression database, we were able to identify 1755 gene candidates and three pathways (Sox2-Oct4-Nanog, JAK-STAT, and MeCP2-GABA) that may contribute to nicotine withdrawal-induced FOS expression. These results identify the dual contribution of the basal forebrain and brainstem-thalamic cholinergic systems to whole-brain functional connectivity during withdrawal; and identify nicotinic receptors and novel cellular pathways that may be critical for the transition to nicotine dependence. Significance Statement Discontinuation of nicotine use in dependent users is associated with increased whole-brain activation and functional connectivity and leads to withdrawal symptoms. Here we investigated the contribution of the nicotinic cholinergic receptors and main cholinergic projecting brain areas in the whole-brain changes associated with withdrawal. This not only allowed us to visualize and confirm the previously described duality of the cholinergic brain system using this novel methodology, but also identify nicotinic receptors together with 1751 other genes that contribute, and could thus be targets for treatments against, nicotine withdrawal and dependence.
Collapse
|
34
|
FLT3ITD drives context-specific changes in cell identity and variable interferon dependence during AML initiation. Blood 2023; 141:1442-1456. [PMID: 36395068 PMCID: PMC10082380 DOI: 10.1182/blood.2022016889] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 11/14/2022] [Accepted: 11/15/2022] [Indexed: 11/18/2022] Open
Abstract
Acute myeloid leukemia (AML) initiation requires multiple rate-limiting mutations to cooperatively reprogram progenitor cell identity. For example, FLT3 internal tandem duplication (FLT3ITD) mutations cooperate with a variety of different initiating mutations to reprogram myeloid progenitor fate. These initiating mutations often skew toward either pediatric or adult AML patient populations, though FLT3ITD itself occurs at similar frequencies in both age groups. This raises the question of whether FLT3ITD might induce distinct transcriptional programs and unmask distinct therapeutic vulnerabilities when paired with pediatric, as opposed to adult AML-initiating mutations. To explore this possibility, we compared AML evolution in mice that carried Flt3ITD/NUP98-HOXD13 (NHD13) or Flt3ITD/Runx1DEL mutation pairs, which are respectively most common in pediatric and adult AML. Single-cell analyses and epigenome profiling revealed distinct interactions between Flt3ITD and its cooperating mutations. Whereas Flt3ITD and Flt3ITD/Runx1DEL caused aberrant expansion of myeloid progenitors, Flt3ITD/NHD13 drove the emergence of a pre-AML population that did not resemble normal hematopoietic progenitors. Differences between Flt3ITD/Runx1DEL and Flt3ITD/NHD13 cooperative target gene expression extended to fully transformed AML as well. Flt3ITD/NHD13 cooperative target genes were enriched in human NUP98-translocated AML. Flt3ITD/NHD13 selectively hijacked type I interferon signaling to drive expansion of the pre-AML population. Blocking interferon signaling delayed AML initiation and extended survival. Thus, common AML driver mutations, such as FLT3ITD, can coopt different mechanisms of transformation in different genetic contexts. Furthermore, pediatric-biased NUP98 fusions convey actionable interferon dependence.
Collapse
|
35
|
DRPPM-PATH-SURVEIOR: Plug-and-Play Survival Analysis of Pathway-level Signatures and Immune Components. RESEARCH SQUARE 2023:rs.3.rs-2688545. [PMID: 36993526 PMCID: PMC10055629 DOI: 10.21203/rs.3.rs-2688545/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Pathway-level survival analysis offers the opportunity to examine molecular pathways and immune signatures that influence patient outcomes. However, available survival analysis algorithms are limited in pathway-level function and lack a streamlined analytical process. Here we present a comprehensive pathway-level survival analysis suite, DRPPM-PATH-SURVEIOR, which includes a Shiny user interface with extensive features for systematic exploration of pathways and covariates in a Cox proportional-hazard model. Moreover, our framework offers an integrative strategy for performing Hazard Ratio ranked Gene Set Enrichment Analysis (GSEA) and pathway clustering. As an example, we applied our tool in a combined cohort of melanoma patients treated with checkpoint inhibition (ICI) and identified several immune populations and biomarkers predictive of ICI efficacy. We also analyzed gene expression data of pediatric acute myeloid leukemia (AML) and performed an inverse association of drug targets with the patient's clinical endpoint. Our analysis derived several drug targets in high-risk KMT2A-fusion-positive patients, which were then validated in AML cell lines in the Genomics of Drug Sensitivity database. Altogether, the tool offers a comprehensive suite for pathway-level survival analysis and a user interface for exploring drug targets, molecular features, and immune populations at different resolutions.
Collapse
|
36
|
Differentially expressed genes in systemic sclerosis: Towards predictive medicine with new molecular tools for clinicians. Autoimmun Rev 2023; 22:103314. [PMID: 36918090 DOI: 10.1016/j.autrev.2023.103314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 03/09/2023] [Indexed: 03/13/2023]
Abstract
Systemic sclerosis (SSc) is a rare and chronic autoimmune disease characterized by a pathogenic triad of immune dysregulation, vasculopathy, and progressive fibrosis. Clinical tools commonly used to assess patients, such as the modified Rodnan skin score, difference between limited or diffuse forms of skin involvement, presence of lung, heart or kidney involvement, or of various autoantibodies, are important prognostic factors, but still fail to reflect the large heterogeneity of the disease. SSc treatment options are diverse, ranging from conventional drugs to autologous hematopoietic stem cell transplantation, and predicting response is challenging. Genome-wide technologies, such as high throughput microarray analyses and RNA sequencing, allow accurate, unbiased, and broad assessment of alterations in expression levels of multiple genes. In recent years, many studies have shown robust changes in the gene expression profiles of SSc patients compared to healthy controls, mainly in skin tissues and peripheral blood cells. The objective analysis of molecular patterns in SSc is a powerful tool that can further classify SSc patients with similar clinical phenotypes and help predict response to therapy. In this review, we describe the journey from the first discovery of differentially expressed genes to the identification of enriched pathways and intrinsic subsets identified in SSc, using machine learning algorithms. Finally, we discuss the use of these new tools to predict the efficacy of various treatments, including stem cell transplantation. We suggest that the use of RNA gene expression-based classifications according to molecular subsets may bring us one step closer to precision medicine in Systemic Sclerosis.
Collapse
|
37
|
Protein-metabolite interactomics of carbohydrate metabolism reveal regulation of lactate dehydrogenase. Science 2023; 379:996-1003. [PMID: 36893255 PMCID: PMC10262665 DOI: 10.1126/science.abm3452] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 02/07/2023] [Indexed: 03/11/2023]
Abstract
Metabolic networks are interconnected and influence diverse cellular processes. The protein-metabolite interactions that mediate these networks are frequently low affinity and challenging to systematically discover. We developed mass spectrometry integrated with equilibrium dialysis for the discovery of allostery systematically (MIDAS) to identify such interactions. Analysis of 33 enzymes from human carbohydrate metabolism identified 830 protein-metabolite interactions, including known regulators, substrates, and products as well as previously unreported interactions. We functionally validated a subset of interactions, including the isoform-specific inhibition of lactate dehydrogenase by long-chain acyl-coenzyme A. Cell treatment with fatty acids caused a loss of pyruvate-lactate interconversion dependent on lactate dehydrogenase isoform expression. These protein-metabolite interactions may contribute to the dynamic, tissue-specific metabolic flexibility that enables growth and survival in an ever-changing nutrient environment.
Collapse
|
38
|
Data-independent acquisition phosphoproteomics of urinary extracellular vesicles enables renal cell carcinoma grade differentiation. Mol Cell Proteomics 2023; 22:100536. [PMID: 36997065 PMCID: PMC10165457 DOI: 10.1016/j.mcpro.2023.100536] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 03/01/2023] [Accepted: 03/24/2023] [Indexed: 03/31/2023] Open
Abstract
Translating the research capability and knowledge in cancer signaling into clinical settings has been slow and ineffective. Recently, extracellular vesicles (EVs) have emerged as a promising source for developing disease phosphoprotein markers to monitor disease status. This study focuses on the development of a robust data-independent acquisition (DIA) using mass spectrometry to profile urinary EV phosphoproteomics for renal cell cancer (RCC) grades differentiation. We examined gas-phase fractionated (GPF) library, direct DIA (library-free), forbidden zones, and several different windowing schemes. After the development of a DIA mass spectrometry method for EV phosphoproteomics, we applied the strategy to identify and quantify urinary EV phosphoproteomes from 57 individuals representing low-grade clear cell RCC, high-grade clear cell RCC, chronic kidney disease (CKD), and healthy control (HC) individuals. Urinary EVs were efficiently isolated by functional magnetic beads, and EV phosphopeptides were subsequently enriched by PolyMAC. We quantified 2,584 unique phosphosites and observed that multiple prominent cancer-related pathways, such as ErbB signaling, renal cell carcinoma, and regulation of actin cytoskeleton, were only upregulated in high-grade clear cell RCC. These results show that EV phosphoproteome analysis utilizing our optimized procedure of EV isolation, phosphopeptide enrichment, and DIA method provides a powerful tool for future clinical applications.
Collapse
|
39
|
Single-cell biological network inference using a heterogeneous graph transformer. Nat Commun 2023; 14:964. [PMID: 36810839 PMCID: PMC9944243 DOI: 10.1038/s41467-023-36559-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 02/06/2023] [Indexed: 02/23/2023] Open
Abstract
Single-cell multi-omics (scMulti-omics) allows the quantification of multiple modalities simultaneously to capture the intricacy of complex molecular mechanisms and cellular heterogeneity. Existing tools cannot effectively infer the active biological networks in diverse cell types and the response of these networks to external stimuli. Here we present DeepMAPS for biological network inference from scMulti-omics. It models scMulti-omics in a heterogeneous graph and learns relations among cells and genes within both local and global contexts in a robust manner using a multi-head graph transformer. Benchmarking results indicate DeepMAPS performs better than existing tools in cell clustering and biological network construction. It also showcases competitive capability in deriving cell-type-specific biological networks in lung tumor leukocyte CITE-seq data and matched diffuse small lymphocytic lymphoma scRNA-seq and scATAC-seq data. In addition, we deploy a DeepMAPS webserver equipped with multiple functionalities and visualizations to improve the usability and reproducibility of scMulti-omics data analysis.
Collapse
|
40
|
Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review. Bioengineering (Basel) 2023; 10:bioengineering10020173. [PMID: 36829667 PMCID: PMC9952758 DOI: 10.3390/bioengineering10020173] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 01/24/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open
Abstract
Cancer is a term that denotes a group of diseases caused by the abnormal growth of cells that can spread in different parts of the body. According to the World Health Organization (WHO), cancer is the second major cause of death after cardiovascular diseases. Gene expression can play a fundamental role in the early detection of cancer, as it is indicative of the biochemical processes in tissue and cells, as well as the genetic characteristics of an organism. Deoxyribonucleic acid (DNA) microarrays and ribonucleic acid (RNA)-sequencing methods for gene expression data allow quantifying the expression levels of genes and produce valuable data for computational analysis. This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods. Both conventional and deep learning-based approaches are reviewed, with an emphasis on the application of deep learning models due to their comparative advantages for identifying gene patterns that are distinctive for various types of cancers. Relevant works that employ the most commonly used deep neural network architectures are covered, including multi-layer perceptrons, as well as convolutional, recurrent, graph, and transformer networks. This survey also presents an overview of the data collection methods for gene expression analysis and lists important datasets that are commonly used for supervised machine learning for this task. Furthermore, we review pertinent techniques for feature engineering and data preprocessing that are typically used to handle the high dimensionality of gene expression data, caused by a large number of genes present in data samples. The paper concludes with a discussion of future research directions for machine learning-based gene expression analysis for cancer classification.
Collapse
|
41
|
Identifying Network Biomarkers for Alzheimer's Disease Using Single-Cell RNA Sequencing Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1423:207-214. [PMID: 37525046 DOI: 10.1007/978-3-031-31978-5_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Abstract
System-level network-based approaches are an emerging field in the biomedical domain since biological networks can be used to analyze complicated biological processes and complex human disorders more efficiently. Network biomarkers are groups of interconnected molecular components causing perturbations in the entire network topology that can be used as indicators of pathogenic biological processes when studying a given disease. Although in the last years computational systems-based approaches have gained ground on the path to discovering new network biomarkers, in complex diseases like Alzheimer's disease (AD), this approach has still much to offer. Especially the adoption of single-cell RNA sequencing (scRNA-seq) has now become the dominant technology for the study of stochastic gene expression. Toward this orientation, we propose an R workflow that extracts disease-perturbed subpathways within a pathway network. We construct a gene-gene interaction network integrated with scRNA-seq expression profiles, and after network processing and pruning, the most active subnetworks are isolated from the entire network topology. The proposed methodology was applied on a real AD-based scRNA-seq data, providing already existing and new potential AD biomarkers in gene network context.
Collapse
|
42
|
Differential microRNA profiles in elderly males with seborrheic dermatitis. Sci Rep 2022; 12:21241. [PMID: 36481792 PMCID: PMC9732001 DOI: 10.1038/s41598-022-24383-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 11/15/2022] [Indexed: 12/13/2022] Open
Abstract
Seborrheic dermatitis (SD) is one of the most common skin diseases characterized by inflammatory symptoms and cell proliferation, which has increased incidence in patients older than 50 years. Although the roles of microRNAs (miRNAs) have been investigated in several diseases, miRNA profiles of patients with SD remain unknown. This study aimed to identify differentially expressed miRNAs (DEMs) in lesions of elderly male patients with SD. We used a microarray-based approach to identify DEMs in lesions compared to those in non-lesions of patients with SD. Furthermore, Gene Ontology and pathway enrichment analysis were performed using bioinformatics tools to elucidate the functional significance of the target mRNAs of DEMs in lesions of patients with SD. Expression levels of two miRNAs-hsa-miR-6831-5p and hsa-miR-7107-5p-were downregulated, whereas those of six miRNAs-hsa-miR-20a-5p, hsa-miR-191-5p, hsa-miR-127-3p, hsa-miR-106b-5p, hsa-miR-342-3p, and hsa-miR-6824-5p-were upregulated. Functions of the SD-related miRNAs were predicted to be significantly associated with typical dermatological pathogenesis, such as cell proliferation, cell cycle, apoptosis, and immune regulation. In summary, SD alters the miRNA profile, and target mRNAs of the DEMs are related to immune responses and cell proliferation, which are the two main processes in SD pathogenesis.
Collapse
|
43
|
Excessive HSP70/TLR2 activation leads to remodeling of the tumor immune microenvironment to resist chemotherapy sensitivity of mFOLFOX in colorectal cancer. Clin Immunol 2022; 245:109157. [DOI: 10.1016/j.clim.2022.109157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Revised: 09/12/2022] [Accepted: 10/09/2022] [Indexed: 11/30/2022]
|
44
|
Bioinformatics Prediction and Machine Learning on Gene Expression Data Identifies Novel Gene Candidates in Gastric Cancer. Genes (Basel) 2022; 13:genes13122233. [PMID: 36553500 PMCID: PMC9778573 DOI: 10.3390/genes13122233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 11/21/2022] [Accepted: 11/25/2022] [Indexed: 11/29/2022] Open
Abstract
Gastric cancer (GC) is one of the five most common cancers in the world and unfortunately has a high mortality rate. To date, the pathogenesis and disease genes of GC are unclear, so the need for new diagnostic and prognostic strategies for GC is undeniable. Despite particular findings in this regard, a holistic approach encompassing molecular data from different biological levels for GC has been lacking. To translate Big Data into system-level biomarkers, in this study, we integrated three different GC gene expression data with three different biological networks for the first time and captured biologically significant (i.e., reporter) transcripts, hub proteins, transcription factors, and receptor molecules of GC. We analyzed the revealed biomolecules with independent RNA-seq data for their diagnostic and prognostic capabilities. While this holistic approach uncovered biomolecules already associated with GC, it also revealed novel system biomarker candidates for GC. Classification performances of novel candidate biomarkers with machine learning approaches were investigated. With this study, AES, CEBPZ, GRK6, HPGDS, SKIL, and SP3 were identified for the first time as diagnostic and/or prognostic biomarker candidates for GC. Consequently, we have provided valuable data for further experimental and clinical efforts that may be useful for the diagnosis and/or prognosis of GC.
Collapse
|
45
|
Hepatitis C Core Protein Induces a Genotype-Specific Susceptibility of Hepatocytes to TNF-Induced Death In Vitro and In Vivo. Viruses 2022; 14:v14112521. [PMID: 36423130 PMCID: PMC9692671 DOI: 10.3390/v14112521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 11/01/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
Hepatitis C virus (HCV) core protein is a multifunctional protein that is involved in the proliferation, inflammation, and apoptosis mechanism of hepatocytes. HCV core protein genetic variability has been implicated in various outcomes of HCV pathology and treatment. In the present study, we aimed to analyze the role of the HCV core protein in tumor necrosis factor α (TNFα)-induced death under the viewpoint of HCV genetic variability. Immortalized hepatocytes (IHH), and not the Huh 7.5 hepatoma cell line, stably expressing HCV subtype 4a and HCV subtype 4f core proteins showed that only the HCV 4a core protein could increase sensitivity to TNFα-induced death. Development of two transgenic mice expressing the two different core proteins under the liver-specific promoter of transthyretin (TTR) allowed for the in vivo assessment of the role of the core in TNFα-induced death. Using the TNFα-dependent model of lipopolysaccharide/D-galactosamine (LPS/Dgal), we were able to recapitulate the in vitro results in IHH cells in vivo. Transgenic mice expressing the HCV 4a core protein were more susceptible to the LPS/Dgal model, while mice expressing the HCV 4f core protein had the same susceptibility as their littermate controls. Transcriptome analysis in liver biopsies from these transgenic mice gave insights into HCV core molecular pathogenesis while linking HCV core protein genetic variability to differential pathology in vivo.
Collapse
|
46
|
Yiqi Huayu decoction alleviates bleomycin-induced pulmonary fibrosis in rats by inhibiting senescence. Front Pharmacol 2022; 13:1033919. [PMID: 36386126 PMCID: PMC9649452 DOI: 10.3389/fphar.2022.1033919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 10/17/2022] [Indexed: 01/10/2023] Open
Abstract
Overview: In treating pulmonary fibrosis (PF), traditional Chinese medicine (TCM) has received much attention, but its mechanism is unclear. The pharmacological mechanisms of TCM can be explored through network pharmacology. However, due to its virtual screening properties, it still needs to be verified by in vitro or in vivo experiments. Therefore, we investigated the anti-PF mechanism of Yiqi Huayu Decoction (YHD) by combining network pharmacology with in vivo experiments. Methods: Firstly, we used classical bleomycin (BLM)-induced rat model of PF and administrated fibrotic rats with YHD (low-, medium-, and high-dose). We comprehensively assessed the treatment effect of YHD according to body weight, lung coefficient, lung function, and histopathologic examination. Second, we predict the potential targets by ultra-high-performance liquid chromatography-tandem mass spectrometry (UHPLC-MS/MS) combined with network pharmacology. In brief, we obtained the chemical ingredients of YHD based on the UHPLC-MS/MS and TCMSP database. We collected drug targets from TCMSP, HERB, and Swiss target prediction databases based on active ingredients. Disease targets were acquired from drug libraries, Genecards, HERB, and TTD databases. The intersecting targets of drugs and disease were screened out. The STRING database can obtain protein-protein interaction (PPI) networks and hub target proteins. Molecular Complex Detection (MCODE) clustering analysis combined with enrichment analysis can explore the possible biological mechanisms of YHD. Enrichment analyses were conducted through the R package and the David database, including the Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), and Reactome. Then, we further validated the target genes and target proteins predicted by network pharmacology. Protein and gene expression detection by immunohistochemistry, Western blot (WB), and real-time quantitative PCR (rt-qPCR). Results: The results showed that high-dose YHD effectively attenuated BLM-induced lung injury and fibrosis in rats, as evidenced by improved lung function, relief of inflammatory response, and reduced collagen deposition. We screened nine core targets and cellular senescence pathways by UHPLC-MS/MS analysis and network pharmacology. We subsequently validated key targets of cellular senescence signaling pathways. WB and rt-qPCR indicated that high-dose YHD decreased protein and gene expression of senescence-related markers, including p53 (TP53), p21 (CDKN1A), and p16 (CDKN2A). Increased reactive oxygen species (ROS) are upstream triggers of the senescence program. The senescence-associated secretory phenotypes (SASPs), containing interleukin 6 (IL-6), tumor necrosis factor-alpha (TNF-α), and transforming growth factor-β1 (TGF-β1), can further exacerbate the progression of senescence. High-dose YHD inhibited ROS production in lung tissue and consistently reduced the SASPs expression in serum. Conclusion: Our study suggests that YHD improves lung pathological injury and lung function in PF rats. This protective effect may be related to the ability of YHD to inhibit cellular senescence.
Collapse
|
47
|
Applying differential network analysis to longitudinal gene expression in response to perturbations. Front Genet 2022; 13:1026487. [PMID: 36324501 PMCID: PMC9618823 DOI: 10.3389/fgene.2022.1026487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 10/03/2022] [Indexed: 11/17/2022] Open
Abstract
Differential Network (DN) analysis is a method that has long been used to interpret changes in gene expression data and provide biological insights. The method identifies the rewiring of gene networks in response to external perturbations. Our study applies the DN method to the analysis of RNA-sequencing (RNA-seq) time series datasets. We focus on expression changes: (i) in saliva of a human subject after pneumococcal vaccination (PPSV23) and (ii) in primary B cells treated ex vivo with a monoclonal antibody drug (Rituximab). The DN method enabled us to identify the activation of biological pathways consistent with the mechanisms of action of the PPSV23 vaccine and target pathways of Rituximab. The community detection algorithm on the DN revealed clusters of genes characterized by collective temporal behavior. All saliva and some B cell DN communities showed characteristic time signatures, outlining a chronological order in pathway activation in response to the perturbation. Moreover, we identified early and delayed responses within network modules in the saliva dataset and three temporal patterns in the B cell data.
Collapse
|
48
|
Function-based classification of hazardous biological sequences: Demonstration of a new paradigm for biohazard assessments. Front Bioeng Biotechnol 2022; 10:979497. [PMID: 36277394 PMCID: PMC9585941 DOI: 10.3389/fbioe.2022.979497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 08/31/2022] [Indexed: 12/04/2022] Open
Abstract
Bioengineering applies analytical and engineering principles to identify functional biological building blocks for biotechnology applications. While these building blocks are leveraged to improve the human condition, the lack of simplistic, machine-readable definition of biohazards at the function level is creating a gap for biosafety practices. More specifically, traditional safety practices focus on the biohazards of known pathogens at the organism-level and may not accurately consider novel biodesigns with engineered functionalities at the genetic component-level. This gap is motivating the need for a paradigm shift from organism-centric procedures to function-centric biohazard identification and classification practices. To address this challenge, we present a novel methodology for classifying biohazards at the individual sequence level, which we then compiled to distinguish the biohazardous property of pathogenicity at the whole genome level. Our methodology is rooted in compilation of hazardous functions, defined as a set of sequences and associated metadata that describe coarse-level functions associated with pathogens (e.g., adherence, immune subversion). We demonstrate that the resulting database can be used to develop hazardous “fingerprints” based on the functional metadata categories. We verified that these hazardous functions are found at higher levels in pathogens compared to non-pathogens, and hierarchical clustering of the fingerprints can distinguish between these two groups. The methodology presented here defines the hazardous functions associated with bioengineering functional building blocks at the sequence level, which provide a foundational framework for classifying biological hazards at the organism level, thus leading to the improvement and standardization of current biosecurity and biosafety practices.
Collapse
|
49
|
Novel clinical, molecular and bioinformatics insights into the genetic background of autism. Hum Genomics 2022; 16:39. [PMID: 36117207 PMCID: PMC9482726 DOI: 10.1186/s40246-022-00415-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 09/12/2022] [Indexed: 11/10/2022] Open
Abstract
Background Clinical classification of autistic patients based on current WHO criteria provides a valuable but simplified depiction of the true nature of the disorder. Our goal is to determine the biology of the disorder and the ASD-associated genes that lead to differences in the severity and variability of clinical features, which can enhance the ability to predict clinical outcomes. Method Novel Whole Exome Sequencing data from children (n = 33) with ASD were collected along with extended cognitive and linguistic assessments. A machine learning methodology and a literature-based approach took into consideration known effects of genetic variation on the translated proteins, linking them with specific ASD clinical manifestations, namely non-verbal IQ, memory, attention and oral language deficits. Results Linear regression polygenic risk score results included the classification of severe and mild ASD samples with a 81.81% prediction accuracy. The literature-based approach revealed 14 genes present in all sub-phenotypes (independent of severity) and others which seem to impair individual ones, highlighting genetic profiles specific to mild and severe ASD, which concern non-verbal IQ, memory, attention and oral language skills. Conclusions These genes can potentially contribute toward a diagnostic gene-set for determining ASD severity. However, due to the limited number of patients in this study, our classification approach is mostly centered on the prediction and verification of these genes and does not hold a diagnostic nature per se. Substantial further experimentation is required to validate their role as diagnostic markers. The use of these genes as input for functional analysis highlights important biological processes and bridges the gap between genotype and phenotype in ASD.
Collapse
|
50
|
Application of proteomics in shrimp and shrimp aquaculture. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY. PART D, GENOMICS & PROTEOMICS 2022; 43:101015. [PMID: 35870418 DOI: 10.1016/j.cbd.2022.101015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 07/11/2022] [Accepted: 07/11/2022] [Indexed: 06/15/2023]
Abstract
Since proteins play an important role in the life of an organism, many researchers are now looking at how genes and proteins interact to form different proteins. It is anticipated that the creation of adequate tools for rapid analysis of proteins will accelerate the determination of functional aspects of these biomolecules and develop new biomarkers and therapeutic targets for the diagnosis and treatment of various diseases. Though shrimp contains high-quality marine proteins, there are reports about the heavy losses to the shrimp industry due to the poor quality of shrimp production and many times due to mass mortality also. Frequent outbreaks of diseases, water pollution, and quality of feed are some of the most recognized reasons for such losses. In the seafood export market, shrimp occupies the top position in currency earnings and strengthens the economy of many developing nations. Therefore, it is vital for shrimp-producing companies they produce healthy shrimp with high-quality protein. Though aquaculture is a very competitive market, global awareness regarding the use of scientific knowledge and emerging technologies to obtain better-farmed organisms through sustainable production has enhanced the importance of proteomics in seafood biology research. Proteomics, as a powerful tool, has therefore been increasingly used to address several issues in shrimp aquaculture. In the present paper, efforts have been made to address some of them, particularly the role of proteomics in reproduction, breeding and spawning, immunological responses and disease resistance capacity, nutrition and health, microbiome and probiotics, quality and safety of shrimp production, bioinformatics applications in proteomics, the discovery of protein biomarkers, and mitigating biotic and abiotic stresses. Future challenges and research directions on proteomics in shrimp aquaculture have also been discussed.
Collapse
|