1
|
NetActivity enhances transcriptional signals by combining gene expression into robust gene set activity scores through interpretable autoencoders. Nucleic Acids Res 2024; 52:e44. [PMID: 38597610 PMCID: PMC11109970 DOI: 10.1093/nar/gkae197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 01/23/2024] [Accepted: 03/12/2024] [Indexed: 04/11/2024] Open
Abstract
Grouping gene expression into gene set activity scores (GSAS) provides better biological insights than studying individual genes. However, existing gene set projection methods cannot return representative, robust, and interpretable GSAS. We developed NetActivity, a machine learning framework that generates GSAS based on a sparsely-connected autoencoder, where each neuron in the inner layer represents a gene set. We proposed a three-tier training that yielded representative, robust, and interpretable GSAS. NetActivity model was trained with 1518 GO biological processes terms and KEGG pathways and all GTEx samples. NetActivity generates GSAS robust to the initialization parameters and representative of the original transcriptome, and assigned higher importance to more biologically relevant genes. Moreover, NetActivity returns GSAS with a more consistent definition and higher interpretability than GSVA and hipathia, state-of-the-art gene set projection methods. Finally, NetActivity enables combining bulk RNA-seq and microarray datasets in a meta-analysis of prostate cancer progression, highlighting gene sets related to cell division, key for disease progression. When applied to metastatic prostate cancer, gene sets associated with cancer progression were also altered due to drug resistance, while a classical enrichment analysis identified gene sets irrelevant to the phenotype. NetActivity is publicly available in Bioconductor and GitHub.
Collapse
|
2
|
Genetically predicted hypotaurine levels mediate the relationship between immune cells and intracerebral hemorrhage. Int Immunopharmacol 2024; 132:112049. [PMID: 38608476 DOI: 10.1016/j.intimp.2024.112049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 04/03/2024] [Accepted: 04/06/2024] [Indexed: 04/14/2024]
Abstract
The evidence supports a strong link between immune cells and intracerebral hemorrhage (ICH). Nonetheless, the specific cause-and-effect associations between immune cells and ICH remain indeterminate. Here, our primary investigation compared immune cell infiltration in the ICH and sham groups using the GSE24265 dataset. Afterward, we extensively examined the relationship between immune cells and ICH by applying a two-sample Mendelian randomization (MR) analysis to identify the particular immune cells that may be associated with the initiation and advancement of ICH. Nevertheless, the specific processes that regulate the cause-and-effect connection between immune cells and ICH remain unknown. In this study, our objective was to investigate the connections between immune cell characteristics and plasma metabolites, as well as the links between plasma components and ICH. Our investigation uncovered that the levels of hypotaurine play a key role in the advancement of ICH, influencing the ratio of switched memory B cells among lymphocytes. Thus, our findings provide novel insights into the potential biological mechanisms underlying immune cell-mediated ICH.
Collapse
|
3
|
Drug Repurposing: A Leading Strategy for New Threats and Targets. ACS Pharmacol Transl Sci 2024; 7:915-932. [PMID: 38633585 PMCID: PMC11019736 DOI: 10.1021/acsptsci.3c00361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/01/2024] [Accepted: 03/06/2024] [Indexed: 04/19/2024]
Abstract
Less than 6% of rare illnesses have an appropriate treatment option. Repurposed medications for new indications are a cost-effective and time-saving strategy that results in excellent success rates, which may significantly lower the risk associated with therapeutic development for rare illnesses. It is becoming a realistic alternative to repurposing "conventional" medications to treat joint and rare diseases considering the significant failure rates, high expenses, and sluggish stride of innovative medication advancement. This is due to delisted compounds, cheaper research fees, and faster development time frames. Repurposed drug competitors have been developed using strategic decisions based on data analysis, interpretation, and investigational approaches, but technical and regulatory restrictions must also be considered. Combining experimental and computational methodologies generates innovative new medicinal applications. It is a one-of-a-kind strategy for repurposing human-safe pharmaceuticals to treat uncommon and difficult-to-treat ailments. It is a very effective method for discovering and creating novel medications. Several pharmaceutical firms have developed novel therapies by repositioning old medications. Repurposing drugs is practical, cost-effective, and speedy and generally involves lower risks when compared to developing a new drug from the beginning.
Collapse
|
4
|
Luteolin Protects Against 6-Hydoroxydopamine-Induced Cell Death via an Upregulation of HRD1 and SEL1L. Neurochem Res 2024; 49:117-128. [PMID: 37632637 PMCID: PMC10776467 DOI: 10.1007/s11064-023-04019-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 08/28/2023]
Abstract
Parkinson's Disease (PD) is caused by many factors and endoplasmic reticulum (ER) stress is considered as one of the responsible factors for it. ER stress induces the activation of the ubiquitin-proteasome system to degrade unfolded proteins and suppress cell death. The ubiquitin ligase 3-hydroxy-3-methylglutaryl-coenzyme A reductase degradation 1 (HRD1) and its stabilizing molecule, the suppressor/enhancer lin-12-like (SEL1L), can suppress the ER stress via the ubiquitin-proteasome system, and that HRD1 can also suppress cell death in familial and nonfamilial PD models. These findings indicate that HRD1 and SEL1L might be key proteins for the treatment of PD. Our study aimed to identify the compounds with the effects of upregulating the HRD1 expression and suppressing neuronal cell death in a 6-hydroxydopamine (6-OHDA)-induced cellular PD model. Our screening by the Drug Gene Budger, a drug repositioning tool, identified luteolin as a candidate compound for the desired modulation of the HRD1 expression. Subsequently, we confirmed that low concentrations of luteolin did not show cytotoxicity in SH-SY5Y cells, and used these low concentrations in the subsequent experiments. Next, we demonsrated that luteolin increased HRD1 and SEL1L mRNA levels and protein expressions. Furthermore, luteolin inhibited 6-OHDA-induced cell death and suppressed ER stress response caused by exposure to 6-OHDA. Finally, luteolin did not reppress 6-OHDA-induced cell death when expression of HRD1 or SEL1L was suppressed by RNA interference. These findings suggest that luteolin might be a novel therapeutic agent for PD due to its ability to suppress ER stress through the activation of HRD1 and SEL1L.
Collapse
|
5
|
TidyGEO: preparing analysis-ready datasets from Gene Expression Omnibus. J Integr Bioinform 2023; 0:jib-2023-0021. [PMID: 38047898 DOI: 10.1515/jib-2023-0021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 11/20/2023] [Indexed: 12/05/2023] Open
Abstract
TidyGEO is a Web-based tool for downloading, tidying, and reformatting data series from Gene Expression Omnibus (GEO). As a freely accessible repository with data from over 6 million biological samples across more than 4000 organisms, GEO provides diverse opportunities for secondary research. Although scientists may find assay data relevant to a given research question, most analyses require sample-level annotations. In GEO, such annotations are stored alongside assay data in delimited, text-based files. However, the structure and semantics of the annotations vary widely from one series to another, and many annotations are not useful for analysis purposes. Thus, every GEO series must be tidied before it is analyzed. Manual approaches may be used, but these are error prone and take time away from other research tasks. Custom computer scripts can be written, but many scientists lack the computational expertise to create such scripts. To address these challenges, we created TidyGEO, which supports essential data-cleaning tasks for sample-level annotations, such as selecting informative columns, renaming columns, splitting or merging columns, standardizing data values, and filtering samples. Additionally, users can integrate annotations with assay data, restructure assay data, and generate code that enables others to reproduce these steps.
Collapse
|
6
|
De novo drug design based on patient gene expression profiles via deep learning. Mol Inform 2023; 42:e2300064. [PMID: 37475603 DOI: 10.1002/minf.202300064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 05/25/2023] [Accepted: 07/20/2023] [Indexed: 07/22/2023]
Abstract
Computational de novo drug design is a challenging issue in medicine, and it is desirable to consider all of the relevant information of the biological systems in a disease state. Here, we propose a novel computational method to generate drug candidate molecular structures from patient gene expression profiles via deep learning, which we call DRAGONET. Our model can generate new molecules that are likely to counteract disease-specific gene expression patterns in patients, which is made possible by exploring the latent space constructed by a transformer-based variational autoencoder and integrating the substructures of disease-correlated molecules. We applied DRAGONET to generate drug candidate molecules for gastric cancer, atopic dermatitis, and Alzheimer's disease, and demonstrated that the newly generated molecules were chemically similar to registered drugs for each disease. This approach is applicable to diseases with unknown therapeutic target proteins and will make a significant contribution to the field of precision medicine.
Collapse
|
7
|
Signatures of Co-Deregulated Genes and Their Transcriptional Regulators in Kidney Cancers. Int J Mol Sci 2023; 24:ijms24076577. [PMID: 37047552 PMCID: PMC10094846 DOI: 10.3390/ijms24076577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Revised: 03/14/2023] [Accepted: 03/29/2023] [Indexed: 04/05/2023] Open
Abstract
There are several studies on the deregulated gene expression profiles in kidney cancer, with varying results depending on the tumor histology and other parameters. None of these, however, have identified the networks that the co-deregulated genes (co-DEGs), across different studies, create. Here, we reanalyzed 10 Gene Expression Omnibus (GEO) studies to detect and annotate co-deregulated signatures across different subtypes of kidney cancer or in single-gene perturbation experiments in kidney cancer cells and/or tissue. Using a systems biology approach, we aimed to decipher the networks they form along with their upstream regulators. Differential expression and upstream regulators, including transcription factors [MYC proto-oncogene (MYC), CCAAT enhancer binding protein delta (CEBPD), RELA proto-oncogene, NF-kB subunit (RELA), zinc finger MIZ-type containing 1 (ZMIZ1), negative elongation factor complex member E (NELFE) and Kruppel-like factor 4 (KLF4)] and protein kinases [Casein kinase 2 alpha 1 (CSNK2A1), mitogen-activated protein kinases 1 (MAPK1) and 14 (MAPK14), Sirtuin 1 (SIRT1), Cyclin dependent kinases 1 (CDK1) and 4 (CDK4), Homeodomain interacting protein kinase 2 (HIPK2) and Extracellular signal-regulated kinases 1 and 2 (ERK1/2)], were computed using the Characteristic Direction, as well as GEO2Enrichr and X2K, respectively, and further subjected to GO and KEGG pathways enrichment analyses. Furthermore, using CMap, DrugMatrix and the LINCS L1000 chemical perturbation databases, we highlight putative repurposing drugs, including Etoposide, Haloperidol, BW-B70C, Triamterene, Chlorphenesin, BRD-K79459005 and β-Estradiol 3-benzoate, among others, that may reverse the expression of the identified co-DEGs in kidney cancers. Of these, the cytotoxic effects of Etoposide, Catecholamine, Cyclosporin A, BW-B70C and Lasalocid sodium were validated in vitro. Overall, we identified critical co-DEGs across different subtypes in kidney cancer, and our results provide an innovative framework for their potential use in the future.
Collapse
|
8
|
Deregulated Gene Expression Profiles and Regulatory Networks in Adult and Pediatric RUNX1/RUNX1T1-Positive AML Patients. Cancers (Basel) 2023; 15:cancers15061795. [PMID: 36980682 PMCID: PMC10046396 DOI: 10.3390/cancers15061795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023] Open
Abstract
Acute myeloid leukemia (AML) is a heterogeneous and complex disease concerning molecular aberrations and prognosis. RUNX1/RUNX1T1 is a fusion oncogene that results from the chromosomal translocation t(8;21) and plays a crucial role in AML. However, its impact on the transcriptomic profile of different age groups of AML patients is not completely understood. Here, we investigated the deregulated gene expression (DEG) profiles in adult and pediatric RUNX1/RUNX1T1-positive AML patients, and compared their functions and regulatory networks. We retrospectively analyzed gene expression data from two independent Gene Expression Omnibus (GEO) datasets (GSE37642 and GSE75461) and computed their differentially expressed genes and upstream regulators, using limma, GEO2Enrichr, and X2K. For validation purposes, we used the TCGA-LAML (adult) and TARGET-AML (pediatric) patient cohorts. We also analyzed the protein–protein interaction (PPI) networks, as well as those composed of transcription factors (TF), intermediate proteins, and kinases foreseen to regulate the top deregulated genes in each group. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enrichment analyses were further performed for the DEGs in each dataset. We found that the top upregulated genes in (both adult and pediatric) RUNX1/RUNX1T1-positive AML patients are enriched in extracellular matrix organization, the cell projection membrane, filopodium membrane, and supramolecular fiber. Our data corroborate that RUNX1/RUNX1T1 reprograms a large transcriptional network to establish and maintain leukemia via intricate PPI interactions and kinase-driven phosphorylation events.
Collapse
|
9
|
GAS6-AS1, a long noncoding RNA, functions as a key candidate gene in atrial fibrillation related stroke determined by ceRNA network analysis and WGCNA. BMC Med Genomics 2023; 16:51. [PMID: 36894947 PMCID: PMC9996875 DOI: 10.1186/s12920-023-01478-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 03/03/2023] [Indexed: 03/11/2023] Open
Abstract
BACKGROUND Stroke attributable to atrial fibrillation (AF related stroke, AFST) accounts for 13 ~ 26% of ischemic stroke. It has been found that AFST patients have a higher risk of disability and mortality than those without AF. Additionally, it's still a great challenge to treat AFST patients because its exact mechanism at the molecular level remains unclear. Thus, it's vital to investigate the mechanism of AFST and search for molecular targets of treatment. Long non-coding RNAs (lncRNAs) are related to the pathogenesis of various diseases. However, the role of lncRNAs in AFST remains unclear. In this study, AFST-related lncRNAs are explored using competing endogenous RNA (ceRNA) network analysis and weighted gene co-expression network analysis (WGCNA). METHODS GSE66724 and GSE58294 datasets were downloaded from GEO database. After data preprocessing and probe reannotation, differentially expressed lncRNAs (DELs) and differentially expressed mRNAs (DEMs) between AFST and AF samples were explored. Then, functional enrichment analysis and protein-protein interaction (PPI) network analysis of the DEMs were performed. At the meantime, ceRNA network analysis and WGCNA were performed to identify hub lncRNAs. The hub lncRNAs identified both by ceRNA network analysis and WGCNA were further validated by Comparative Toxicogenomics Database (CTD). RESULTS In all, 19 DELs and 317 DEMs were identified between the AFST and AF samples. Functional enrichment analysis suggested that the DEMs associated with AFST were mainly enriched in the activation of the immune response. Two lncRNAs which overlapped between the three lncRNAs identified by the ceRNA network analysis and the 28 lncRNAs identified by the WGCNA were screened as hub lncRNAs for further validation. Finally, lncRNA GAS6-AS1 turned out to be associated with AFST by CTD validation. CONCLUSION These findings suggested that low expression of GAS6-AS1 might exert an essential role in AFST through downregulating its downstream target mRNAs GOLGA8A and BACH2, and GAS6-AS1 might be a potential target for AFST therapy.
Collapse
|
10
|
Macrophage migration inhibitory factor (MIF) inhibitor iSO-1 promotes staphylococcal protein A-induced osteogenic differentiation by inhibiting NF-κB signaling pathway. Int Immunopharmacol 2023; 115:109600. [PMID: 36577150 DOI: 10.1016/j.intimp.2022.109600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 12/02/2022] [Accepted: 12/12/2022] [Indexed: 12/27/2022]
Abstract
BACKGROUND Osteomyelitis is among the most difficult to treat diseases in the field of orthopedics, and there is a lack of effective treatment modalities. Exploring the mechanisms of its development is beneficial for finding molecular targets for treatment. Increasing evidence suggests that macrophage migration inhibitory factor (MIF), as a proinflammatory mediator, is not only involved in various pathophysiological processes of inflammation but also plays an important role in osteogenic differentiation, while its specific regulatory mechanism in osteomyelitis remains unclear. METHODS In the present study, staphylococcal protein A (SPA)-treated rat bone marrow mesenchymal stem cells (rBMSCs) were used to construct cell models of osteomyelitis. Rat and cell models of osteomyelitis were used to validate the expression levels of MIF, and to further explore the regulatory mechanisms of the MIF inhibitor methyl ester of (S, R)-3-(4-hydroxyphenyl)-4,5-dihydro-5-isoxazole acetic acid (iSO-1) and MIF knockdown on cell model of osteomyelitis toward osteogenic differentiation. RESULTS We found that the expression level of MIF was upregulated in rat and cell models of osteomyelitis and subsequently demonstrated by the GSE30119 dataset that the expression level of MIF was also significantly upregulated in patients with osteomyelitis. Furthermore, SPA promotes MIF expression in rBMSCs while inhibiting the expression of osteogenic-related genes such as Runt-related transcription factor 2 (RUNX2), osteocalcin (OCN), osteopontin (OPN) and collagen type-1 (COL-1) through activation of the nuclear factor kappa-B (NF-κB) pathway. In vivo, we further demonstrated that local injection of iSO-1 significantly increased the osteogenic activity in rat model of osteomyelitis. Importantly, we also demonstrated that MIF knockdown and the MIF inhibitor iSO-1 reversed the SPA-mediated inhibition of osteogenic differentiation of rBMSCs by inhibiting the activation of the NF-κB pathway, as evidenced by the upregulation of osteogenic-related gene expression and enhanced bone mineralization. CONCLUSION ISO-1 and MIF knockdown can reverse the SPA-mediated inhibition of osteogenic differentiation in the rBMSCs model of osteomyelitis by inhibiting the NF-κB signaling pathway, providing a potential target for the treatment of osteomyelitis.
Collapse
|
11
|
Loss of NF1 in Melanoma Confers Sensitivity to SYK Kinase Inhibition. Cancer Res 2023; 83:316-331. [PMID: 36409827 PMCID: PMC9845987 DOI: 10.1158/0008-5472.can-22-0883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 09/21/2022] [Accepted: 11/15/2022] [Indexed: 11/22/2022]
Abstract
Neurofibromin 1 (NF1) loss of function (LoF) mutations are frequent in melanoma and drive hyperactivated RAS and tumor growth. NF1LoF melanoma cells, however, do not show consistent sensitivity to individual MEK, ERK, or PI3K/mTOR inhibitors. To identify more effective therapeutic strategies for treating NF1LoF melanoma, we performed a targeted kinase inhibitor screen. A tool compound named MTX-216 was highly effective in blocking NF1LoF melanoma growth in vitro and in vivo. Single-cell analysis indicated that drug-induced cytotoxicity was linked to effective cosuppression of proliferation marker Ki-67 and ribosomal protein S6 phosphorylation. The antitumor efficacy of MTX-216 was dependent on its ability to inhibit not only PI3K, its nominal target, but also SYK. MTX-216 suppressed expression of a group of genes that regulate mitochondrial electron transport chain and are associated with poor survival in patients with NF1LoF melanoma. Furthermore, combinations of inhibitors targeting either MEK or PI3K/mTOR with an independent SYK kinase inhibitor or SYK knockdown reduced the growth of NF1LoF melanoma cells. These studies provide a path to exploit SYK dependency to selectively target NF1LoF melanoma cells. SIGNIFICANCE A kinase inhibitor screen identifies SYK as a targetable vulnerability in melanoma cells with NF1 loss of function.
Collapse
|
12
|
Identification of Potential Diagnostic Genes of HIV-Infected Immunological Non-Responders on Bioinformatics Analysis. J Inflamm Res 2023; 16:1555-1570. [PMID: 37082297 PMCID: PMC10112482 DOI: 10.2147/jir.s396055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 03/28/2023] [Indexed: 04/22/2023] Open
Abstract
Purpose HIV-infected immunological non-responders (INRs) failed to achieve the normalization of CD4+ T cell counts despite their undetectable viral load. INRs have an increased risk of clinical progressions of Acquired Immunodeficiency Syndrome (AIDS) and non-AIDS events, accompanied by higher mortality rates than immunological responders (IRs). This study aimed to discover the genes, which help to distinguish INRs from IRs and explore the possible mechanism of INRs. Methods Screening DEGs between INRs and IRs using GEO microarray dataset GSE143742. DEG biological functions were investigated using GO and KEGG analysis. DEGs and WGCNA linked modules were intersected to find common genes. Key genes were identified using SVM-RFE and LASSO regression models. ROC analysis was done to evaluate key gene diagnostic effectiveness using GEO database dataset GSE106792. Cytoscape created a miRNA-mRNA-TF network for diagnostic genes. CIBERSORT and flow cytometry examined the INRs and IRs immune microenvironments. In 10 INR and 10 IR clinical samples, diagnostic gene expression was verified by RT-qPCR and Western blot. Results We obtained 190 DEGs between the INR group and IR group. Functional enrichment analysis found a significant enrichment in mitochondria and apoptosis-related pathways. CD69 and ZNF207 were identified as potential diagnostic genes. CD69 and ZNF207 shared a transcription factor, NCOR1, in the miRNA-mRNA-TF network. Immune microenvironment analysis by CIBERSORT showed that IRs had a higher level of resting memory CD4+ T cells, lower level of activated memory CD4+ T cells and resting dendritic cells than INRs, as confirmed by flow cytometry analysis. In addition, CD69 and ZNF207 were correlated with immune cells. Experiments confirmed the expression of the diagnostic genes in INRs and IRs. Conclusion CD69 and ZNF207 were identified as potential diagnostic genes to discriminate INRs from IRs. Our findings offered new clues to diagnostic and therapeutic targets for INRs.
Collapse
|
13
|
Saracatinib, a Selective Src Kinase Inhibitor, Blocks Fibrotic Responses in Preclinical Models of Pulmonary Fibrosis. Am J Respir Crit Care Med 2022; 206:1463-1479. [PMID: 35998281 PMCID: PMC9757097 DOI: 10.1164/rccm.202010-3832oc] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 08/23/2022] [Indexed: 12/24/2022] Open
Abstract
Rationale: Idiopathic pulmonary fibrosis (IPF) is a chronic, progressive, and often fatal disorder. Two U.S. Food and Drug Administration-approved antifibrotic drugs, nintedanib and pirfenidone, slow the rate of decline in lung function, but responses are variable and side effects are common. Objectives: Using an in silico data-driven approach, we identified a robust connection between the transcriptomic perturbations in IPF disease and those induced by saracatinib, a selective Src kinase inhibitor originally developed for oncological indications. Based on these observations, we hypothesized that saracatinib would be effective at attenuating pulmonary fibrosis. Methods: We investigated the antifibrotic efficacy of saracatinib relative to nintedanib and pirfenidone in three preclinical models: 1) in vitro in normal human lung fibroblasts; 2) in vivo in bleomycin and recombinant Ad-TGF-β (adenovirus transforming growth factor-β) murine models of pulmonary fibrosis; and 3) ex vivo in mice and human precision-cut lung slices from these two murine models as well as patients with IPF and healthy donors. Measurements and Main Results: In each model, the effectiveness of saracatinib in blocking fibrogenic responses was equal or superior to nintedanib and pirfenidone. Transcriptomic analyses of TGF-β-stimulated normal human lung fibroblasts identified specific gene sets associated with fibrosis, including epithelial-mesenchymal transition, TGF-β, and WNT signaling that was uniquely altered by saracatinib. Transcriptomic analysis of whole-lung extracts from the two animal models of pulmonary fibrosis revealed that saracatinib reverted many fibrogenic pathways, including epithelial-mesenchymal transition, immune responses, and extracellular matrix organization. Amelioration of fibrosis and inflammatory cascades in human precision-cut lung slices confirmed the potential therapeutic efficacy of saracatinib in human lung fibrosis. Conclusions: These studies identify novel Src-dependent fibrogenic pathways and support the study of the therapeutic effectiveness of saracatinib in IPF treatment.
Collapse
|
14
|
Navigating Transcriptomic Connectivity Mapping Workflows to Link Chemicals with Bioactivities. Chem Res Toxicol 2022; 35:1929-1949. [PMID: 36301716 PMCID: PMC10483698 DOI: 10.1021/acs.chemrestox.2c00245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Screening new compounds for potential bioactivities against cellular targets is vital for drug discovery and chemical safety. Transcriptomics offers an efficient approach for assessing global gene expression changes, but interpreting chemical mechanisms from these data is often challenging. Connectivity mapping is a potential data-driven avenue for linking chemicals to mechanisms based on the observation that many biological processes are associated with unique gene expression signatures (gene signatures). However, mining the effects of a chemical on gene signatures for biological mechanisms is challenging because transcriptomic data contain thousands of noisy genes. New connectivity mapping approaches seeking to distinguish signal from noise continue to be developed, spurred by the promise of discovering chemical mechanisms, new drugs, and disease targets from burgeoning transcriptomic data. Here, we analyze these approaches in terms of diverse transcriptomic technologies, public databases, gene signatures, pattern-matching algorithms, and statistical evaluation criteria. To navigate the complexity of connectivity mapping, we propose a harmonized scheme to coherently organize and compare published workflows. We first standardize concepts underlying transcriptomic profiles and gene signatures based on various transcriptomic technologies such as microarrays, RNA-Seq, and L1000 and discuss the widely used data sources such as Gene Expression Omnibus, ArrayExpress, and MSigDB. Next, we generalize connectivity mapping as a pattern-matching task for finding similarity between a query (e.g., transcriptomic profile for new chemical) and a reference (e.g., gene signature of known target). Published pattern-matching approaches fall into two main categories: vector-based use metrics like correlation, Jaccard index, etc., and aggregation-based use parametric and nonparametric statistics (e.g., gene set enrichment analysis). The statistical methods for evaluating the performance of different approaches are described, along with comparisons reported in the literature on benchmark transcriptomic data sets. Lastly, we review connectivity mapping applications in toxicology and offer guidance on evaluating chemical-induced toxicity with concentration-response transcriptomic data. In addition to serving as a high-level guide and tutorial for understanding and implementing connectivity mapping workflows, we hope this review will stimulate new algorithms for evaluating chemical safety and drug discovery using transcriptomic data.
Collapse
|
15
|
Systematic tissue annotations of genomics samples by modeling unstructured metadata. Nat Commun 2022; 13:6736. [PMID: 36347858 PMCID: PMC9643451 DOI: 10.1038/s41467-022-34435-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 10/25/2022] [Indexed: 11/10/2022] Open
Abstract
There are currently >1.3 million human -omics samples that are publicly available. This valuable resource remains acutely underused because discovering particular samples from this ever-growing data collection remains a significant challenge. The major impediment is that sample attributes are routinely described using varied terminologies written in unstructured natural language. We propose a natural-language-processing-based machine learning approach (NLP-ML) to infer tissue and cell-type annotations for genomics samples based only on their free-text metadata. NLP-ML works by creating numerical representations of sample descriptions and using these representations as features in a supervised learning classifier that predicts tissue/cell-type terms. Our approach significantly outperforms an advanced graph-based reasoning annotation method (MetaSRA) and a baseline exact string matching method (TAGGER). Model similarities between related tissues demonstrate that NLP-ML models capture biologically-meaningful signals in text. Additionally, these models correctly classify tissue-associated biological processes and diseases based on their text descriptions alone. NLP-ML models are nearly as accurate as models based on gene-expression profiles in predicting sample tissue annotations but have the distinct capability to classify samples irrespective of the genomics experiment type based on their text metadata. Python NLP-ML prediction code and trained tissue models are available at https://github.com/krishnanlab/txt2onto .
Collapse
|
16
|
Identification of Potential Repurposable Drugs in Alzheimer’s Disease Exploiting a Bioinformatics Analysis. J Pers Med 2022; 12:jpm12101731. [PMID: 36294870 PMCID: PMC9605472 DOI: 10.3390/jpm12101731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/07/2022] [Accepted: 10/10/2022] [Indexed: 11/17/2022] Open
Abstract
Alzheimer’s disease (AD) is a neurologic disorder causing brain atrophy and the death of brain cells. It is a progressive condition marked by cognitive and behavioral impairment that significantly interferes with daily activities. AD symptoms develop gradually over many years and eventually become more severe, and no cure has been found yet to arrest this process. The present study is directed towards suggesting putative novel solutions and paradigms for fighting AD pathogenesis by exploiting new insights from network medicine and drug repurposing strategies. To identify new drug–AD associations, we exploited SAveRUNNER, a recently developed network-based algorithm for drug repurposing, which quantifies the vicinity of disease-associated genes to drug targets in the human interactome. We complemented the analysis with an in silico validation of the candidate compounds through a gene set enrichment analysis, aiming to determine if the modulation of the gene expression induced by the predicted drugs could be counteracted by the modulation elicited by the disease. We identified some interesting compounds belonging to the beta-blocker family, originally approved for treating hypertension, such as betaxolol, bisoprolol, and metoprolol, whose connection with a lower risk to develop Alzheimer’s disease has already been observed. Moreover, our algorithm predicted multi-kinase inhibitors such as regorafenib, whose beneficial effects were recently investigated for neuroinflammation and AD pathology, and mTOR inhibitors such as sirolimus, whose modulation has been associated with AD.
Collapse
|
17
|
Novel insights into host responses to Japanese Encephalitis Virus infection: Reanalysis of public transcriptome and microRNAome datasets. Virus Res 2022; 320:198887. [PMID: 35953004 DOI: 10.1016/j.virusres.2022.198887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 08/07/2022] [Accepted: 08/08/2022] [Indexed: 10/15/2022]
Abstract
PURPOSE Japanese encephalitis (JE), caused by the Japanese encephalitis virus (JEV), is the principal cause of viral encephalitis in South-East Asian and Western Pacific countries; accounting for 68,000 cases, and up to 20,400 fatalities, annually across the world. Despite being a high-risk condition, there is no specific treatment for JE. Given rapid additions in genomics databases and the power of data reanalysis in addressing critical medical questions, the present study was designed to identify novel host factors that might have potential roles in JEV infection. METHODS We extracted microarray and RNA-Seq data sets from NCBI-GEO and compared mock and JEV-infected samples. Raw data from all the studies were re-analyzed to identify host factors associated with JEV replication. RESULTS We identified several coding and non-coding host factors that had no prior known role in viral infections. Of these, the coding transcripts: Myosin Heavy Chain 10 (MYH10), Progestin and AdipoQ Receptor Family Member 8 (PAQR8), and the microRNAs: hsa-miR-193b-5p, hsa-miR-3714 and hsa-miR-513a-5p were found to be novel host factors deregulated during JEV infection. MYH10 encodes a conventional non-muscle myosin, and mutations in MYH10 have been shown to cause neurological defects. PAQR8 has been associated with epilepsy, which exhibits symptoms similar to JEV infection. JE is a neuro-degenerative disease, and the known involvement of MYH10 and PAQR8 in neurological disorders strongly indicates potential roles of these host factors in JEV infection. Additionally, we observed that MYH10 and PAQR8 had a significant negative correlation with Activating transcription factor 3 (ATF3), which is a previously validated modulator of JEV infection. ATF3 is a transcription factor that binds to the promotors of genes encoding other transcription factors or interferon-stimulated genes and negatively regulates host antiviral responses during JE. CONCLUSION Our findings demonstrate the significance of data reanalysis in the identification of novel host factors that may become targets for diagnosis/ therapy against viral diseases of major concern, such as, JE. The deregulated coding and non-coding transcripts identified in this study need further experimental analysis for validation.
Collapse
|
18
|
Identification of potential hub genes of gastric cancer. Medicine (Baltimore) 2022; 101:e30741. [PMID: 36254003 PMCID: PMC9575828 DOI: 10.1097/md.0000000000030741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Gastric cancer (GC) is a malignant tumor originated from gastric mucosa epithelium. It is the third leading cause of cancer mortality in China. The early symptoms are not obvious. When it is discovered, it has developed to the advanced stage, and the prognosis is poor. In order to screen for potential genes for GC development, this study obtained GSE118916 and GSE109476 from the gene expression omnibus (GEO) database for bioinformatics analysis. METHODS First, GEO2R was used to identify differentially expressed genes (DEG) and the functional annotation of DEGs was performed by gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. The Search Tool for the Retrieval of Interacting Genes (STRING) tool was used to construct protein-protein interaction (PPI) network and the most important modules and hub genes were mined. Real time quantitative polymerase chain reaction assay was performed to verify the expression level of hub genes. RESULTS A total of 139 DEGs were identified. The functional changes of DEGs are mainly concentrated in the cytoskeleton, extracellular matrix and collagen synthesis. Eleven genes were identified as core genes. Bioinformatics analysis shows that the core genes are mainly enriched in many processes related to cell adhesion and collagen. CONCLUSION In summary, the DEGs and hub genes found in this study may be potential diagnostic and therapeutic targets.
Collapse
|
19
|
Systematic transcriptome analysis reveals molecular mechanisms and indications of bupleuri radix. Front Pharmacol 2022; 13:1010520. [PMID: 36304143 PMCID: PMC9592978 DOI: 10.3389/fphar.2022.1010520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 09/29/2022] [Indexed: 11/13/2022] Open
Abstract
Pharmacogenomic analysis based on drug transcriptomic signatures is widely used to identify mechanisms of action and pharmacological indications. Despite accumulating reports on the efficacy of medicinal herbs, related transcriptome-level analyses are lacking. The aim of the present study was to elucidate the underlying molecular mechanisms of action of Bupleuri Radix (BR), a widely used herbal medicine, through a systematic transcriptomic analysis. We analyzed the drug-responsive transcriptome profiling of A549 lung cancer cell line after treating them with multiple doses of BR water (W-BR) and ethanol (E-BR) extracts and their phytochemicals. In vitro validation experiments were performed using both A549 and the immortalized human keratinocyte line HaCaT. Pathway enrichment analysis revealed the anti-cancer effects of BR treatment via inhibition of cell proliferation and induction of apoptosis. Enhanced cell adhesion and migration were observed with the W-BR but not with the E-BR. Comparison with a disease signature database validated an indication of the W-BR for skin disorders. Moreover, W-BR treatment showed the wound-healing effect in skin and lung cells. The main active ingredients of BR showed only the anti-cancer effect of the E-BR and not the wound healing effect of the W-BR, suggesting the need for research on minor ingredients of BR.
Collapse
|
20
|
Signatures of Co-Deregulated Genes and Their Transcriptional Regulators in Lung Cancer. Int J Mol Sci 2022; 23:ijms231810933. [PMID: 36142846 PMCID: PMC9504879 DOI: 10.3390/ijms231810933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 09/13/2022] [Indexed: 11/24/2022] Open
Abstract
Despite the significant progress made towards comprehending the deregulated signatures in lung cancer, these vary from study to study. We reanalyzed 25 studies from the Gene Expression Omnibus (GEO) to detect and annotate co-deregulated signatures in lung cancer and in single-gene or single-drug perturbation experiments. We aimed to decipher the networks that these co-deregulated genes (co-DEGs) form along with their upstream regulators. Differential expression and upstream regulators were computed using Characteristic Direction and Systems Biology tools, including GEO2Enrichr and X2K. Co-deregulated gene expression profiles were further validated across different molecular and immune subtypes in lung adenocarcinoma (TCGA-LUAD) and lung adenocarcinoma (TCGA-LUSC) datasets, as well as using immunohistochemistry data from the Human Protein Atlas, before being subjected to subsequent GO and KEGG enrichment analysis. The functional alterations of the co-upregulated genes in lung cancer were mostly related to immune response regulating the cell surface signaling pathway, in contrast to the co-downregulated genes, which were related to S-nitrosylation. Networks of hub proteins across the co-DEGs consisted of overlapping TFs (SOX2, MYC, KAT2A) and kinases (MAPK14, CSNK2A1 and CDKs). Furthermore, using Connectivity Map we highlighted putative repurposing drugs, including valproic acid, betonicine and astemizole. Similarly, we analyzed the co-DEG signatures in single-gene and single-drug perturbation experiments in lung cancer cell lines. In summary, we identified critical co-DEGs in lung cancer providing an innovative framework for their potential use in developing personalized therapeutic strategies.
Collapse
|
21
|
Exploring Novel Innovation Strategies to Close a Technology Gap in Neurosurgery: The HORAO Crowdsourcing Campaign (Preprint). J Med Internet Res 2022; 25:e42723. [PMID: 37115612 PMCID: PMC10182462 DOI: 10.2196/42723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 02/14/2023] [Accepted: 03/12/2023] [Indexed: 03/14/2023] Open
Abstract
BACKGROUND Scientific research is typically performed by expert individuals or groups who investigate potential solutions in a sequential manner. Given the current worldwide exponential increase in technical innovations, potential solutions for any new problem might already exist, even though they were developed to solve a different problem. Therefore, in crowdsourcing ideation, a research question is explained to a much larger group of individuals beyond the specialist community to obtain a multitude of diverse, outside-the-box solutions. These are then assessed in parallel by a group of experts for their capacity to solve the new problem. The 2 key problems in brain tumor surgery are the difficulty of discerning the exact border between a tumor and the surrounding brain, and the difficulty of identifying the function of a specific area of the brain. Both problems could be solved by a method that visualizes the highly organized fiber tracts within the brain; the absence of fibers would reveal the tumor, whereas the spatial orientation of the tracts would reveal the area's function. To raise awareness about our challenge of developing a means of intraoperative, real-time, noninvasive identification of fiber tracts and tumor borders to improve neurosurgical oncology, we turned to the crowd with a crowdsourcing ideation challenge. OBJECTIVE Our objective was to evaluate the feasibility of a crowdsourcing ideation campaign for finding novel solutions to challenges in neuroscience. The purpose of this paper is to introduce our chosen crowdsourcing method and discuss it in the context of the current literature. METHODS We ran a prize-based crowdsourcing ideation competition called HORAO on the commercial platform HeroX. Prize money previously collected through a crowdfunding campaign was offered as an incentive. Using a multistage approach, an expert jury first selected promising technical solutions based on broad, predefined criteria, coached the respective solvers in the second stage, and finally selected the winners in a conference setting. We performed a postchallenge web-based survey among the solvers crowd to find out about their backgrounds and demographics. RESULTS Our web-based campaign reached more than 20,000 people (views). We received 45 proposals from 32 individuals and 7 teams, working in 26 countries on 4 continents. The postchallenge survey revealed that most of the submissions came from single solvers or teams working in engineering or the natural sciences, with additional submissions from other nonmedical fields. We engaged in further exchanges with 3 out of the 5 finalists and finally initiated a successful scientific collaboration with the winner of the challenge. CONCLUSIONS This open innovation competition is the first of its kind in medical technology research. A prize-based crowdsourcing ideation campaign is a promising strategy for raising awareness about a specific problem, finding innovative solutions, and establishing new scientific collaborations beyond strictly disciplinary domains.
Collapse
|
22
|
Text-based experiment retrieval in genomic databases. J Inf Sci 2022. [DOI: 10.1177/01655515221118670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
With the growing number of genomic data in public repositories, efficient search methodologies have become a basic need to reach the relevant genomic data. However, this need cannot be fulfilled with the current repositories because they offer a limited search option which is a lexical matching of textual descriptions or metadata of the experiments. This technique is insufficient to get the required information needed to detect similarities between experiments within a large data collection. Due to the limitation of the existing repositories, in this study, we develop a text-based experiment retrieval framework by using both lexical and semantic similarity approaches to find similarities between experiments, and their retrieval performance was compared. This study is the first attempt to use text-driven semantic analysis approaches for developing a retrieval framework for experiments. An empirical study was conducted on a large textual description of Arabidopsis microarray experiments from the Gene Expression Omnibus database. In the proposed model, Jaccard similarity was used as a lexical similarity approach; Latent Semantic Analysis, Probabilistic Latent Semantic Analysis and Latent Dirichlet allocation were used as semantic similarity approaches to detect similarities between the textual descriptions of the experiments. According to the experimental results, relevant experiments can be retrieved successfully by text-driven semantic similarity approaches compared with the lexical similarity approach.
Collapse
|
23
|
OMiCC: An expanded and enhanced platform for meta-analysis of public gene expression data. STAR Protoc 2022; 3:101474. [PMID: 35880119 PMCID: PMC9307621 DOI: 10.1016/j.xpro.2022.101474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
OMiCC (OMics Compendia Commons) is a biologist-friendly web platform that facilitates data reuse and integration. Users can search over 40,000 publicly available gene expression studies, annotate and curate samples, and perform meta-analysis. Since the initial publication, we have incorporated RNA-seq datasets, compendia sharing, RESTful API support, and an additional meta-analysis method based on random effects. Here, we provide a step-by-step guide for using OMiCC. For complete details on the use and execution of this protocol, please refer to Shah et al. (2016). OMiCC (OMics Compendia Commons) is a free web-based tool for gene expression data reuse Search publicly available studies to perform sample group comparisons to explore a disease In meta-analysis, multiple studies are combined to identify coherent signals OMiCC supports crowd-sharing and users can share their own analyses with the community
Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics.
Collapse
|
24
|
Connecting omics signatures and revealing biological mechanisms with iLINCS. Nat Commun 2022; 13:4678. [PMID: 35945222 PMCID: PMC9362980 DOI: 10.1038/s41467-022-32205-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 07/21/2022] [Indexed: 11/21/2022] Open
Abstract
There are only a few platforms that integrate multiple omics data types, bioinformatics tools, and interfaces for integrative analyses and visualization that do not require programming skills. Here we present iLINCS ( http://ilincs.org ), an integrative web-based platform for analysis of omics data and signatures of cellular perturbations. The platform facilitates mining and re-analysis of the large collection of omics datasets (>34,000), pre-computed signatures (>200,000), and their connections, as well as the analysis of user-submitted omics signatures of diseases and cellular perturbations. iLINCS analysis workflows integrate vast omics data resources and a range of analytics and interactive visualization tools into a comprehensive platform for analysis of omics signatures. iLINCS user-friendly interfaces enable execution of sophisticated analyses of omics signatures, mechanism of action analysis, and signature-driven drug repositioning. We illustrate the utility of iLINCS with three use cases involving analysis of cancer proteogenomic signatures, COVID 19 transcriptomic signatures and mTOR signaling.
Collapse
|
25
|
Getting Started with LINCS Datasets and Tools. Curr Protoc 2022; 2:e487. [PMID: 35876555 PMCID: PMC9326873 DOI: 10.1002/cpz1.487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The Library of Integrated Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that aimed to expand our knowledge about human cellular responses to chemical, genetic, and microenvironment perturbations. Responses to perturbations were measured by transcriptomics, proteomics, cellular imaging, and other high content assays. The second phase of the LINCS program, which lasted 7 years, involved the engagement of six data and signature generation centers (DSGCs) and one data coordination and integration center (DCIC). The DSGCs and the DCIC developed several digital resources, including tools, databases, and workflows that aim to facilitate the use of the LINCS data and integrate this data with other publicly available data. The digital resources developed by the DSGCs and the DCIC can be used to gain new biological and pharmacological insights that can lead to the development of novel therapeutics. This protocol provides step-by-step instructions for processing the LINCS data into signatures, and utilizing the digital resources developed by the LINCS consortia for hypothesis generation and knowledge discovery. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Navigating L1000 tools and data in CLUE.io Basic Protocol 2: Computing signatures from the L1000 data with the CD method Basic Protocol 3: Analyzing lists of differentially expressed genes and querying them against the L1000 data with BioJupies and the Bulk RNA-seq Appyter Basic Protocol 4: Utilizing the L1000FWD resource for drug discovery Basic Protocol 5: KINOMEscan and the KINOMEscan Appyter Basic Protocol 6: LINCS P100 and GCP Proteomics Assays Basic Protocol 7: The LINCS Joint Project (LJP) Basic Protocol 8: The LINCS Data Portals and SigCom LINCS Basic Protocol 9: Creating and analyzing signatures with iLINCS.
Collapse
|
26
|
Genes and Diseases: Insights from Transcriptomics Studies. Genes (Basel) 2022; 13:genes13071168. [PMID: 35885950 PMCID: PMC9317567 DOI: 10.3390/genes13071168] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/13/2022] [Accepted: 06/23/2022] [Indexed: 01/25/2023] Open
Abstract
Results of expression studies can be useful to clarify the genotype-phenotype relationship. However, according to data from recent literature, there is a large group of genes that are revealed as differentially expressed (DE) in many studies, regardless of the biological context. Additional analyses could shed more light on the relationships between genes, their differential expression, and diseases. We generated a set of 9972 disease genes from five gene-phenotype databases (OMIM, ORPHANET, DDG2P, DisGeNet and MalaCards) and a report of the International Union of Immunological Societies. To study transcriptomics of disease and non-disease genes in healthy tissues, we obtained data from the Human Protein Atlas (HPA) website. We analyzed the dependency between expression in healthy tissues and gene occurrence in Gene Expression Omnibus series using tools within the Enrichr libraries. The results of expression studies were annotated with Gene Ontology (GO) and Human Phenotype Ontology (HPO) terms. Using transcriptomics analysis of healthy tissues, we validated the previous findings of higher expression levels of disease genes in pathologically linked tissues compared to other tissues. Preferentially DE genes were generally highly expressed in one or multiple tissues and were enriched for disease genes. According to the results of GO enrichment analyses, both down- and up-regulated DE genes most often took part in immune response, translation and tissue-specific processes. A connection between DE-related pathology and the diversity of HPO terms was found. Investigating a link between expression and phenotype contributes to understanding the mode of development and progression of human diseases.
Collapse
|
27
|
From drug repositioning to target repositioning: prediction of therapeutic targets using genetically perturbed transcriptomic signatures. Bioinformatics 2022; 38:i68-i76. [PMID: 35758779 PMCID: PMC9235496 DOI: 10.1093/bioinformatics/btac240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Motivation A critical element of drug development is the identification of therapeutic targets for diseases. However, the depletion of therapeutic targets is a serious problem. Results In this study, we propose the novel concept of target repositioning, an extension of the concept of drug repositioning, to predict new therapeutic targets for various diseases. Predictions were performed by a trans-disease analysis which integrated genetically perturbed transcriptomic signatures (knockdown of 4345 genes and overexpression of 3114 genes) and disease-specific gene transcriptomic signatures of 79 diseases. The trans-disease method, which takes into account similarities among diseases, enabled us to distinguish the inhibitory from activatory targets and to predict the therapeutic targetability of not only proteins with known target–disease associations but also orphan proteins without known associations. Our proposed method is expected to be useful for understanding the commonality of mechanisms among diseases and for therapeutic target identification in drug discovery. Availability and implementation Supplemental information and software are available at the following website [http://labo.bio.kyutech.ac.jp/~yamani/target_repositioning/]. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
28
|
CF-Seq, an accessible web application for rapid re-analysis of cystic fibrosis pathogen RNA sequencing studies. Sci Data 2022; 9:343. [PMID: 35710652 PMCID: PMC9203545 DOI: 10.1038/s41597-022-01431-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 05/25/2022] [Indexed: 01/13/2023] Open
Abstract
Researchers studying cystic fibrosis (CF) pathogens have produced numerous RNA-seq datasets which are available in the gene expression omnibus (GEO). Although these studies are publicly available, substantial computational expertise and manual effort are required to compare similar studies, visualize gene expression patterns within studies, and use published data to generate new experimental hypotheses. Furthermore, it is difficult to filter available studies by domain-relevant attributes such as strain, treatment, or media, or for a researcher to assess how a specific gene responds to various experimental conditions across studies. To reduce these barriers to data re-analysis, we have developed an R Shiny application called CF-Seq, which works with a compendium of 128 studies and 1,322 individual samples from 13 clinically relevant CF pathogens. The application allows users to filter studies by experimental factors and to view complex differential gene expression analyses at the click of a button. Here we present a series of use cases that demonstrate the application is a useful and efficient tool for new hypothesis generation. (CF-Seq: http://scangeo.dartmouth.edu/CFSeq/ ).
Collapse
|
29
|
SigCom LINCS: data and metadata search engine for a million gene expression signatures. Nucleic Acids Res 2022; 50:W697-W709. [PMID: 35524556 PMCID: PMC9252724 DOI: 10.1093/nar/gkac328] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 04/04/2022] [Accepted: 04/20/2022] [Indexed: 12/13/2022] Open
Abstract
Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO. SigCom LINCS is built with Signature Commons, a cloud-agnostic skeleton Data Commons with a focus on serving searchable signatures. SigCom LINCS provides a rapid signature similarity search for mimickers and reversers given sets of up and down genes, a gene set, a single gene, or any search term. Additionally, users of SigCom LINCS can perform a metadata search to find and analyze subsets of signatures and find information about genes and drugs. SigCom LINCS is findable, accessible, interoperable, and reusable (FAIR) with metadata linked to standard ontologies and vocabularies. In addition, all the data and signatures within SigCom LINCS are available via a well-documented API. In summary, SigCom LINCS, available at https://maayanlab.cloud/sigcom-lincs, is a rich webserver resource for accelerating drug and target discovery in systems pharmacology.
Collapse
|
30
|
Deep Learning Applications for the accurate identification of low-transcriptional activity drugs and their mechanism of actions. Pharmacol Res 2022; 180:106225. [PMID: 35452801 DOI: 10.1016/j.phrs.2022.106225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/07/2022] [Accepted: 04/15/2022] [Indexed: 10/18/2022]
Abstract
Analysis of drug-induced expression profiles facilitated comprehensive understanding of drug properties. However, many compounds exhibit weak transcription responses though they mostly possess definite pharmacological effects. Actually, as a representative example, over 66.4% of 312,438 molecular signatures in the Library of Integrated Cellular Signatures (LINCS) database exhibit low-transcriptional activities (i.e. TAS-low signatures). When computing the association between TAS-low signatures with shared mechanism of actions (MOAs), commonly used algorithms showed inadequate performance with an average area under receiver operating characteristic curve (AUROC) of 0.55, but the computation accuracy of the same task can be improved by our developed tool Genetic profile activity relationship (GPAR) with an average AUROC of 0.68. Up to 36 out of 74 TAS-low MOAs were well trained with AUROC≥0.7 by GPAR, higher than those by other approaches. Further studies showed that GPAR benefited from the size of training samples more significantly than other approaches. Lastly, in biological validation of the MOA prediction for a TAS-low drug Tropisetron, we found an unreported mechanism that Tropisetron can bind to the glucocorticoid receptor. This study indicated that GPAR can serve as an effective approach for the accurate identification of low-transcriptional activity drugs and their MOAs, thus providing a good tool for drug repurposing with both TAS-low and TAS-high signatures.
Collapse
|
31
|
Bipartite network models to design combination therapies in acute myeloid leukaemia. Nat Commun 2022; 13:2128. [PMID: 35440130 PMCID: PMC9018865 DOI: 10.1038/s41467-022-29793-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 03/30/2022] [Indexed: 12/20/2022] Open
Abstract
Combination therapy is preferred over single-targeted monotherapies for cancer treatment due to its efficiency and safety. However, identifying effective drug combinations costs time and resources. We propose a method for identifying potential drug combinations by bipartite network modelling of patient-related drug response data, specifically the Beat AML dataset. The median of cell viability is used as a drug potency measurement to reconstruct a weighted bipartite network, model drug-biological sample interactions, and find the clusters of nodes inside two projected networks. Then, the clustering results are leveraged to discover effective multi-targeted drug combinations, which are also supported by more evidence using GDSC and ALMANAC databases. The potency and synergy levels of selective drug combinations are corroborated against monotherapy in three cell lines for acute myeloid leukaemia in vitro. In this study, we introduce a nominal data mining approach to improving acute myeloid leukaemia treatment through combinatorial therapy. Identifying effective drug combinations to treat cancer is a challenging task, either experimentally or computationally. Here, the authors develop a bipartite network modelling approach to propose drug combination strategies in acute myeloid leukaemia using patient and cell line drug screening data.
Collapse
|
32
|
PharmOmics: A species- and tissue-specific drug signature database and gene-network-based drug repositioning tool. iScience 2022; 25:104052. [PMID: 35345455 PMCID: PMC8957031 DOI: 10.1016/j.isci.2022.104052] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 01/29/2022] [Accepted: 03/08/2022] [Indexed: 12/29/2022] Open
Abstract
Drug development has been hampered by a high failure rate in clinical trials due to our incomplete understanding of drug functions across organs and species. Therefore, elucidating species- and tissue-specific drug functions can provide insights into therapeutic efficacy, potential adverse effects, and interspecies differences necessary for effective translational medicine. Here, we present PharmOmics, a drug knowledgebase and analytical tool that is hosted on an interactive web server. Using tissue- and species-specific transcriptome data from human, mouse, and rat curated from different databases, we implemented a gene-network-based approach for drug repositioning. We demonstrate the potential of PharmOmics to retrieve known therapeutic drugs and identify drugs with tissue toxicity using in silico performance assessment. We further validated predicted drugs for nonalcoholic fatty liver disease in mice. By combining tissue- and species-specific in vivo drug signatures with gene networks, PharmOmics serves as a complementary tool to support drug characterization and network-based medicine. Development of PharmOmics, a platform for drug repositioning and toxicity prediction Contains >18000 species/tissue-specific gene signatures for 941 drugs and chemicals Benchmarked and validated network-based drug repositioning and toxicity prediction PharmOmics is freely accessible via an online web server to facilitate user access
Collapse
|
33
|
|
34
|
Guidelines for bioinformatics of single-cell sequencing data analysis in Alzheimer's disease: review, recommendation, implementation and application. Mol Neurodegener 2022; 17:17. [PMID: 35236372 PMCID: PMC8889402 DOI: 10.1186/s13024-022-00517-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Accepted: 01/18/2022] [Indexed: 12/13/2022] Open
Abstract
Alzheimer's disease (AD) is the most common form of dementia, characterized by progressive cognitive impairment and neurodegeneration. Extensive clinical and genomic studies have revealed biomarkers, risk factors, pathways, and targets of AD in the past decade. However, the exact molecular basis of AD development and progression remains elusive. The emerging single-cell sequencing technology can potentially provide cell-level insights into the disease. Here we systematically review the state-of-the-art bioinformatics approaches to analyze single-cell sequencing data and their applications to AD in 14 major directions, including 1) quality control and normalization, 2) dimension reduction and feature extraction, 3) cell clustering analysis, 4) cell type inference and annotation, 5) differential expression, 6) trajectory inference, 7) copy number variation analysis, 8) integration of single-cell multi-omics, 9) epigenomic analysis, 10) gene network inference, 11) prioritization of cell subpopulations, 12) integrative analysis of human and mouse sc-RNA-seq data, 13) spatial transcriptomics, and 14) comparison of single cell AD mouse model studies and single cell human AD studies. We also address challenges in using human postmortem and mouse tissues and outline future developments in single cell sequencing data analysis. Importantly, we have implemented our recommended workflow for each major analytic direction and applied them to a large single nucleus RNA-sequencing (snRNA-seq) dataset in AD. Key analytic results are reported while the scripts and the data are shared with the research community through GitHub. In summary, this comprehensive review provides insights into various approaches to analyze single cell sequencing data and offers specific guidelines for study design and a variety of analytic directions. The review and the accompanied software tools will serve as a valuable resource for studying cellular and molecular mechanisms of AD, other diseases, or biological systems at the single cell level.
Collapse
|
35
|
Novel roles of phentolamine in protecting axon myelination, muscle atrophy, and functional recovery following nerve injury. Sci Rep 2022; 12:3344. [PMID: 35228612 PMCID: PMC8885794 DOI: 10.1038/s41598-022-07253-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2020] [Accepted: 02/08/2022] [Indexed: 11/25/2022] Open
Abstract
Incomplete functional recovery after peripheral nerve injury (PNI) often results in devastating physical disabilities in human patients. Despite improved progress in surgical and non-surgical approaches, achieving complete functional recovery following PNI remains a challenge. This study demonstrates that phentolamine may hold a significant promise in treating nerve injuries and denervation induced muscle atrophy following PNI. In a sciatic nerve crush injury mouse model, we found that phentolamine treatment enhanced motor and functional recovery, protected axon myelination, and attenuated injury-induced muscle atrophy in mice at 14 days post-injury (dpi) compared to saline treatment. In the soleus of phentolamine treated animals, we observed the downregulation of phosphorylated signal transducer and activator of transcription factor 3 (p-STAT3) as well as muscle atrophy-related genes Myogenin, muscle ring finger 1 (MuRF-1), and Forkhead box O proteins (FoxO1, FoxO3). Our results show that both nerve and muscle recovery are integral components of phentolamine treatment-induced global functional recovery in mice at 14 dpi. Moreover, phentolamine treatment improved locomotor functional recovery in the mice after spinal cord crush (SCC) injury. The fact that phentolamine is an FDA approved non-selective alpha-adrenergic blocker, clinically prescribed for oral anesthesia reversal, hypertension, and erectile dysfunction makes this drug a promising candidate for repurposing in restoring behavioral recovery following PNI and SCC injuries, axonal neuropathy, and muscle wasting disorders.
Collapse
|
36
|
An in silico analysis identifies drugs potentially modulating the cytokine storm triggered by SARS-CoV-2 infection. Sci Rep 2022; 12:1626. [PMID: 35102208 PMCID: PMC8803893 DOI: 10.1038/s41598-022-05597-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 01/11/2022] [Indexed: 12/12/2022] Open
Abstract
The ongoing COVID-19 pandemic is one of the biggest health challenges of recent decades. Among the causes of mortality triggered by SARS-CoV-2 infection, the development of an inflammatory "cytokine storm" (CS) plays a determinant role. Here, we used transcriptomic data from the bronchoalveolar lavage fluid (BALF) of COVID-19 patients undergoing a CS to obtain gene-signatures associated to this pathology. Using these signatures, we interrogated the Connectivity Map (CMap) dataset that contains the effects of over 5000 small molecules on the transcriptome of human cell lines, and looked for molecules which effects on transcription mimic or oppose those of the CS. As expected, molecules that potentiate immune responses such as PKC activators are predicted to worsen the CS. In addition, we identified the negative regulation of female hormones among pathways potentially aggravating the CS, which helps to understand the gender-related differences in COVID-19 mortality. Regarding drugs potentially counteracting the CS, we identified glucocorticoids as a top hit, which validates our approach as this is the primary treatment for this pathology. Interestingly, our analysis also reveals a potential effect of MEK inhibitors in reverting the COVID-19 CS, which is supported by in vitro data that confirms the anti-inflammatory properties of these compounds.
Collapse
|
37
|
Predictive Modeling of Short-Term Rockburst for the Stability of Subsurface Structures Using Machine Learning Approaches: t-SNE, K-Means Clustering and XGBoost. MATHEMATICS 2022. [DOI: 10.3390/math10030449] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Accurate prediction of short-term rockburst has a significant role in improving the safety of workers in mining and geotechnical projects. The rockburst occurrence is nonlinearly correlated with its influencing factors that guarantee imprecise predicting results by employing the traditional methods. In this study, three approaches including including t-distributed stochastic neighbor embedding (t-SNE), K-means clustering, and extreme gradient boosting (XGBoost) were employed to predict the short-term rockburst risk. A total of 93 rockburst patterns with six influential features from micro seismic monitoring events of the Jinping-II hydropower project in China were used to create the database. The original data were randomly split into training and testing sets with a 70/30 splitting ratio. The prediction practice was followed in three steps. Firstly, a state-of-the-art data reduction mechanism t-SNE was employed to reduce the exaggeration of the rockburst database. Secondly, an unsupervised machine learning, i.e., K-means clustering, was adopted to categorize the t-SNE dataset into various clusters. Thirdly, a supervised gradient boosting machine learning method i.e., XGBoost was utilized to predict various levels of short-term rockburst database. The classification accuracy of XGBoost was checked using several performance indices. The results of the proposed model serve as a great benchmark for future short-term rockburst levels prediction with high accuracy.
Collapse
|
38
|
Downregulation of NAGLU in VEC Increases Abnormal Accumulation of Lysosomes and Represents a Predictive Biomarker in Early Atherosclerosis. Front Cell Dev Biol 2022; 9:797047. [PMID: 35155448 PMCID: PMC8826576 DOI: 10.3389/fcell.2021.797047] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 12/27/2021] [Indexed: 12/23/2022] Open
Abstract
Cardiovascular diseases (CVDs), predominantly caused by atherosclerosis (AS), are the leading cause of mortality worldwide. Although a great number of previous studies have attempted to reveal the molecular mechanism of AS, the underlying mechanism has not been fully elucidated. The aberrant expression profiling of vascular endothelial cells (VECs) gene in early atherosclerosis (EAS) was analyzed according to the dataset (GSE132651) downloaded from the Gene Expression Omnibus (GEO) database. We primarily performed functional annotation analysis on the downregulated genes (DRGs). We further identified that α-N-acetylglucosaminidase (NAGLU), one of the DRGs, played a critical role in the progression of EAS. NAGLU is a key enzyme for the degradation of heparan sulfate (HS), and its deficiency could cause lysosomal accumulation and lead to dysfunctions of VECs. We found that siRNA knockdown of NAGLU in human umbilical vein endothelial cell (HUVEC) aggravated the abnormal accumulation of lysosomes and HS. In addition, the expression of NAGLU was reduced in the EAS model constructed by ApoE−/- mice. Furthermore, we also showed that heparin-binding EGF-like growth factor (HB-EGF) protein was upregulated while NAGLU knockdown in HUVEC could specifically bind to vascular endothelial growth factor receptor 2 (VEGFR2) and promote its phosphorylation, ultimately activating the phosphorylation levels of extracellular signal-regulated kinases (ERKs). However, the application of selective VEGFR2 and ERKs inhibitors, SU5614 and PD98059, respectively, could reverse the abnormal lysosomal storage caused by NAGLU knockdown. These results indicated that downregulation of NAGLU in HUVEC increases the abnormal accumulation of lysosomes and may be a potential biomarker for the diagnosis of EAS.
Collapse
|
39
|
Epigenetic landscape of drug responses revealed through large-scale ChIP-seq data analyses. BMC Bioinformatics 2022; 23:51. [PMID: 35073843 PMCID: PMC8785570 DOI: 10.1186/s12859-022-04571-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 01/10/2022] [Indexed: 12/14/2022] Open
Abstract
Abstract
Background
Elucidating the modes of action (MoAs) of drugs and drug candidate compounds is critical for guiding translation from drug discovery to clinical application. Despite the development of several data-driven approaches for predicting chemical–disease associations, the molecular cues that organize the epigenetic landscape of drug responses remain poorly understood.
Results
With the use of a computational method, we attempted to elucidate the epigenetic landscape of drug responses, in terms of transcription factors (TFs), through large-scale ChIP-seq data analyses. In the algorithm, we systematically identified TFs that regulate the expression of chemically induced genes by integrating transcriptome data from chemical induction experiments and almost all publicly available ChIP-seq data (consisting of 13,558 experiments). By relating the resultant chemical–TF associations to a repository of associated proteins for a wide range of diseases, we made a comprehensive prediction of chemical–TF–disease associations, which could then be used to account for drug MoAs. Using this approach, we predicted that: (1) cisplatin promotes the anti-tumor activity of TP53 family members but suppresses the cancer-inducing function of MYCs; (2) inhibition of RELA and E2F1 is pivotal for leflunomide to exhibit antiproliferative activity; and (3) CHD8 mediates valproic acid-induced autism.
Conclusions
Our proposed approach has the potential to elucidate the MoAs for both approved drugs and candidate compounds from an epigenetic perspective, thereby revealing new therapeutic targets, and to guide the discovery of unexpected therapeutic effects, side effects, and novel targets and actions.
Collapse
|
40
|
A community challenge for a pancancer drug mechanism of action inference from perturbational profile data. Cell Rep Med 2022; 3:100492. [PMID: 35106508 PMCID: PMC8784774 DOI: 10.1016/j.xcrm.2021.100492] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 08/08/2021] [Accepted: 12/15/2021] [Indexed: 12/14/2022]
Abstract
The Columbia Cancer Target Discovery and Development (CTD2) Center is developing PANACEA, a resource comprising dose-responses and RNA sequencing (RNA-seq) profiles of 25 cell lines perturbed with ∼400 clinical oncology drugs, to study a tumor-specific drug mechanism of action. Here, this resource serves as the basis for a DREAM Challenge assessing the accuracy and sensitivity of computational algorithms for de novo drug polypharmacology predictions. Dose-response and perturbational profiles for 32 kinase inhibitors are provided to 21 teams who are blind to the identity of the compounds. The teams are asked to predict high-affinity binding targets of each compound among ∼1,300 targets cataloged in DrugBank. The best performing methods leverage gene expression profile similarity analysis as well as deep-learning methodologies trained on individual datasets. This study lays the foundation for future integrative analyses of pharmacogenomic data, reconciliation of polypharmacology effects in different tumor contexts, and insights into network-based assessments of drug mechanisms of action. Drug-perturbed RNA sequencing data can be used to identify drug targets Technology-based drug-target definitions often subsume literature definitions Literature and screening datasets provide complementary information on drug mechanisms
Collapse
|
41
|
A human-based multi-gene signature enables quantitative drug repurposing for metabolic disease. eLife 2022; 11:68832. [PMID: 35037854 PMCID: PMC8763401 DOI: 10.7554/elife.68832] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 11/26/2021] [Indexed: 12/22/2022] Open
Abstract
Insulin resistance (IR) contributes to the pathophysiology of diabetes, dementia, viral infection, and cardiovascular disease. Drug repurposing (DR) may identify treatments for IR; however, barriers include uncertainty whether in vitro transcriptomic assays yield quantitative pharmacological data, or how to optimise assay design to best reflect in vivo human disease. We developed a clinical-based human tissue IR signature by combining lifestyle-mediated treatment responses (>500 human adipose and muscle biopsies) with biomarkers of disease status (fasting IR from >1200 biopsies). The assay identified a chemically diverse set of >130 positively acting compounds, highly enriched in true positives, that targeted 73 proteins regulating IR pathways. Our multi-gene RNA assay score reflected the quantitative pharmacological properties of a set of epidermal growth factor receptor-related tyrosine kinase inhibitors, providing insight into drug target specificity; an observation supported by deep learning-based genome-wide predicted pharmacology. Several drugs identified are suitable for evaluation in patients, particularly those with either acute or severe chronic IR.
Collapse
|
42
|
Systematic identification of ACE2 expression modulators reveals cardiomyopathy as a risk factor for mortality in COVID-19 patients. Genome Biol 2022; 23:15. [PMID: 35012625 PMCID: PMC8743438 DOI: 10.1186/s13059-021-02589-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 12/23/2021] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Angiotensin-converting enzyme 2 (ACE2) is the cell-entry receptor for SARS-CoV-2. It plays critical roles in both the transmission and the pathogenesis of COVID-19. Comprehensive profiling of ACE2 expression patterns could reveal risk factors of severe COVID-19 illness. While the expression of ACE2 in healthy human tissues has been well characterized, it is not known which diseases and drugs might be associated with ACE2 expression. RESULTS We develop GENEVA (GENe Expression Variance Analysis), a semi-automated framework for exploring massive amounts of RNA-seq datasets. We apply GENEVA to 286,650 publicly available RNA-seq samples to identify any previously studied experimental conditions that could be directly or indirectly associated with ACE2 expression. We identify multiple drugs, genetic perturbations, and diseases that are associated with the expression of ACE2, including cardiomyopathy, HNF1A overexpression, and drug treatments with RAD140 and itraconazole. Our joint analysis of seven datasets confirms ACE2 upregulation in all cardiomyopathy categories. Using electronic health records data from 3936 COVID-19 patients, we demonstrate that patients with pre-existing cardiomyopathy have an increased mortality risk than age-matched patients with other cardiovascular conditions. GENEVA is applicable to any genes of interest and is freely accessible at http://genevatool.org . CONCLUSIONS This study identifies multiple diseases and drugs that are associated with the expression of ACE2. The effect of these conditions should be carefully studied in COVID-19 patients. In particular, our analysis identifies cardiomyopathy patients as a high-risk group, with increased ACE2 expression in the heart and increased mortality after SARS-COV-2 infection.
Collapse
|
43
|
Prediction of lncRNA-disease association based on a Laplace normalized random walk with restart algorithm on heterogeneous networks. BMC Bioinformatics 2022; 23:5. [PMID: 34983367 PMCID: PMC8729064 DOI: 10.1186/s12859-021-04538-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/15/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND More and more evidence showed that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human sophisticated diseases. Therefore, predicting human lncRNA-disease associations is a challenging and urgently task in bioinformatics to research of human sophisticated diseases. RESULTS In the work, a global network-based computational framework called as LRWRHLDA were proposed which is a universal network-based method. Firstly, four isomorphic networks include lncRNA similarity network, disease similarity network, gene similarity network and miRNA similarity network were constructed. And then, six heterogeneous networks include known lncRNA-disease, lncRNA-gene, lncRNA-miRNA, disease-gene, disease-miRNA, and gene-miRNA associations network were applied to design a multi-layer network. Finally, the Laplace normalized random walk with restart algorithm in this global network is suggested to predict the relationship between lncRNAs and diseases. CONCLUSIONS The ten-fold cross validation is used to evaluate the performance of LRWRHLDA. As a result, LRWRHLDA achieves an AUC of 0.98402, which is higher than other compared methods. Furthermore, LRWRHLDA can predict isolated disease-related lnRNA (isolated lnRNA related disease). The results for colorectal cancer, lung adenocarcinoma, stomach cancer and breast cancer have been verified by other researches. The case studies indicated that our method is effective.
Collapse
|
44
|
Genetic regulation of THBS1 methylation in diabetic retinopathy. Front Endocrinol (Lausanne) 2022; 13:991803. [PMID: 36452318 PMCID: PMC9702561 DOI: 10.3389/fendo.2022.991803] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 10/25/2022] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Diabetic retinopathy (DR) is a common and serious microvascular complication of diabetes mellitus (DM), but its pathological mechanism, especially the formation mechanism of new blood vessels remains unclear. Thrombospondin-1 (THBS1) is a potent endogenous inhibitor of angiogenesis and it was found over expressed in DR in our previous study. Our study aimed to determine whether overexpression of THBS1 is associated with its promoter methylation level, and whether methylation of THBS1 is regulated by genetic variants in DR. METHODS Patients diagnosed with DR and DM patients without retinal problems were included in the case-control study. DNA methylation detection of THBS1 by bisulfite sequencing and genotyping of specific SNPs by MassARRAY analysis were performed in the patients recruited from 2019-2020. Real time quantitative PCR was performed to obtain mRNA expression of THBS1 in the patients recruited from August to October 2022. The differentially methylated CpG loci of THBS1 were identified by logistic regression, and associations between 13 SNPs and methylation levels of CpG loci were tested by methylation quantitative trait loci (meQTLs) analysis. Mediation analysis was applied to determine whether CpG loci were intermediate factors between meQTLs and DR. RESULTS 150 patients diagnosed with DR and 150 DM patients without retinal complications were enrolled in the first recruitment, seven DR patients and seven DM patients were enrolled in the second recruitment. The patients with DR showed promoter hypomethylation of THBS1 (P value = 0.002), and six out of thirty-nine CpG sites within two CpG islands (CGIs) showed hypomethylation(P value < 0.05). THBS1 mRNA expression in peripheral blood was significantly higher in DR patients than in DM patients. Five out of thirteen cis-meQTLs were identified to be associated with CpG sites: rs13329154, rs34973764 and rs5812091 were associated with cis-meQTLs of CpG-4 (P value=0.0145, 0.0095, 0.0158), rs11070177 and rs1847663 were associated with cis-meQTLs of CpG-2 and CpG-3 respectively (P value=0.0201, 0.0275). CpG-4 methylation significantly mediated the effect of the polymorphism rs34973764 on DR (B=0.0535, Boot 95%CI: 0.004~0.1336). CONCLUSION THBS1 overexpression is related to THBS1 hypomethylation in patients with DR. DNA methylation may be genetically controlled in DR.
Collapse
|
45
|
GPCRs Are Optimal Regulators of Complex Biological Systems and Orchestrate the Interface between Health and Disease. Int J Mol Sci 2021; 22:ijms222413387. [PMID: 34948182 PMCID: PMC8708147 DOI: 10.3390/ijms222413387] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/08/2021] [Accepted: 12/09/2021] [Indexed: 02/06/2023] Open
Abstract
GPCRs arguably represent the most effective current therapeutic targets for a plethora of diseases. GPCRs also possess a pivotal role in the regulation of the physiological balance between healthy and pathological conditions; thus, their importance in systems biology cannot be underestimated. The molecular diversity of GPCR signaling systems is likely to be closely associated with disease-associated changes in organismal tissue complexity and compartmentalization, thus enabling a nuanced GPCR-based capacity to interdict multiple disease pathomechanisms at a systemic level. GPCRs have been long considered as controllers of communication between tissues and cells. This communication involves the ligand-mediated control of cell surface receptors that then direct their stimuli to impact cell physiology. Given the tremendous success of GPCRs as therapeutic targets, considerable focus has been placed on the ability of these therapeutics to modulate diseases by acting at cell surface receptors. In the past decade, however, attention has focused upon how stable multiprotein GPCR superstructures, termed receptorsomes, both at the cell surface membrane and in the intracellular domain dictate and condition long-term GPCR activities associated with the regulation of protein expression patterns, cellular stress responses and DNA integrity management. The ability of these receptorsomes (often in the absence of typical cell surface ligands) to control complex cellular activities implicates them as key controllers of the functional balance between health and disease. A greater understanding of this function of GPCRs is likely to significantly augment our ability to further employ these proteins in a multitude of diseases.
Collapse
|
46
|
Besca, a single-cell transcriptomics analysis toolkit to accelerate translational research. NAR Genom Bioinform 2021; 3:lqab102. [PMID: 34761219 PMCID: PMC8573822 DOI: 10.1093/nargab/lqab102] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 10/08/2021] [Accepted: 10/12/2021] [Indexed: 02/07/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) revolutionized our understanding of disease biology. The promise it presents to also transform translational research requires highly standardized and robust software workflows. Here, we present the toolkit Besca, which streamlines scRNA-seq analyses and their use to deconvolute bulk RNA-seq data according to current best practices. Beyond a standard workflow covering quality control, filtering, and clustering, two complementary Besca modules, utilizing hierarchical cell signatures and supervised machine learning, automate cell annotation and provide harmonized nomenclatures. Subsequently, the gene expression profiles can be employed to estimate cell type proportions in bulk transcriptomics data. Using multiple, diverse scRNA-seq datasets, some stemming from highly heterogeneous tumor tissue, we show how Besca aids acceleration, interoperability, reusability and interpretability of scRNA-seq data analyses, meeting crucial demands in translational research and beyond.
Collapse
|
47
|
WGCNA Identification of Genes and Pathways Involved in the Pathogenesis of Postmenopausal Osteoporosis. Int J Gen Med 2021; 14:8341-8353. [PMID: 34815706 PMCID: PMC8605872 DOI: 10.2147/ijgm.s336310] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 11/03/2021] [Indexed: 11/23/2022] Open
Abstract
Purpose Postmenopausal osteoporosis (PMO) patients may suffer from chronic pain and increased fractures due to brittle bones that seriously affect their normal work and life. Exploring the pathogenesis of PMO can help clinicians construct individualized therapeutic targets. Methods Differentially expressed genes (DEGs) were identified by analyzing the microarray assays of monocytes from 20 PMO and 20 control samples. Weighted correlation network analysis (WGCNA) and gene set enrichment analysis (GAEA) were performed. Genes associated with PMO were identified in the Comparative Toxicogenomics Database (CTD). miRNAs associated with osteoporosis were found in miRNet, and target genes were predicted. Hub genes and functional pathways associated with PMO were also identified. miRNA-mRNA networks were constructed. The association between hub genes and PMO was analyzed in the CTD. Results A total of 1055 genes were up-regulated, and 694 genes were down-regulated in PMO samples (P<0.01). Five modules were identified by WGCNA. The blue module was significantly associated with PMO and selected for further analysis (P < 0.05). A total of 229 genes were significantly associated with PMO gene significance and module membership. Pathway variations were predominantly enriched in mRNA metabolic process, RNA splicing, Notch signaling pathway, apoptosis, cytokine-cytokine receptor interaction and so on. We identified 10 hub genes associated with PMO with different inference scores. Conclusion We identified genes, miRNAs, and pathways associated with PMO. These molecules may participate in the pathogenesis of PMO and serve as therapeutic targets.
Collapse
|
48
|
Multi-omics data integration and network-based analysis drives a multiplex drug repurposing approach to a shortlist of candidate drugs against COVID-19. Brief Bioinform 2021; 22:bbab114. [PMID: 34009288 PMCID: PMC8135326 DOI: 10.1093/bib/bbab114] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 02/01/2021] [Accepted: 03/13/2021] [Indexed: 02/06/2023] Open
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic is undeniably the most severe global health emergency since the 1918 Influenza outbreak. Depending on its evolutionary trajectory, the virus is expected to establish itself as an endemic infectious respiratory disease exhibiting seasonal flare-ups. Therefore, despite the unprecedented rally to reach a vaccine that can offer widespread immunization, it is equally important to reach effective prevention and treatment regimens for coronavirus disease 2019 (COVID-19). Contributing to this effort, we have curated and analyzed multi-source and multi-omics publicly available data from patients, cell lines and databases in order to fuel a multiplex computational drug repurposing approach. We devised a network-based integration of multi-omic data to prioritize the most important genes related to COVID-19 and subsequently re-rank the identified candidate drugs. Our approach resulted in a highly informed integrated drug shortlist by combining structural diversity filtering along with experts' curation and drug-target mapping on the depicted molecular pathways. In addition to the recently proposed drugs that are already generating promising results such as dexamethasone and remdesivir, our list includes inhibitors of Src tyrosine kinase (bosutinib, dasatinib, cytarabine and saracatinib), which appear to be involved in multiple COVID-19 pathophysiological mechanisms. In addition, we highlight specific immunomodulators and anti-inflammatory drugs like dactolisib and methotrexate and inhibitors of histone deacetylase like hydroquinone and vorinostat with potential beneficial effects in their mechanisms of action. Overall, this multiplex drug repurposing approach, developed and utilized herein specifically for SARS-CoV-2, can offer a rapid mapping and drug prioritization against any pathogen-related disease.
Collapse
|
49
|
LINCS Dataset-Based Repositioning of Dutasteride as an Anti-Neuroinflammation Agent. Brain Sci 2021; 11:brainsci11111411. [PMID: 34827410 PMCID: PMC8615696 DOI: 10.3390/brainsci11111411] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/17/2021] [Accepted: 10/22/2021] [Indexed: 12/21/2022] Open
Abstract
Neuroinflammation is often accompanied by central nervous system (CNS) injury seen in various CNS diseases, with no specific treatment. Drug repurposing is a strategy of finding new uses for approved or investigational drugs, and can be enabled by the Library of Integrated Network-based Cellular Signatures (LINCS), a large drug perturbation database. In this study, the signatures of Lipopolysaccharide (LPS) were compared with the signatures of compounds contained in the LINCS dataset. To the top 100 compounds obtained, the Quantitative Structure-Activity Relationship (QSAR)-based tool admetSAR was used to identify the top 10 candidate compounds with relatively high blood–brain barrier (BBB) penetration. Furthermore, the seventh-ranked compound, dutasteride, a 5-α-reductase inhibitor, was selected for in vitro and in vivo validation of its anti-neuroinflammation activity. The results showed that dutasteride significantly reduced the levels of IL-6 and TNF-α in the supernatants of LPS-stimulated BV2 cells, and decreased the levels of IL-6 in the hippocampus and plasma, and the number of activated microglia in the brain of LPS administration mice. Furthermore, dutasteride also attenuated the cognitive impairment caused by LPS stimulation in mice. Taken together, this study demonstrates that the LINCS dataset-based drug repurposing strategy is an effective approach, and the predicted candidate, dutasteride, has the potential to ameliorate LPS-induced neuroinflammation and cognitive impairment.
Collapse
|
50
|
Repositioning of a novel GABA-B receptor agonist, AZD3355 (Lesogaberan), for the treatment of non-alcoholic steatohepatitis. Sci Rep 2021; 11:20827. [PMID: 34675338 PMCID: PMC8531016 DOI: 10.1038/s41598-021-99008-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 09/14/2021] [Indexed: 01/02/2023] Open
Abstract
Non-alcoholic steatohepatitis (NASH) is a rising health challenge, with no approved drugs. We used a computational drug repositioning strategy to uncover a novel therapy for NASH, identifying a GABA-B receptor agonist, AZD3355 (Lesogaberan) previously evaluated as a therapy for esophageal reflux. AZD3355's potential efficacy in NASH was tested in human stellate cells, human precision cut liver slices (hPCLS), and in vivo in a well-validated murine model of NASH. In human stellate cells AZD3355 significantly downregulated profibrotic gene and protein expression. Transcriptomic analysis of these responses identified key regulatory nodes impacted by AZD3355, including Myc, as well as MAP and ERK kinases. In PCLS, AZD3355 down-regulated collagen1α1, αSMA and TNF-α mRNAs as well as secreted collagen1α1. In vivo, the drug significantly improved histology, profibrogenic gene expression, and tumor development, which was comparable to activity of obeticholic acid in a robust mouse model of NASH, but awaits further testing to determine its relative efficacy in patients. These data identify a well-tolerated clinical stage asset as a novel candidate therapy for human NASH through its hepatoprotective, anti-inflammatory and antifibrotic mechanisms of action. The approach validates computational methods to identify novel therapies in NASH in uncovering new pathways of disease development that can be rapidly translated into clinical trials.
Collapse
|