1
|
Gerstner N, Kehl T, Lenhof K, Eckhart L, Schneider L, Stöckel D, Backes C, Meese E, Keller A, Lenhof HP. GeneTrail: A Framework for the Analysis of High-Throughput Profiles. Front Mol Biosci 2021; 8:716544. [PMID: 34604304 PMCID: PMC8481803 DOI: 10.3389/fmolb.2021.716544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 09/01/2021] [Indexed: 12/05/2022] Open
Abstract
Experimental high-throughput techniques, like next-generation sequencing or microarrays, are nowadays routinely applied to create detailed molecular profiles of cells. In general, these platforms generate high-dimensional and noisy data sets. For their analysis, powerful bioinformatics tools are required to gain novel insights into the biological processes under investigation. Here, we present an overview of the GeneTrail tool suite that offers rich functionality for the analysis and visualization of (epi-)genomic, transcriptomic, miRNomic, and proteomic profiles. Our framework enables the analysis of standard bulk, time-series, and single-cell measurements and includes various state-of-the-art methods to identify potentially deregulated biological processes and to detect driving factors within those deregulated processes. We highlight the capabilities of our web service with an analysis of a single-cell COVID-19 data set that demonstrates its potential for uncovering complex molecular mechanisms. GeneTrail can be accessed freely and without login requirements at http://genetrail.bioinf.uni-sb.de.
Collapse
Affiliation(s)
- Nico Gerstner
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Tim Kehl
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Kerstin Lenhof
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Lea Eckhart
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Lara Schneider
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Daniel Stöckel
- Healthcare Digital & Data, Merck Healthcare KGaA, Darmstadt, Germany
| | - Christina Backes
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Eckart Meese
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Andreas Keller
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
- Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Stanford, CA, United States
| | - Hans-Peter Lenhof
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| |
Collapse
|
2
|
Hossain SMM, Halsana AA, Khatun L, Ray S, Mukhopadhyay A. Discovering key transcriptomic regulators in pancreatic ductal adenocarcinoma using Dirichlet process Gaussian mixture model. Sci Rep 2021; 11:7853. [PMID: 33846515 PMCID: PMC8041769 DOI: 10.1038/s41598-021-87234-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 03/23/2021] [Indexed: 12/18/2022] Open
Abstract
Pancreatic Ductal Adenocarcinoma (PDAC) is the most lethal type of pancreatic cancer, late detection leading to its therapeutic failure. This study aims to determine the key regulatory genes and their impacts on the disease’s progression, helping the disease’s etiology, which is still mostly unknown. We leverage the landmark advantages of time-series gene expression data of this disease and thereby identified the key regulators that capture the characteristics of gene activity patterns in the cancer progression. We have identified the key gene modules and predicted the functions of top genes from a reconstructed gene association network (GAN). A variation of the partial correlation method is utilized to analyze the GAN, followed by a gene function prediction task. Moreover, we have identified regulators for each target gene by gene regulatory network inference using the dynamical GENIE3 (dynGENIE3) algorithm. The Dirichlet process Gaussian process mixture model and cubic spline regression model (splineTimeR) are employed to identify the key gene modules and differentially expressed genes, respectively. Our analysis demonstrates a panel of key regulators and gene modules that are crucial for PDAC disease progression.
Collapse
Affiliation(s)
- Sk Md Mosaddek Hossain
- Computer Science and Engineering, Aliah University, Kolkata, 700160, India. .,Computer Science and Engineering, University of Kalyani, Kalyani, 741235, India.
| | | | - Lutfunnesa Khatun
- Computer Science and Engineering, University of Kalyani, Kalyani, 741235, India
| | - Sumanta Ray
- Computer Science and Engineering, Aliah University, Kolkata, 700160, India.
| | - Anirban Mukhopadhyay
- Computer Science and Engineering, University of Kalyani, Kalyani, 741235, India.
| |
Collapse
|
3
|
Al-Lamki RS, Hudson NJ, Bradley JR, Warren AY, Eisen T, Welsh SJ, Riddick ACP, O’Mahony FC, Turnbull A, Powles T, SCOTRRCC Collaborative, Reverter A, Harrison DJ, Stewart GD. The Efficacy of Sunitinib Treatment of Renal Cancer Cells Is Associated with the Protein PHAX In Vitro. BIOLOGY 2020; 9:E74. [PMID: 32272660 PMCID: PMC7236799 DOI: 10.3390/biology9040074] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 04/01/2020] [Accepted: 04/02/2020] [Indexed: 02/02/2023]
Abstract
Anti-angiogenic agents, such as the multi-tyrosine kinase inhibitor sunitinib, are key first line therapies for metastatic clear cell renal cell carcinoma (ccRCC), but their mechanism of action is not fully understood. Here, we take steps towards validating a computational prediction based on differential transcriptome network analysis that phosphorylated adapter RNA export protein (PHAX) is associated with sunitinib drug treatment. The regulatory impact factor differential network algorithm run on patient tissue samples suggests PHAX is likely an important regulator through changes in genome-wide network connectivity. Immunofluorescence staining of patient tumours showed strong localisation of PHAX to the microvasculature consistent with the anti-angiogenic effect of sunitinib. In normal kidney tissue, PHAX protein abundance was low but increased with tumour grade (G1 vs. G3/4; p < 0.01), consistent with a possible role in cancer progression. In organ culture, ccRCC cells had higher levels of PHAX protein expression than normal kidney cells, and sunitinib increased PHAX protein expression in a dose dependent manner (untreated vs. 100 µM; p < 0.05). PHAX knockdown in a ccRCC organ culture model impacted the ability of sunitinib to cause cancer cell death (p < 0.0001 untreated vs. treated), suggesting a role for PHAX in mediating the efficacy of sunitinib.
Collapse
Affiliation(s)
- Rafia S. Al-Lamki
- Department of Medicine, NIHR Cambridge Biomedical Research Centre, University of Cambridge, Cambridge CB2 0QQ, UK; (R.S.A.-L.); (J.R.B.)
| | - Nicholas J. Hudson
- School of Agriculture and Food Sciences, University of Queensland, Gatton, QLD 4343, Australia;
| | - John R. Bradley
- Department of Medicine, NIHR Cambridge Biomedical Research Centre, University of Cambridge, Cambridge CB2 0QQ, UK; (R.S.A.-L.); (J.R.B.)
| | - Anne Y. Warren
- Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, UK; (A.Y.W.); (T.E.); (S.J.W.); (A.C.P.R.)
| | - Tim Eisen
- Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, UK; (A.Y.W.); (T.E.); (S.J.W.); (A.C.P.R.)
- Department of Oncology, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Sarah J. Welsh
- Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, UK; (A.Y.W.); (T.E.); (S.J.W.); (A.C.P.R.)
| | - Antony C. P. Riddick
- Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, UK; (A.Y.W.); (T.E.); (S.J.W.); (A.C.P.R.)
| | - Fiach C. O’Mahony
- Scottish Collaboration on Translational Research into Renal Cell Cancer (SCOTRRCC); fiach.o' (F.C.O.); (A.T.); (D.J.H.)
| | - Arran Turnbull
- Scottish Collaboration on Translational Research into Renal Cell Cancer (SCOTRRCC); fiach.o' (F.C.O.); (A.T.); (D.J.H.)
| | - Thomas Powles
- Bart’s Cancer Institute, Charterhouse Square, London EC1M 6BE, UK;
| | - SCOTRRCC Collaborative
- Scottish Collaboration on Translational Research into Renal Cell Cancer (SCOTRRCC); fiach.o' (F.C.O.); (A.T.); (D.J.H.)
| | - Antonio Reverter
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, St. Lucia, QLD 4067, Australia;
| | - David J. Harrison
- Scottish Collaboration on Translational Research into Renal Cell Cancer (SCOTRRCC); fiach.o' (F.C.O.); (A.T.); (D.J.H.)
- School of Medicine, University of St. Andrews, St. Andrews KY16 9TF, UK
| | - Grant D. Stewart
- Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, UK; (A.Y.W.); (T.E.); (S.J.W.); (A.C.P.R.)
- Scottish Collaboration on Translational Research into Renal Cell Cancer (SCOTRRCC); fiach.o' (F.C.O.); (A.T.); (D.J.H.)
- Department of Surgery, University of Cambridge, Cambridge CB2 0QQ, UK
| |
Collapse
|
4
|
Ehsani R, Drabløs F. Enhanced identification of significant regulators of gene expression. BMC Bioinformatics 2020; 21:134. [PMID: 32252623 PMCID: PMC7132893 DOI: 10.1186/s12859-020-3468-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Accepted: 03/24/2020] [Indexed: 12/29/2022] Open
Abstract
Background Diseases like cancer will lead to changes in gene expression, and it is relevant to identify key regulatory genes that can be linked directly to these changes. This can be done by computing a Regulatory Impact Factor (RIF) score for relevant regulators. However, this computation is based on estimating correlated patterns of gene expression, often Pearson correlation, and an assumption about a set of specific regulators, normally transcription factors. This study explores alternative measures of correlation, using the Fisher and Sobolev metrics, and an extended set of regulators, including epigenetic regulators and long non-coding RNAs (lncRNAs). Data on prostate cancer have been used to explore the effect of these modifications. Results A tool for computation of RIF scores with alternative correlation measures and extended sets of regulators was developed and tested on gene expression data for prostate cancer. The study showed that the Fisher and Sobolev metrics lead to improved identification of well-documented regulators of gene expression in prostate cancer, and the sets of identified key regulators showed improved overlap with previously defined gene sets of relevance to cancer. The extended set of regulators lead to identification of several interesting candidates for further studies, including lncRNAs. Several key processes were identified as important, including spindle assembly and the epithelial-mesenchymal transition (EMT). Conclusions The study has shown that using alternative metrics of correlation can improve the performance of tools based on correlation of gene expression in genomic data. The Fisher and Sobolev metrics should be considered also in other correlation-based applications.
Collapse
Affiliation(s)
- Rezvan Ehsani
- Department of Mathematics, University of Zabol, Zabol, Iran. .,Department of Bioinformatics, University of Zabol, Zabol, Iran.
| | - Finn Drabløs
- Department of Cancer Research and Molecular Medicine, NTNU - Norwegian University of Science and Technology, NO-7491, Trondheim, Norway.
| |
Collapse
|
5
|
Baumgarten N, Schmidt F, Schulz MH. Improved linking of motifs to their TFs using domain information. Bioinformatics 2020; 36:1655-1662. [PMID: 31742324 PMCID: PMC7703792 DOI: 10.1093/bioinformatics/btz855] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 11/08/2019] [Accepted: 11/16/2019] [Indexed: 11/23/2022] Open
Abstract
Motivation A central aim of molecular biology is to identify mechanisms of transcriptional regulation. Transcription factors (TFs), which are DNA-binding proteins, are highly involved in these processes, thus a crucial information is to know where TFs interact with DNA and to be aware of the TFs’ DNA-binding motifs. For that reason, computational tools exist that link DNA-binding motifs to TFs either without sequence information or based on TF-associated sequences, e.g. identified via a chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiment. In this paper, we present MASSIF, a novel method to improve the performance of existing tools that link motifs to TFs relying on TF-associated sequences. MASSIF is based on the idea that a DNA-binding motif, which is correctly linked to a TF, should be assigned to a DNA-binding domain (DBD) similar to that of the mapped TF. Because DNA-binding motifs are in general not linked to DBDs, it is not possible to compare the DBD of a TF and the motif directly. Instead we created a DBD collection, which consist of TFs with a known DBD and an associated motif. This collection enables us to evaluate how likely it is that a linked motif and a TF of interest are associated to the same DBD. We named this similarity measure domain score, and represent it as a P-value. We developed two different ways to improve the performance of existing tools that link motifs to TFs based on TF-associated sequences: (i) using meta-analysis to combine P-values from one or several of these tools with the P-value of the domain score and (ii) filter unlikely motifs based on the domain score. Results We demonstrate the functionality of MASSIF on several human ChIP-seq datasets, using either motifs from the HOCOMOCO database or de novo identified ones as input motifs. In addition, we show that both variants of our method improve the performance of tools that link motifs to TFs based on TF-associated sequences significantly independent of the considered DBD type. Availability and implementation MASSIF is freely available online at https://github.com/SchulzLab/MASSIF. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nina Baumgarten
- Institute for Cardiovascular Regeneration, Goethe University, Frankfurt am Main 60590, Germany.,German Center for Cardiovascular Regeneration, Partner Site Rhein-Main, Frankfurt am Main 60590, Germany
| | - Florian Schmidt
- High-throughput Genomics & Systems Biology, Cluster of Excellence MMCI, Saarland University.,Research Group Computational Biology, Max Planck Institute for Informatics, Saarland Informatics Campus, Saarbrücken 66123, Germany
| | - Marcel H Schulz
- Institute for Cardiovascular Regeneration, Goethe University, Frankfurt am Main 60590, Germany.,German Center for Cardiovascular Regeneration, Partner Site Rhein-Main, Frankfurt am Main 60590, Germany.,High-throughput Genomics & Systems Biology, Cluster of Excellence MMCI, Saarland University.,Research Group Computational Biology, Max Planck Institute for Informatics, Saarland Informatics Campus, Saarbrücken 66123, Germany
| |
Collapse
|
6
|
Kehl T, Schneider L, Kattler K, Stöckel D, Wegert J, Gerstner N, Ludwig N, Distler U, Tenzer S, Gessler M, Walter J, Keller A, Graf N, Meese E, Lenhof HP. The role of TCF3 as potential master regulator in blastemal Wilms tumors. Int J Cancer 2018; 144:1432-1443. [PMID: 30155889 DOI: 10.1002/ijc.31834] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Revised: 07/05/2018] [Accepted: 08/13/2018] [Indexed: 12/11/2022]
Abstract
Wilms tumors are the most common type of pediatric kidney tumors. While the overall prognosis for patients is favorable, especially tumors that exhibit a blastemal subtype after preoperative chemotherapy have a poor prognosis. For an improved risk assessment and therapy stratification, it is essential to identify the driving factors that are distinctive for this aggressive subtype. In our study, we compared gene expression profiles of 33 tumor biopsies (17 blastemal and 16 other tumors) after neoadjuvant chemotherapy. The analysis of this dataset using the Regulator Gene Association Enrichment algorithm successfully identified several biomarkers and associated molecular mechanisms that distinguish between blastemal and nonblastemal Wilms tumors. Specifically, regulators involved in embryonic development and epigenetic processes like chromatin remodeling and histone modification play an essential role in blastemal tumors. In this context, we especially identified TCF3 as the central regulatory element. Furthermore, the comparison of ChIP-Seq data of Wilms tumor cell cultures from a blastemal mouse xenograft and a stromal tumor provided further evidence that the chromatin states of blastemal cells share characteristics with embryonic stem cells that are not present in the stromal tumor cell line. These stem-cell like characteristics could potentially add to the increased malignancy and chemoresistance of the blastemal subtype. Along with TCF3, we detected several additional biomarkers that are distinctive for blastemal Wilms tumors after neoadjuvant chemotherapy and that may provide leads for new therapeutic regimens.
Collapse
Affiliation(s)
- Tim Kehl
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| | - Lara Schneider
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| | - Kathrin Kattler
- Department of Genetics, Saarland University, Saarbrücken, Germany
| | - Daniel Stöckel
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| | - Jenny Wegert
- Theodor-Boveri-Institute/Biocenter, Developmental Biochemistry, and Comprehensive Cancer Center Mainfranken, Würzburg University, Würzburg, Germany
| | - Nico Gerstner
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| | - Nicole Ludwig
- Human Genetics, Saarland University, Homburg, Germany
| | - Ute Distler
- Institute for Immunology, Johannes Gutenberg University Mainz, Mainz, Germany
| | - Stefan Tenzer
- Institute for Immunology, Johannes Gutenberg University Mainz, Mainz, Germany
| | - Manfred Gessler
- Theodor-Boveri-Institute/Biocenter, Developmental Biochemistry, and Comprehensive Cancer Center Mainfranken, Würzburg University, Würzburg, Germany
| | - Jörn Walter
- Department of Genetics, Saarland University, Saarbrücken, Germany
| | - Andreas Keller
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| | - Norbert Graf
- Department of Pediatric Oncology and Hematology, Medical School, Saarland University, Homburg, Germany
| | - Eckart Meese
- Human Genetics, Saarland University, Homburg, Germany
| | - Hans-Peter Lenhof
- Center for Bioinformatics, Saarland Informatics Campus, Saarland University, Saarbrücken, Germany
| |
Collapse
|