1
|
Loss of ORP3 induces aneuploidy and promotes bladder cancer cell invasion through deregulated microtubule and actin dynamics. Cell Mol Life Sci 2023; 80:299. [PMID: 37740130 PMCID: PMC10516806 DOI: 10.1007/s00018-023-04959-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/07/2023] [Accepted: 09/08/2023] [Indexed: 09/24/2023]
Abstract
We have recently shown that loss of ORP3 leads to aneuploidy induction and promotes tumor formation. However, the specific mechanisms by which ORP3 contributes to ploidy-control and cancer initiation and progression is still unknown. Here, we report that ORP3 is highly expressed in ureter and bladder epithelium while its expression is downregulated in invasive bladder cancer cell lines and during tumor progression, both in human and in mouse bladder cancer. Moreover, we observed an increase in the incidence of N-butyl-N-(4-hydroxybutyl)-nitrosamine (BBN)-induced invasive bladder carcinoma in the tissue-specific Orp3 knockout mice. Experimental data demonstrate that ORP3 protein interacts with γ-tubulin at the centrosomes and with components of actin cytoskeleton. Altering the expression of ORP3 induces aneuploidy and genomic instability in telomerase-immortalized urothelial cells with a stable karyotype and influences the migration and invasive capacity of bladder cancer cell lines. These findings demonstrate a crucial role of ORP3 in ploidy-control and indicate that ORP3 is a bona fide tumor suppressor protein. Of note, the presented data indicate that ORP3 affects both cell invasion and migration as well as genome stability through interactions with cytoskeletal components, providing a molecular link between aneuploidy and cell invasion and migration, two crucial characteristics of metastatic cells.
Collapse
|
2
|
P04.18 Heterogeneity and plasticity of integrin α5β1 expression in glioblastoma stem cells. Neuro Oncol 2021. [DOI: 10.1093/neuonc/noab180.072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Abstract
BACKGROUND
Glioblastoma (GBM) is the most frequent and deadliest type of central nervous system tumors. Despite the treatment by the Stupp protocol, almost all patients relapse and new therapeutic protocols have been unsuccessful for ameliorating patient survival. Molecular heterogeneity of GBM and existence of glioma stem cells (GSC) may be linked to therapy resistance and recurrence. We demonstrated earlier that α5β1 integrin is a GBM therapeutic target which participate to therapy resistance; a high expression in patient tumors is linked to a worse prognosis. Expression of α5β1 integrin is heterogeneous inter- and intra-tumorally. We particularly addressed the role of glioma stem cell plasticity in the modulation of the integrin expression. Stem cells reside in specific niches (perivascular or hypoxic niches) in the tumor and are at the origin of the more differentiated tumor cell bulk. Metabolism is known to change between the different GSC states and may be affected by or may affect the integrin expression. The aim of our work is therefore to consider the expression of the integrin α5β1 in relationship with GSC differentiation or in hypoxic environment and with cell metabolism.
MATERIAL AND METHODS
Ten different patient-derived glioma stem cell lines were investigated. Cell culture in stem cell medium (neurospheres) or differentiation medium (adherent cell monolayer) was made in normoxia (21% O2) or hypoxia (1%O2). Alternatively, chemically-induced hypoxia (cobalt chloride/desferoxiamine) was used. Integrin expression was kinetically checked at the mRNA (RT-qPCR) or protein (Western blot) levels. Cell metabolism was investigated with the Seahorse Xfp technology and by HRMAS-NMR.
RESULTS
No GSC lines (neurospheres) expressed the α5β1 integrin. Interestingly, only half of them did after differentiation suggesting a first level of heterogeneity. A second level of heterogeneity was observed in hypoxic conditions provoking induction of integrin α5β1 expression in only some non-differentiated GSC. Three categories of GSC were thus characterized: one able to express the integrin in hypoxia and after differentiation, one never expressing it and the third one only after differentiation. Cell metabolism differed between GSC before and after differentiation and in presence of integrin α5β1 antagonists. Specific glioma regulator network analysis revealed new targets to be inhibited concomitantly with the integrin.
CONCLUSION
Data suggest that α5β1 integrin expression may be induced by different signaling pathways. Molecular switches may occur either when stem cells differentiate to tumor cells but also directly in stem cells in hypoxic niches. Characterization of α5β1 integrin expression drivers may help to find new therapeutic targets but also to delineate subpopulation of patients who would benefit from an anti-integrin strategy.
Collapse
|
3
|
Inference of an Integrative, Executable Network for Rheumatoid Arthritis Combining Data-Driven Machine Learning Approaches and a State-of-the-Art Mechanistic Disease Map. J Pers Med 2021; 11:785. [PMID: 34442429 PMCID: PMC8400381 DOI: 10.3390/jpm11080785] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 08/02/2021] [Accepted: 08/10/2021] [Indexed: 02/07/2023] Open
Abstract
Rheumatoid arthritis (RA) is a multifactorial, complex autoimmune disease that involves various genetic, environmental, and epigenetic factors. Systems biology approaches provide the means to study complex diseases by integrating different layers of biological information. Combining multiple data types can help compensate for missing or conflicting information and limit the possibility of false positives. In this work, we aim to unravel mechanisms governing the regulation of key transcription factors in RA and derive patient-specific models to gain more insights into the disease heterogeneity and the response to treatment. We first use publicly available transcriptomic datasets (peripheral blood) relative to RA and machine learning to create an RA-specific transcription factor (TF) co-regulatory network. The TF cooperativity network is subsequently enriched in signalling cascades and upstream regulators using a state-of-the-art, RA-specific molecular map. Then, the integrative network is used as a template to analyse patients' data regarding their response to anti-TNF treatment and identify master regulators and upstream cascades affected by the treatment. Finally, we use the Boolean formalism to simulate in silico subparts of the integrated network and identify combinations and conditions that can switch on or off the identified TFs, mimicking the effects of single and combined perturbations.
Collapse
|
4
|
Abstract
In many cancers, mechanisms of gene regulation can be severely altered. Identification of deregulated genes, which do not follow the regulation processes that exist between transcription factors and their target genes, is of importance to better understand the development of the disease. We propose a methodology to detect deregulation mechanisms with a particular focus on cancer subtypes. This strategy is based on the comparison between tumoral and healthy cells. First, we use gene expression data from healthy cells to infer a reference gene regulatory network. Then, we compare it with gene expression levels in tumor samples to detect deregulated target genes. We finally measure the ability of each transcription factor to explain these deregulations. We apply our method on a public bladder cancer data set derived from The Cancer Genome Atlas project and confirm that it captures hallmarks of cancer subtypes. We also show that it enables the discovery of new potential biomarkers.
Collapse
|
5
|
Metabolic syndrome, malnutrition, and its associations with cardiovascular and all-cause mortality in hemodialysis patients: Follow-up for three years. SAUDI JOURNAL OF KIDNEY DISEASES AND TRANSPLANTATION 2021; 31:129-135. [PMID: 32129205 DOI: 10.4103/1319-2442.279932] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Metabolic disorder contributes to the increase in the mortality rate of patients on hemodialysis (HD). The aim of this study was to estimate the prevalence of metabolic syndrome (MS) and malnutrition in patients on maintenance HD and to evaluate their influence on cardiovascular and all-cause mortality during the follow-up. We carried out a prospective cross- sectional study in which we enrolled 100 patients from a single center who had been followed up for three years. Collected data included demographic characteristics, detailed medical history, clinical variables, MS variables, nutritional status, and laboratory findings. The outcomes were the occurrence of a cardiovascular event and cardiovascular or all-cause mortality during the follow-up period. The Statistical Package for the Social Sciences software was used for statistical analysis. Whereas 50% of patients had MS, 23% showed evidence of malnutrition. Patients with MS were older and had more preexisting cardiovascular diseases (CVDs). All patients were followed for 36 months. During this time, 19 patients with MS and 14 patients without MS died (38% vs. 28%; P = 0.19), most frequently of CVD. Mean survival time was 71.52 ± 42.1 months for MS group versus 92.06 ± 65 months for non-MS group, but the difference was not significant. MS was related with a higher cardiovascular mortality, while malnutrition was significantly associated with all-cause mortality. Our data showed that MS was not related to cardiovascular or all-cause mortality in HD patients and did not influence survival. The independent risk factors for all-cause mortality were older age, preexisting CVD, and malnutrition.
Collapse
|
6
|
Identification of putative master regulators in rheumatoid arthritis synovial fibroblasts using gene expression data and network inference. Sci Rep 2020; 10:16236. [PMID: 33004899 PMCID: PMC7529794 DOI: 10.1038/s41598-020-73147-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 09/14/2020] [Indexed: 12/15/2022] Open
Abstract
Rheumatoid arthritis (RA) is a systemic autoimmune disease that affects the synovial joints of the body. Rheumatoid arthritis fibroblast-like synoviocytes (RA FLS) are central players in the disease pathogenesis, as they are involved in the secretion of cytokines and proteolytic enzymes, exhibit invasive traits, high rate of self-proliferation and an apoptosis-resistant phenotype. We aim at characterizing transcription factors (TFs) that are master regulators in RA FLS and could potentially explain phenotypic traits. We make use of differentially expressed genes in synovial tissue from patients suffering from RA and osteoarthritis (OA) to infer a TF co-regulatory network, using dedicated software. The co-regulatory network serves as a reference to analyze microarray and single-cell RNA-seq data from isolated RA FLS. We identified five master regulators specific to RA FLS, namely BATF, POU2AF1, STAT1, LEF1 and IRF4. TF activity of the identified master regulators was also estimated with the use of two additional, independent software. The identified TFs contribute to the regulation of inflammation, proliferation and apoptosis, as indicated by the comparison of their differentially expressed target genes with hallmark molecular signatures derived from the Molecular Signatures Database (MSigDB). Our results show that TFs influence could be used to identify putative master regulators of phenotypic traits and suggest novel, druggable targets for experimental validation.
Collapse
|
7
|
|
8
|
Latent network-based representations for large-scale gene expression data analysis. BMC Bioinformatics 2019; 19:466. [PMID: 30717663 PMCID: PMC7394327 DOI: 10.1186/s12859-018-2481-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Accepted: 11/09/2018] [Indexed: 12/12/2022] Open
Abstract
Background With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. Results In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. Conclusion Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.
Collapse
|
9
|
Abstract
Background Genome-scale metabolic models provide an opportunity for rational approaches to studies of the different reactions taking place inside the cell. The integration of these models with gene regulatory networks is a hot topic in systems biology. The methods developed to date focus mostly on resolving the metabolic elements and use fairly straightforward approaches to assess the impact of genome expression on the metabolic phenotype. Results We present here a method for integrating the reverse engineering of gene regulatory networks into these metabolic models. We applied our method to a high-dimensional gene expression data set to infer a background gene regulatory network. We then compared the resulting phenotype simulations with those obtained by other relevant methods. Conclusions Our method outperformed the other approaches tested and was more robust to noise. We also illustrate the utility of this method for studies of a complex biological phenomenon, the diauxic shift in yeast.
Collapse
|
10
|
Abstract
Background Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Results Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. GREAT:SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. Conclusions We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1047-0) contains supplementary material, which is available to authorized users.
Collapse
|
11
|
GREAT: a web portal for Genome Regulatory Architecture Tools. Nucleic Acids Res 2016; 44:W77-82. [PMID: 27151196 PMCID: PMC4987929 DOI: 10.1093/nar/gkw384] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Accepted: 04/26/2016] [Indexed: 11/15/2022] Open
Abstract
GREAT (Genome REgulatory Architecture Tools) is a novel web portal for tools designed to generate user-friendly and biologically useful analysis of genome architecture and regulation. The online tools of GREAT are freely accessible and compatible with essentially any operating system which runs a modern browser. GREAT is based on the analysis of genome layout -defined as the respective positioning of co-functional genes- and its relation with chromosome architecture and gene expression. GREAT tools allow users to systematically detect regular patterns along co-functional genomic features in an automatic way consisting of three individual steps and respective interactive visualizations. In addition to the complete analysis of regularities, GREAT tools enable the use of periodicity and position information for improving the prediction of transcription factor binding sites using a multi-view machine learning approach. The outcome of this integrative approach features a multivariate analysis of the interplay between the location of a gene and its regulatory sequence. GREAT results are plotted in web interactive graphs and are available for download either as individual plots, self-contained interactive pages or as machine readable tables for downstream analysis. The GREAT portal can be reached at the following URL https://absynth.issb.genopole.fr/GREAT and each individual GREAT tool is available for downloading.
Collapse
|
12
|
Abstract
In tumoral cells, gene regulation mechanisms are severely altered. Genes that do not react normally to their regulators' activity can provide explanations for the tumoral behavior, and be characteristic of cancer subtypes. We thus propose a statistical methodology to identify the misregulated genes given a reference network and gene expression data. Our model is based on a regulatory process in which all genes are allowed to be deregulated. We derive an EM algorithm where the hidden variables correspond to the status (under/over/normally expressed) of the genes and where the E-step is solved thanks to a message passing algorithm. Our procedure provides posterior probabilities of deregulation in a given sample for each gene. We assess the performance of our method by numerical experiments on simulations and on a bladder cancer data set.
Collapse
|
13
|
A systems biology approach towards the prediction of ciliopathy mechanisms. Cilia 2015. [PMCID: PMC4518625 DOI: 10.1186/2046-2530-4-s1-p88] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
14
|
CoRegNet: reconstruction and integrated analysis of co-regulatory networks. Bioinformatics 2015; 31:3066-8. [PMID: 25979476 PMCID: PMC4565029 DOI: 10.1093/bioinformatics/btv305] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 05/08/2015] [Indexed: 11/13/2022] Open
Abstract
UNLABELLED CoRegNet is an R/Bioconductor package to analyze large-scale transcriptomic data by highlighting sets of co-regulators. Based on a transcriptomic dataset, CoRegNet can be used to: reconstruct a large-scale co-regulatory network, integrate regulation evidences such as transcription factor binding sites and ChIP data, estimate sample-specific regulator activity, identify cooperative transcription factors and analyze the sample-specific combinations of active regulators through an interactive visualization tool. In this study CoRegNet was used to identify driver regulators of bladder cancer. AVAILABILITY CoRegNet is available at http://bioconductor.org/packages/CoRegNet CONTACT remy.nicolle@issb.genopole.fr or mohamed.elati@issb.genopole.fr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
15
|
Differences in gene transcriptomic pattern of Plasmodium falciparum in children with cerebral malaria and asymptomatic carriers. PLoS One 2014; 9:e114401. [PMID: 25479608 PMCID: PMC4257676 DOI: 10.1371/journal.pone.0114401] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2014] [Accepted: 11/10/2014] [Indexed: 11/24/2022] Open
Abstract
The mechanisms underlying the heterogeneity of clinical malaria remain largely unknown. We hypothesized that differential gene expression contributes to phenotypic variation of parasites which results in a specific interaction with the host, leading to different clinical features of malaria. In this study, we analyzed the transcriptomes of isolates obtained from asymptomatic carriers and patients with uncomplicated or cerebral malaria. We also investigated the transcriptomes of 3D7 clone and 3D7-Lib that expresses severe malaria associated-variant surface antigen. Our findings revealed a specific up-regulation of genes involved in pathogenesis, adhesion to host cell, and erythrocyte aggregation in parasites from patients with cerebral malaria and 3D7-Lib, compared to parasites from asymptomatic carriers and 3D7, respectively. However, we did not find any significant difference between the transcriptomes of parasites from cerebral malaria and uncomplicated malaria, suggesting similar transcriptomic pattern in these two parasite populations. The difference between isolates from asymptomatic children and cerebral malaria concerned genes coding for exported proteins, Maurer's cleft proteins, transcriptional factor proteins, proteins implicated in protein transport, as well as Plasmodium conserved and hypothetical proteins. Interestingly, UPs A1, A2, A3 and UPs B1 of var genes were predominantly found in cerebral malaria-associated isolates and those containing architectural domains of DC4, DC5, DC13 and their neighboring rif genes in 3D7-lib. Therefore, more investigations are needed to analyze the effective role of these genes during malaria infection to provide with new knowledge on malaria pathology. In addition, concomitant regulation of genes within the chromosomal neighborhood suggests a common mechanism of gene regulation in P. falciparum.
Collapse
|
16
|
Abstract
We introduce Pepper (Protein complex Expansion using Protein–Protein intERactions), a Cytoscape app designed to identify protein complexes as densely connected subnetworks from seed lists of proteins derived from proteomic studies. Pepper identifies connected subgraph by using multi-objective optimization involving two functions: (i) the coverage, a solution must contain as many proteins from the seed as possible, (ii) the density, the proteins of a solution must be as connected as possible, using only interactions from a proteome-wide interaction network. Comparisons based on gold standard yeast and human datasets showed Pepper’s integrative approach as superior to standard protein complex discovery methods. The visualization and interpretation of the results are facilitated by an automated post-processing pipeline based on topological analysis and data integration about the predicted complex proteins. Pepper is a user-friendly tool that can be used to analyse any list of proteins. Availability: Pepper is available from the Cytoscape plug-in manager or online (http://apps.cytoscape.org/apps/pepper) and released under GNU General Public License version 3. Contact: mohamed.elati@issb.genopole.fr Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
|
17
|
Hybrid Method Inference for the Construction of Cooperative Regulatory Network in Human. IEEE Trans Nanobioscience 2014; 13:97-103. [DOI: 10.1109/tnb.2014.2316920] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
18
|
Abstract
Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.
Collapse
|
19
|
Abstract
Background Identifying gene functional modules is an important step towards elucidating gene functions at a global scale. Clustering algorithms mostly rely on co-expression of genes, that is group together genes having similar expression profiles. Results We propose to cluster genes by co-regulation rather than by co-expression. We therefore present an inference algorithm for detecting co-regulated groups from gene expression data and introduce a method to cluster genes given that inferred regulatory structure. Finally, we propose to validate the clustering through a score based on the GO enrichment of the obtained groups of genes. Conclusion We evaluate the methods on the stress response of S. Cerevisiae data and obtain better scores than clustering obtained directly from gene expression.
Collapse
|
20
|
Abstract
MOTIVATION One of the most challenging tasks in the post-genomic era is the reconstruction of transcriptional regulation networks. The goal is to identify, for each gene expressed in a particular cellular context, the regulators affecting its transcription, and the co-ordination of several regulators in specific types of regulation. DNA microarrays can be used to investigate relationships between regulators and their target genes, through simultaneous observations of their RNA levels. RESULTS We propose a data mining system for inferring transcriptional regulation relationships from RNA expression values. This system is particularly suitable for the detection of cooperative transcriptional regulation. We model regulatory relationships as labelled two-layer gene regulatory networks, and describe a method for the efficient learning of these bipartite networks from discretized expression data sets. We also evaluate the statistical significance of such inferred networks and validate our methods on two public yeast expression data sets. AVAILABILITY http://www.lri.fr/~elati/licorn.html. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|