1
|
Jo K, Sung I, Lee D, Jang H, Kim S. Inferring transcriptomic cell states and transitions only from time series transcriptome data. Sci Rep 2021; 11:12566. [PMID: 34131182 PMCID: PMC8206345 DOI: 10.1038/s41598-021-91752-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 05/31/2021] [Indexed: 02/05/2023] Open
Abstract
Cellular stages of biological processes have been characterized using fluorescence-activated cell sorting and genetic perturbations, charting a limited landscape of cellular states. Time series transcriptome data can help define new cellular states at the molecular level since the analysis of transcriptional changes can provide information on cell states and transitions. However, existing methods for inferring cell states from transcriptome data use additional information such as prior knowledge on cell types or cell-type-specific markers to reduce the complexity of data. In this study, we present a novel time series clustering framework to infer TRAnscriptomic Cellular States (TRACS) only from time series transcriptome data by integrating Gaussian process regression, shape-based distance, and ranked pairs algorithm in a single computational framework. TRACS determines patterns that correspond to hidden cellular states by clustering gene expression data. TRACS was used to analyse single-cell and bulk RNA sequencing data and successfully generated cluster networks that reflected the characteristics of key stages of biological processes. Thus, TRACS has a potential to help reveal unknown cellular states and transitions at the molecular level using only time series transcriptome data. TRACS is implemented in Python and available at http://github.com/BML-cbnu/TRACS/ .
Collapse
Affiliation(s)
- Kyuri Jo
- grid.254229.a0000 0000 9611 0917Department of Computer Engineering, Chungbuk National University, Cheongju, 28644 Korea
| | - Inyoung Sung
- grid.31501.360000 0004 0470 5905Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826 Korea
| | - Dohoon Lee
- grid.31501.360000 0004 0470 5905Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826 Korea
| | - Hyuksoon Jang
- grid.254229.a0000 0000 9611 0917Department of Computer Engineering, Chungbuk National University, Cheongju, 28644 Korea
| | - Sun Kim
- grid.31501.360000 0004 0470 5905Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826 Korea ,grid.31501.360000 0004 0470 5905Department of Computer Science and Engineering, Seoul National University, Seoul, 08826 Korea ,grid.31501.360000 0004 0470 5905Institute of Engineering Research, Seoul National University, Seoul, 08826 Korea ,grid.31501.360000 0004 0470 5905Bioinformatics Institute, Seoul National University, Seoul, 08826 Korea
| |
Collapse
|
2
|
Uhlitz F, Sieber A, Wyler E, Fritsche-Guenther R, Meisig J, Landthaler M, Klinger B, Blüthgen N. An immediate-late gene expression module decodes ERK signal duration. Mol Syst Biol 2017; 13:928. [PMID: 28468958 PMCID: PMC5448165 DOI: 10.15252/msb.20177554] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The RAF‐MEK‐ERK signalling pathway controls fundamental, often opposing cellular processes such as proliferation and apoptosis. Signal duration has been identified to play a decisive role in these cell fate decisions. However, it remains unclear how the different early and late responding gene expression modules can discriminate short and long signals. We obtained both protein phosphorylation and gene expression time course data from HEK293 cells carrying an inducible construct of the proto‐oncogene RAF. By mathematical modelling, we identified a new gene expression module of immediate–late genes (ILGs) distinct in gene expression dynamics and function. We find that mRNA longevity enables these ILGs to respond late and thus translate ERK signal duration into response amplitude. Despite their late response, their GC‐rich promoter structure suggested and metabolic labelling with 4SU confirmed that transcription of ILGs is induced immediately. A comparative analysis shows that the principle of duration decoding is conserved in PC12 cells and MCF7 cells, two paradigm cell systems for ERK signal duration. Altogether, our findings suggest that ILGs function as a gene expression module to decode ERK signal duration.
Collapse
Affiliation(s)
- Florian Uhlitz
- IRI Life Sciences & Institute for Theoretical Biology, Humboldt Universität Berlin, Berlin, Germany.,Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Anja Sieber
- IRI Life Sciences & Institute for Theoretical Biology, Humboldt Universität Berlin, Berlin, Germany.,Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Emanuel Wyler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Raphaela Fritsche-Guenther
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Johannes Meisig
- IRI Life Sciences & Institute for Theoretical Biology, Humboldt Universität Berlin, Berlin, Germany.,Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Markus Landthaler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany.,Berlin Institute of Health, Berlin, Germany
| | - Bertram Klinger
- IRI Life Sciences & Institute for Theoretical Biology, Humboldt Universität Berlin, Berlin, Germany.,Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Nils Blüthgen
- IRI Life Sciences & Institute for Theoretical Biology, Humboldt Universität Berlin, Berlin, Germany .,Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute of Health, Berlin, Germany
| |
Collapse
|
3
|
Chang YH, Dobbe R, Bhushan P, Gray JW, Tomlin CJ. Reconstruction of Gene Regulatory Networks based on Repairing Sparse Low-rank Matrices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:767-777. [PMID: 27990101 PMCID: PMC5154690 DOI: 10.1109/tcbb.2015.2465952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
With the growth of high-throughput proteomic data, in particular time series gene expression data from various perturbations, a general question that has arisen is how to organize inherently heterogenous data into meaningful structures. Since biological systems such as breast cancer tumors respond differently to various treatments, little is known about exactly how these gene regulatory networks (GRNs) operate under different stimuli. Challenges due to the lack of knowledge not only occur in modeling the dynamics of a GRN but also cause bias or uncertainties in identifying parameters or inferring the GRN structure. This paper describes a new algorithm which enables us to estimate bias error due to the effect of perturbations and correctly identify the common graph structure among biased inferred graph structures. To do this, we retrieve common dynamics of the GRN subject to various perturbations. We refer to the task as "repairing" inspired by "image repairing" in computer vision. The method can automatically correctly repair the common graph structure across perturbed GRNs, even without precise information about the effect of the perturbations. We evaluate the method on synthetic data sets and demonstrate an application to the DREAM data sets and discuss its implications to experiment design.
Collapse
Affiliation(s)
- Young Hwan Chang
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720 USA
| | - Roel Dobbe
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720 USA
| | - Palak Bhushan
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720 USA
| | - Joe W Gray
- Department of Biomedical Engineering and the Center for Spatial Systems Biomedicine, Oregon Health and Science University, Portland, OR 97239, USA
| | - Claire J Tomlin
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720 USA; Faculty Scientist, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| |
Collapse
|
4
|
Chang YH, Korkola J, Amin DN, Moasser MM, Carmena JM, Gray JW, Tomlin CJ. Disentangling multidimensional spatio-temporal data into their common and aberrant responses. PLoS One 2015; 10:e0121607. [PMID: 25901353 PMCID: PMC4406848 DOI: 10.1371/journal.pone.0121607] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 02/03/2015] [Indexed: 11/18/2022] Open
Abstract
With the advent of high-throughput measurement techniques, scientists and engineers are starting to grapple with massive data sets and encountering challenges with how to organize, process and extract information into meaningful structures. Multidimensional spatio-temporal biological data sets such as time series gene expression with various perturbations over different cell lines, or neural spike trains across many experimental trials, have the potential to acquire insight about the dynamic behavior of the system. For this potential to be realized, we need a suitable representation to understand the data. A general question is how to organize the observed data into meaningful structures and how to find an appropriate similarity measure. A natural way of viewing these complex high dimensional data sets is to examine and analyze the large-scale features and then to focus on the interesting details. Since the wide range of experiments and unknown complexity of the underlying system contribute to the heterogeneity of biological data, we develop a new method by proposing an extension of Robust Principal Component Analysis (RPCA), which models common variations across multiple experiments as the lowrank component and anomalies across these experiments as the sparse component. We show that the proposed method is able to find distinct subtypes and classify data sets in a robust way without any prior knowledge by separating these common responses and abnormal responses. Thus, the proposed method provides us a new representation of these data sets which has the potential to help users acquire new insight from data.
Collapse
Affiliation(s)
- Young Hwan Chang
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
| | - James Korkola
- Department of Biomedical Engineering and the Center for Spatial Systems Biomedicine, Oregon Health and Science University, Portland, OR, USA
| | - Dhara N. Amin
- Department of Medicine, Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, CA, USA
| | - Mark M. Moasser
- Department of Medicine, Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, CA, USA
| | - Jose M. Carmena
- Department of Electrical Engineering and Computer Sciences, Helen Wills Neuroscience Institute, University of California, Berkeley and UCB/UCSF Graduate Program in Bioengineering, CA, USA
| | - Joe W. Gray
- Department of Biomedical Engineering and the Center for Spatial Systems Biomedicine, Oregon Health and Science University, Portland, OR, USA
| | - Claire J. Tomlin
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
- Faculty Scientist, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- * E-mail:
| |
Collapse
|
5
|
Ye M, Wang Z, Wang Y, Wu R. A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq. Brief Bioinform 2014; 16:205-15. [PMID: 24817567 DOI: 10.1093/bib/bbu013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Dynamic changes of gene expression reflect an intrinsic mechanism of how an organism responds to developmental and environmental signals. With the increasing availability of expression data across a time-space scale by RNA-seq, the classification of genes as per their biological function using RNA-seq data has become one of the most significant challenges in contemporary biology. Here we develop a clustering mixture model to discover distinct groups of genes expressed during a period of organ development. By integrating the density function of multivariate Poisson distribution, the model accommodates the discrete property of read counts characteristic of RNA-seq data. The temporal dependence of gene expression is modeled by the first-order autoregressive process. The model is implemented with the Expectation-Maximization algorithm and model selection to determine the optimal number of gene clusters and obtain the estimates of Poisson parameters that describe the pattern of time-dependent expression of genes from each cluster. The model has been demonstrated by analyzing a real data from an experiment aimed to link the pattern of gene expression to catkin development in white poplar. The usefulness of the model has been validated through computer simulation. The model provides a valuable tool for clustering RNA-seq data, facilitating our global view of expression dynamics and understanding of gene regulation mechanisms.
Collapse
|
6
|
Asif HMS, Sanguinetti G. Simultaneous inference and clustering of transcriptional dynamics in gene regulatory networks. Stat Appl Genet Mol Biol 2013; 12:545-57. [PMID: 24051920 DOI: 10.1515/sagmb-2012-0010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
We present a novel method for simultaneous inference and nonparametric clustering of transcriptional dynamics from gene expression data. The proposed method uses gene expression data to infer time-varying TF profiles and cluster these temporal profiles according to the dynamics they exhibit. We use the latent structure of factorial hidden Markov model to model the transcription factor profiles as Markov chains and cluster these profiles using nonparametric mixture modeling. An efficient Gibbs sampling scheme is proposed for inference of latent variables and grouping of transcriptional dynamics into a priori unknown number of clusters. We test our model on simulated data and analyse its performance on two expression datasets; S. cerevisiae cell cycle data and E. coli oxygen starvation response data. Our results show the applicability of the method for genome wide analysis of expression data.
Collapse
|
7
|
Godsey B. Improved inference of gene regulatory networks through integrated Bayesian clustering and dynamic modeling of time-course expression data. PLoS One 2013; 8:e68358. [PMID: 23935862 PMCID: PMC3720774 DOI: 10.1371/journal.pone.0068358] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/03/2013] [Indexed: 12/23/2022] Open
Abstract
Inferring gene regulatory networks from expression data is difficult, but it is common and often useful. Most network problems are under-determined--there are more parameters than data points--and therefore data or parameter set reduction is often necessary. Correlation between variables in the model also contributes to confound network coefficient inference. In this paper, we present an algorithm that uses integrated, probabilistic clustering to ease the problems of under-determination and correlated variables within a fully Bayesian framework. Specifically, ours is a dynamic Bayesian network with integrated Gaussian mixture clustering, which we fit using variational Bayesian methods. We show, using public, simulated time-course data sets from the DREAM4 Challenge, that our algorithm outperforms non-clustering methods in many cases (7 out of 25) with fewer samples, rarely underperforming (1 out of 25), and often selects a non-clustering model if it better describes the data. Source code (GNU Octave) for BAyesian Clustering Over Networks (BACON) and sample data are available at: http://code.google.com/p/bacon-for-genetic-networks.
Collapse
Affiliation(s)
- Brian Godsey
- Department of Statistics and Probability Theory, Vienna University of Technology, Vienna, Austria.
| |
Collapse
|
8
|
Functional clustering of time series gene expression data by Granger causality. BMC SYSTEMS BIOLOGY 2012; 6:137. [PMID: 23107425 PMCID: PMC3573927 DOI: 10.1186/1752-0509-6-137] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2011] [Accepted: 10/17/2012] [Indexed: 12/04/2022]
Abstract
Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them.
Collapse
|
9
|
An L, Doerge RW. Dynamic clustering of gene expression. ISRN BIOINFORMATICS 2012; 2012:537217. [PMID: 25969750 PMCID: PMC4393063 DOI: 10.5402/2012/537217] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Accepted: 08/05/2012] [Indexed: 11/23/2022]
Abstract
It is well accepted that genes are simultaneously involved in multiple biological processes and that genes are coordinated over the duration of such events. Unfortunately, clustering methodologies that group genes for the purpose of novel gene discovery fail to acknowledge the dynamic nature of biological processes and provide static clusters, even when the expression of genes is assessed across time or developmental stages. By taking advantage of techniques and theories from time frequency analysis, periodic gene expression profiles are dynamically clustered based on the assumption that different spectral frequencies characterize different biological processes. A two-step cluster validation approach is proposed to statistically estimate both the optimal number of clusters and to distinguish significant clusters from noise. The resulting clusters reveal coordinated coexpressed genes. This novel dynamic clustering approach has broad applicability to a vast range of sequential data scenarios where the order of the series is of interest.
Collapse
Affiliation(s)
- Lingling An
- Department of Agricultural and Biosystems Engineering, University of Arizona, Tucson, AZ 85721, USA
| | - R. W. Doerge
- Department of Statistics, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
10
|
Kojima K, Imoto S, Yamaguchi R, Fujita A, Yamauchi M, Gotoh N, Miyano S. Identifying regulational alterations in gene regulatory networks by state space representation of vector autoregressive models and variational annealing. BMC Genomics 2012; 13 Suppl 1:S6. [PMID: 22369122 PMCID: PMC3587380 DOI: 10.1186/1471-2164-13-s1-s6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Background In the analysis of effects by cell treatment such as drug dosing, identifying changes on gene network structures between normal and treated cells is a key task. A possible way for identifying the changes is to compare structures of networks estimated from data on normal and treated cells separately. However, this approach usually fails to estimate accurate gene networks due to the limited length of time series data and measurement noise. Thus, approaches that identify changes on regulations by using time series data on both conditions in an efficient manner are demanded. Methods We propose a new statistical approach that is based on the state space representation of the vector autoregressive model and estimates gene networks on two different conditions in order to identify changes on regulations between the conditions. In the mathematical model of our approach, hidden binary variables are newly introduced to indicate the presence of regulations on each condition. The use of the hidden binary variables enables an efficient data usage; data on both conditions are used for commonly existing regulations, while for condition specific regulations corresponding data are only applied. Also, the similarity of networks on two conditions is automatically considered from the design of the potential function for the hidden binary variables. For the estimation of the hidden binary variables, we derive a new variational annealing method that searches the configuration of the binary variables maximizing the marginal likelihood. Results For the performance evaluation, we use time series data from two topologically similar synthetic networks, and confirm that our proposed approach estimates commonly existing regulations as well as changes on regulations with higher coverage and precision than other existing approaches in almost all the experimental settings. For a real data application, our proposed approach is applied to time series data from normal Human lung cells and Human lung cells treated by stimulating EGF-receptors and dosing an anticancer drug termed Gefitinib. In the treated lung cells, a cancer cell condition is simulated by the stimulation of EGF-receptors, but the effect would be counteracted due to the selective inhibition of EGF-receptors by Gefitinib. However, gene expression profiles are actually different between the conditions, and the genes related to the identified changes are considered as possible off-targets of Gefitinib. Conclusions From the synthetically generated time series data, our proposed approach can identify changes on regulations more accurately than existing methods. By applying the proposed approach to the time series data on normal and treated Human lung cells, candidates of off-target genes of Gefitinib are found. According to the published clinical information, one of the genes can be related to a factor of interstitial pneumonia, which is known as a side effect of Gefitinib.
Collapse
Affiliation(s)
- Kaname Kojima
- Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | | | | | | | | | | | | |
Collapse
|
11
|
Dimitrakopoulou K, Tsimpouris C, Papadopoulos G, Pommerenke C, Wilk E, Sgarbas KN, Schughart K, Bezerianos A. Dynamic gene network reconstruction from gene expression data in mice after influenza A (H1N1) infection. J Clin Bioinforma 2011; 1:27. [PMID: 22017961 PMCID: PMC3219564 DOI: 10.1186/2043-9113-1-27] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2011] [Accepted: 10/21/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The immune response to viral infection is a temporal process, represented by a dynamic and complex network of gene and protein interactions. Here, we present a reverse engineering strategy aimed at capturing the temporal evolution of the underlying Gene Regulatory Networks (GRN). The proposed approach will be an enabling step towards comprehending the dynamic behavior of gene regulation circuitry and mapping the network structure transitions in response to pathogen stimuli. RESULTS We applied the Time Varying Dynamic Bayesian Network (TV-DBN) method for reconstructing the gene regulatory interactions based on time series gene expression data for the mouse C57BL/6J inbred strain after infection with influenza A H1N1 (PR8) virus. Initially, 3500 differentially expressed genes were clustered with the use of k-means algorithm. Next, the successive in time GRNs were built over the expression profiles of cluster centroids. Finally, the identified GRNs were examined with several topological metrics and available protein-protein and protein-DNA interaction data, transcription factor and KEGG pathway data. CONCLUSIONS Our results elucidate the potential of TV-DBN approach in providing valuable insights into the temporal rewiring of the lung transcriptome in response to H1N1 virus.
Collapse
|
12
|
Ouyang Z, Song M, Güth R, Ha TJ, Larouche M, Goldowitz D. Conserved and differential gene interactions in dynamical biological systems. ACTA ACUST UNITED AC 2011; 27:2851-8. [PMID: 21840874 DOI: 10.1093/bioinformatics/btr472] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION While biological systems operated from a common genome can be conserved in various ways, they can also manifest highly diverse dynamics and functions. This is because the same set of genes can interact differentially across specific molecular contexts. For example, differential gene interactions give rise to various stages of morphogenesis during cerebellar development. However, after over a decade of efforts toward reverse engineering biological networks from high-throughput omic data, gene networks of most organisms remain sketchy. This hindrance has motivated us to develop comparative modeling to highlight conserved and differential gene interactions across experimental conditions, without reconstructing complete gene networks first. RESULTS We established a comparative dynamical system modeling (CDSM) approach to identify conserved and differential interactions across molecular contexts. In CDSM, interactions are represented by ordinary differential equations and compared across conditions through statistical heterogeneity and homogeneity tests. CDSM demonstrated a consistent superiority over differential correlation and reconstruct-then-compare in simulation studies. We exploited CDSM to elucidate gene interactions important for cellular processes poorly understood during mouse cerebellar development. We generated hypotheses on 66 differential genetic interactions involved in expansion of the external granule layer. These interactions are implicated in cell cycle, differentiation, apoptosis and morphogenesis. Additional 1639 differential interactions among gene clusters were also identified when we compared gene interactions during the presence of Rhombic lip versus the presence of distinct internal granule layer. Moreover, compared with differential correlation and reconstruct-then-compare, CDSM makes fewer assumptions on data and thus is applicable to a wider range of biological assays. AVAILABILITY Source code in C++ and R is available for non-commercial organizations upon request from the corresponding author. The cerebellum gene expression dataset used in this article is available upon request from the Goldowitz lab (dang@cmmt.ubc.ca, http://grits.dglab.org/). CONTACT joemsong@cs.nmsu.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhengyu Ouyang
- Department of Computer Science, New Mexico State University, Las Cruces, NM 88003, USA
| | | | | | | | | | | |
Collapse
|