1
|
Mobeen A, Joshi S, Fatima F, Bhargav A, Arif Y, Faruq M, Ramachandran S. NF-κB signaling is the major inflammatory pathway for inducing insulin resistance. 3 Biotech 2025; 15:47. [PMID: 39845928 PMCID: PMC11747027 DOI: 10.1007/s13205-024-04202-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Accepted: 12/23/2024] [Indexed: 01/24/2025] Open
Abstract
Insulin resistance is major factor in the development of metabolic syndrome and type 2 diabetes (T2D). We extracted 430 genes from literature associated with both insulin resistance and inflammation. The highly significant pathways were Toll-like receptor signaling, PI3K-Akt signaling, cytokine-cytokine receptor interaction, pathways in cancer, TNF signaling, and NF-kappa B signaling. Among the 297 common genes in all datasets of various T2D patients' tissues including blood, muscle, liver, pancreas, and adipose tissues, 71% and 60% of these genes were differentially expressed in pancreas (GSE25724) and liver (GSE15653), respectively. A total of 169 genes contain highly conserved motifs for various transcription factors involved in immune response, thereby suggesting coordinated expression. Through co-expression analysis, we obtained three modules. The respective modules had 78, 158, and 55 genes, and TRAF2, HMGA1, and RGS5 as hub genes. Further, we used the BioNSi pathways simulation tool and identified the following five KEGG pathways perturbed in four or more tissues, namely Toll-like receptor signaling pathway, RIG-1-like receptor signaling pathway, pathways in cancer, NF-kappa B signaling pathway, and insulin resistance pathway. The genes NFKBIA and IKBKB are common to all these five pathways. In addition, using the NF-κB computational activation model, we identified that the reversal of NF-κB constitutive activation through overexpression of NFKB1 (P50 homodimer), PPARG, PIAS3 could reduce insulin resistance by almost half of its original value. To conclude, co-expression studies, gene expression network simulation, and NF-κB computational modeling substantiate the causal role of NF-κB pathway in insulin resistance. These results taken together with other published evidence suggests that the TNF-TRAF2-IKBKB-NF-κB axis could be explored as a potential target in combination with available metabolic targets in the management of insulin resistance. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-024-04202-4.
Collapse
Affiliation(s)
- Ahmed Mobeen
- CSIR Institute of Genomics & Integrative Biology, Sukhdev Vihar, New Delhi, 110025 India
| | - Sweta Joshi
- Department of Food Technology, SIST, Jamia Hamdard, New Delhi, 110062 India
| | - Firdaus Fatima
- CSIR Institute of Genomics & Integrative Biology, Sukhdev Vihar, New Delhi, 110025 India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh 201002 India
| | - Anasuya Bhargav
- CSIR Institute of Genomics & Integrative Biology, Sukhdev Vihar, New Delhi, 110025 India
| | - Yusra Arif
- Centre of Bioinformatics, Institute of Inter Disciplinary Studies, Allahabad University, Allahabad, Uttar Pradesh 211002 India
| | - Mohammed Faruq
- CSIR Institute of Genomics & Integrative Biology, Sukhdev Vihar, New Delhi, 110025 India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh 201002 India
| | - Srinivasan Ramachandran
- CSIR Institute of Genomics & Integrative Biology, Sukhdev Vihar, New Delhi, 110025 India
- Manav Rachna International Institute of Research and Studies, Sector 43, Delhi–Surajkund Road, Faridabad, Haryana 121004 India
| |
Collapse
|
2
|
Alcantar MA, English MA, Valeri JA, Collins JJ. A high-throughput synthetic biology approach for studying combinatorial chromatin-based transcriptional regulation. Mol Cell 2024; 84:2382-2396.e9. [PMID: 38906116 DOI: 10.1016/j.molcel.2024.05.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 04/11/2024] [Accepted: 05/24/2024] [Indexed: 06/23/2024]
Abstract
The construction of synthetic gene circuits requires the rational combination of multiple regulatory components, but predicting their behavior can be challenging due to poorly understood component interactions and unexpected emergent behaviors. In eukaryotes, chromatin regulators (CRs) are essential regulatory components that orchestrate gene expression. Here, we develop a screening platform to investigate the impact of CR pairs on transcriptional activity in yeast. We construct a combinatorial library consisting of over 1,900 CR pairs and use a high-throughput workflow to characterize the impact of CR co-recruitment on gene expression. We recapitulate known interactions and discover several instances of CR pairs with emergent behaviors. We also demonstrate that supervised machine learning models trained with low-dimensional amino acid embeddings accurately predict the impact of CR co-recruitment on transcriptional activity. This work introduces a scalable platform and machine learning approach that can be used to study how networks of regulatory components impact gene expression.
Collapse
Affiliation(s)
- Miguel A Alcantar
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, MIT, Cambridge, MA 02139, USA
| | - Max A English
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, MIT, Cambridge, MA 02139, USA
| | - Jacqueline A Valeri
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, MIT, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - James J Collins
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, MIT, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
3
|
Kang JE, Jun JH, Kwon JH, Lee JH, Hwang K, Kim S, Jeong N. Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-Liquid/Liquid Phase Separation, Oligomerization, GO Analysis: A Toolkit for Interaction Data-Based Domain Analysis. Genes (Basel) 2023; 14:1476. [PMID: 37510380 PMCID: PMC10379056 DOI: 10.3390/genes14071476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 07/04/2023] [Accepted: 07/14/2023] [Indexed: 07/30/2023] Open
Abstract
Although a large number of databases are available for regulatory elements, a bottleneck has been created by the lack of bioinformatics tools to predict the interaction modes of regulatory elements. To reduce this gap, we developed the Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-liquid/liquid phase separation (LLPS), oligomerization, GO analysis (ART FOUNDATION-LOG), a useful toolkit for protein-nucleic acid interaction (PNI) and protein-protein interaction (PPI) analysis based on domain-domain interactions (DDIs). LLPS, protein oligomerization, the structural properties of protein domains, and protein modifications are major components in the orchestration of the spatiotemporal dynamics of PPIs and PNIs. Our goal is to integrate PPI/PNI information into the development of a prediction model for identifying important genetic variants in peaches. Our program unified interdatabase relational keys based on protein domains to facilitate inference from the model species. A key advantage of this program lies in the integrated information of related features, such as protein oligomerization, LOG analysis, structural characterizations of domains (e.g., domain linkers, intrinsically disordered regions, DDIs, domain-motif (peptide) interactions, beta sheets, and transmembrane helices), and post-translational modification. We provided simple tests to demonstrate how to use this program, which can be applied to other eukaryotic organisms.
Collapse
Affiliation(s)
- Jee Eun Kang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ji Hae Jun
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Jung Hyun Kwon
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ju-Hyun Lee
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Kidong Hwang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Sungjong Kim
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Namhee Jeong
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| |
Collapse
|
4
|
Edrisi Maryan K, Farrokhi N, Samizadeh Lahiji H. Cold-responsive transcription factors in Arabidopsis and rice: A regulatory network analysis using array data and gene co-expression network. PLoS One 2023; 18:e0286324. [PMID: 37289769 PMCID: PMC10249815 DOI: 10.1371/journal.pone.0286324] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 05/15/2023] [Indexed: 06/10/2023] Open
Abstract
Plant growth and development can be influenced by cold stress. Responses of plants to cold are regulated in part by transcription factors (TFs) and microRNAs, which their determination would be necessary in comprehension of the corresponding molecular cues. Here, transcriptomes of Arabidopsis and rice were analyzed to computationally determine TFs and microRNAs that are differentially responsive to cold treatment, and their co-expression networks were established. Among 181 Arabidopsis and 168 rice differentially expressed TF genes, 37 (26 novel) were up- and 16 (8 novel) were downregulated. Common TF encoding genes were from ERF, MYB, bHLH, NFY, bZIP, GATA, HSF and WRKY families. NFY A4/C2/A10 were the significant hub TFs in both plants. Phytohormone responsive cis-elements such as ABRE, TGA, TCA and LTR were the common cis-elements in TF promoters. Arabidopsis had more responsive TFs compared to rice possibly due to its greater adaptation to ranges geographical latitudes. Rice had more relevant miRNAs probably because of its bigger genome size. The interacting partners and co-expressed genes were different for the common TFs so that of the downstream regulatory networks and the corresponding metabolic pathways. Identified cold-responsive TFs in (A + R) seemed to be more engaged in energy metabolism esp. photosynthesis, and signal transduction, respectively. At post-transcriptional level, miR5075 showed to target many identified TFs in rice. In comparison, the predictions showed that identified TFs are being targeted by diverse groups of miRNAs in Arabidopsis. Novel TFs, miRNAs and co-expressed genes were introduced as cold-responsive markers that can be harnessed in future studies and development of crop tolerant varieties.
Collapse
Affiliation(s)
- Khazar Edrisi Maryan
- Department of Cell & Molecular Biology, Faculty of Life Sciences & Biotechnology, Shahid Beheshti University, Tehran, Iran
- Department of Plant Biotechnology, Faculty of Agriculture, University of Guilan, Rasht, Iran
| | - Naser Farrokhi
- Department of Cell & Molecular Biology, Faculty of Life Sciences & Biotechnology, Shahid Beheshti University, Tehran, Iran
| | | |
Collapse
|
5
|
A systematic study of HIF1A cofactors in hypoxic cancer cells. Sci Rep 2022; 12:18962. [PMID: 36347941 PMCID: PMC9643333 DOI: 10.1038/s41598-022-23060-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 10/25/2022] [Indexed: 11/09/2022] Open
Abstract
Hypoxia inducible factor 1 alpha (HIF1A) is a transcription factor (TF) that forms highly structural and functional protein-protein interactions with other TFs to promote gene expression in hypoxic cancer cells. However, despite the importance of these TF-TF interactions, we still lack a comprehensive view of many of the TF cofactors involved and how they cooperate. In this study, we systematically studied HIF1A cofactors in eight cancer cell lines using the computational motif mining tool, SIOMICS, and discovered 201 potential HIF1A cofactors, which included 21 of the 29 known HIF1A cofactors in public databases. These 201 cofactors were statistically and biologically significant, with 19 of the top 37 cofactors in our study directly validated in the literature. The remaining 18 were novel cofactors. These discovered cofactors can be essential to HIF1A's regulatory functions and may lead to the discovery of new therapeutic targets in cancer treatment.
Collapse
|
6
|
Jia M, Yuan DY, Lovelace TC, Hu M, Benos PV. Causal Discovery in High-dimensional, Multicollinear Datasets. FRONTIERS IN EPIDEMIOLOGY 2022; 2:899655. [PMID: 36778756 PMCID: PMC9910507 DOI: 10.3389/fepid.2022.899655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 08/05/2022] [Indexed: 11/13/2022]
Abstract
As the cost of high-throughput genomic sequencing technology declines, its application in clinical research becomes increasingly popular. The collected datasets often contain tens or hundreds of thousands of biological features that need to be mined to extract meaningful information. One area of particular interest is discovering underlying causal mechanisms of disease outcomes. Over the past few decades, causal discovery algorithms have been developed and expanded to infer such relationships. However, these algorithms suffer from the curse of dimensionality and multicollinearity. A recently introduced, non-orthogonal, general empirical Bayes approach to matrix factorization has been demonstrated to successfully infer latent factors with interpretable structures from observed variables. We hypothesize that applying this strategy to causal discovery algorithms can solve both the high dimensionality and collinearity problems, inherent to most biomedical datasets. We evaluate this strategy on simulated data and apply it to two real-world datasets. In a breast cancer dataset, we identified important survival-associated latent factors and biologically meaningful enriched pathways within factors related to important clinical features. In a SARS-CoV-2 dataset, we were able to predict whether a patient (1) had Covid-19 and (2) would enter the ICU. Furthermore, we were able to associate factors with known Covid-19 related biological pathways.
Collapse
Affiliation(s)
- Minxue Jia
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Joint Carnegie Mellon - University of Pittsburgh Computational Biology PhD Program, Pittsburgh, PA, United States
| | - Daniel Y. Yuan
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Joint Carnegie Mellon - University of Pittsburgh Computational Biology PhD Program, Pittsburgh, PA, United States
| | - Tyler C. Lovelace
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Joint Carnegie Mellon - University of Pittsburgh Computational Biology PhD Program, Pittsburgh, PA, United States
| | - Mengying Hu
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Joint Carnegie Mellon - University of Pittsburgh Computational Biology PhD Program, Pittsburgh, PA, United States
| | - Panayiotis V. Benos
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Joint Carnegie Mellon - University of Pittsburgh Computational Biology PhD Program, Pittsburgh, PA, United States
- Department of Epidemiology, University of Florida, Gainesville, FL, United States
| |
Collapse
|
7
|
Kim C, Wang X, Kültz D. Prediction and Experimental Validation of a New Salinity-Responsive Cis-Regulatory Element (CRE) in a Tilapia Cell Line. Life (Basel) 2022; 12:787. [PMID: 35743818 PMCID: PMC9225295 DOI: 10.3390/life12060787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 05/12/2022] [Accepted: 05/16/2022] [Indexed: 11/16/2022] Open
Abstract
Transcriptional regulation is a major mechanism by which organisms integrate gene x environment interactions. It can be achieved by coordinated interplay between cis-regulatory elements (CREs) and transcription factors (TFs). Euryhaline tilapia (Oreochromis mossambicus) tolerate a wide range of salinity and thus are an appropriate model to examine transcriptional regulatory mechanisms during salinity stress in fish. Quantitative proteomics in combination with the transcription inhibitor actinomycin D revealed 19 proteins that are transcriptionally upregulated by hyperosmolality in tilapia brain (OmB) cells. We searched the extended proximal promoter up to intron1 of each corresponding gene for common motifs using motif discovery tools. The top-ranked motif identified (STREME1) represents a binding site for the Forkhead box TF L1 (FoxL1). STREME1 function during hyperosmolality was experimentally validated by choosing two of the 19 genes, chloride intracellular channel 2 (clic2) and uridine phosphorylase 1 (upp1), that are enriched in STREME1 in their extended promoters. Transcriptional induction of these genes during hyperosmolality requires STREME1, as evidenced by motif mutagenesis. We conclude that STREME1 represents a new functional CRE that contributes to gene x environment interactions during salinity stress in tilapia. Moreover, our results indicate that FoxL1 family TFs are contribute to hyperosmotic induction of genes in euryhaline fish.
Collapse
Affiliation(s)
- Chanhee Kim
- Stress-Induced Evolution Laboratory, Department of Animal Sciences, University of California, Davis, CA 95616, USA;
| | - Xiaodan Wang
- Laboratory of Aquaculture Nutrition and Environmental Health, School of Life Sciences, East China Normal University, Shanghai 200241, China;
| | - Dietmar Kültz
- Stress-Induced Evolution Laboratory, Department of Animal Sciences, University of California, Davis, CA 95616, USA;
| |
Collapse
|
8
|
Cyclin/Forkhead-mediated coordination of cyclin waves: an autonomous oscillator rationalizing the quantitative model of Cdk control for budding yeast. NPJ Syst Biol Appl 2021; 7:48. [PMID: 34903735 PMCID: PMC8668886 DOI: 10.1038/s41540-021-00201-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 11/01/2021] [Indexed: 01/21/2023] Open
Abstract
Networks of interacting molecules organize topology, amount, and timing of biological functions. Systems biology concepts required to pin down 'network motifs' or 'design principles' for time-dependent processes have been developed for the cell division cycle, through integration of predictive computer modeling with quantitative experimentation. A dynamic coordination of sequential waves of cyclin-dependent kinases (cyclin/Cdk) with the transcription factors network offers insights to investigate how incompatible processes are kept separate in time during the eukaryotic cell cycle. Here this coordination is discussed for the Forkhead transcription factors in light of missing gaps in the current knowledge of cell cycle control in budding yeast. An emergent design principle is proposed where cyclin waves are synchronized by a cyclin/Cdk-mediated feed-forward regulation through the Forkhead as a transcriptional timer. This design is rationalized by the bidirectional interaction between mitotic cyclins and the Forkhead transcriptional timer, resulting in an autonomous oscillator that may be instrumental for a well-timed progression throughout the cell cycle. The regulation centered around the cyclin/Cdk-Forkhead axis can be pivotal to timely coordinate cell cycle dynamics, thereby to actuate the quantitative model of Cdk control.
Collapse
|
9
|
Wang H, Huang B, Wang J. Predict long-range enhancer regulation based on protein-protein interactions between transcription factors. Nucleic Acids Res 2021; 49:10347-10368. [PMID: 34570239 PMCID: PMC8501976 DOI: 10.1093/nar/gkab841] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 08/10/2021] [Accepted: 09/10/2021] [Indexed: 12/18/2022] Open
Abstract
Long-range regulation by distal enhancers plays critical roles in cell-type specific transcriptional programs. Computational predictions of genome-wide enhancer-promoter interactions are still challenging due to limited accuracy and the lack of knowledge on the molecular mechanisms. Based on recent biological investigations, the protein-protein interactions (PPIs) between transcription factors (TFs) have been found to participate in the regulation of chromatin loops. Therefore, we developed a novel predictive model for cell-type specific enhancer-promoter interactions by leveraging the information of TF PPI signatures. Evaluated by a series of rigorous performance comparisons, the new model achieves superior performance over other methods. The model also identifies specific TF PPIs that may mediate long-range regulatory interactions, revealing new mechanistic understandings of enhancer regulation. The prioritized TF PPIs are associated with genes in distinct biological pathways, and the predicted enhancer-promoter interactions are strongly enriched with cis-eQTLs. Most interestingly, the model discovers enhancer-mediated trans-regulatory links between TFs and genes, which are significantly enriched with trans-eQTLs. The new predictive model, along with the genome-wide analyses, provides a platform to systematically delineate the complex interplay among TFs, enhancers and genes in long-range regulation. The novel predictions also lead to mechanistic interpretations of eQTLs to decode the genetic associations with gene expression.
Collapse
Affiliation(s)
- Hao Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, 428 S. Shaw Ln., East Lansing, MI 48824, USA
| | - Binbin Huang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, 428 S. Shaw Ln., East Lansing, MI 48824, USA
| | - Jianrong Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, 428 S. Shaw Ln., East Lansing, MI 48824, USA
| |
Collapse
|
10
|
Rowland MA, Pilkiewicz KR, Mayo ML. Devil in the details: Mechanistic variations impact information transfer across models of transcriptional cascades. PLoS One 2021; 16:e0245094. [PMID: 33439904 PMCID: PMC7806174 DOI: 10.1371/journal.pone.0245094] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 12/22/2020] [Indexed: 11/19/2022] Open
Abstract
The transcriptional network determines a cell’s internal state by regulating protein expression in response to changes in the local environment. Due to the interconnected nature of this network, information encoded in the abundance of various proteins will often propagate across chains of noisy intermediate signaling events. The data-processing inequality (DPI) leads us to expect that this intracellular game of “telephone” should degrade this type of signal, with longer chains losing successively more information to noise. However, a previous modeling effort predicted that because the steps of these signaling cascades do not truly represent independent stages of data processing, the limits of the DPI could seemingly be surpassed, and the amount of transmitted information could actually increase with chain length. What that work did not examine was whether this regime of growing information transmission was attainable by a signaling system constrained by the mechanistic details of more complex protein-binding kinetics. Here we address this knowledge gap through the lens of information theory by examining a model that explicitly accounts for the binding of each transcription factor to DNA. We analyze this model by comparing stochastic simulations of the fully nonlinear kinetics to simulations constrained by the linear response approximations that displayed a regime of growing information. Our simulations show that even when molecular binding is considered, there remains a regime wherein the transmitted information can grow with cascade length, but ends after a critical number of links determined by the kinetic parameter values. This inflection point marks where correlations decay in response to an oversaturation of binding sites, screening informative transcription factor fluctuations from further propagation down the chain where they eventually become indistinguishable from the surrounding levels of noise.
Collapse
Affiliation(s)
- Michael A. Rowland
- Environmental Laboratory, U.S. Army Engineer Research and Development Center, Vicksburg, MS, United States of America
- * E-mail:
| | - Kevin R. Pilkiewicz
- Environmental Laboratory, U.S. Army Engineer Research and Development Center, Vicksburg, MS, United States of America
| | - Michael L. Mayo
- Environmental Laboratory, U.S. Army Engineer Research and Development Center, Vicksburg, MS, United States of America
| |
Collapse
|
11
|
Wang H, Liu Y, Guan H, Fan GL. The Regulation of Target Genes by Co-occupancy of Transcription Factors, c-Myc and Mxi1 with Max in the Mouse Cell Line. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191106103633] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Background:
The regulatory function of transcription factors on genes is not only related
to the location of binding genes and its related functions, but is also related to the methods of
binding.
Objective:
It is necessary to study the regulation effects in different binding methods on target genes.
Methods:
In this study, we provided a reliable theoretical basis for studying gene expression
regulation of co-binding transcription factors and further revealed the specific regulation of
transcription factor co-binding in cancer cells.
Results:
Transcription factors tend to combine with other transcription factors in the regulatory
region to form a competitive or synergistic relationship to regulate target genes accurately.
Conclusion:
We found that up-regulated genes in cancer cells were involved in the regulation of
their own immune system related to the normal cells.
Collapse
Affiliation(s)
- Hui Wang
- Department of Physics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| | - Yuan Liu
- Department of Physics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| | - Hua Guan
- ENT Department, Huhhot First Hospital, Hohhot, China
| | - Guo-Liang Fan
- Department of Physics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| |
Collapse
|
12
|
An osmolality/salinity-responsive enhancer 1 (OSRE1) in intron 1 promotes salinity induction of tilapia glutamine synthetase. Sci Rep 2020; 10:12103. [PMID: 32694739 PMCID: PMC7374092 DOI: 10.1038/s41598-020-69090-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 06/11/2020] [Indexed: 01/02/2023] Open
Abstract
Euryhaline tilapia (Oreochromis mossambicus) are fish that tolerate a wide salinity range from fresh water to > 3× seawater. Even though the physiological effector mechanisms of osmoregulation that maintain plasma homeostasis in fresh water and seawater fish are well known, the corresponding molecular mechanisms that control switching between hyper- (fresh water) and hypo-osmoregulation (seawater) remain mostly elusive. In this study we show that hyperosmotic induction of glutamine synthetase represents a prominent part of this switch. Proteomics analysis of the O. mossambicus OmB cell line revealed that glutamine synthetase is transcriptionally regulated by hyperosmolality. Therefore, the 5' regulatory sequence of O. mossambicus glutamine synthetase was investigated. Using an enhancer trapping assay, we discovered a novel osmosensitive mechanism by which intron 1 positively mediates glutamine synthetase transcription. Intron 1 includes a single, functional copy of an osmoresponsive element, osmolality/salinity-responsive enhancer 1 (OSRE1). Unlike for conventional enhancers, the hyperosmotic induction of glutamine synthetase by intron 1 is position dependent. But irrespective of intron 1 position, OSRE1 deletion from intron 1 abolishes hyperosmotic enhancer activity. These findings indicate that proper intron 1 positioning and the presence of an OSRE1 in intron 1 are required for precise enhancement of hyperosmotic glutamine synthetase expression.
Collapse
|
13
|
Rana R, Joon S, Kumar Jain A, Kumar Mohanty N. A study on the effect of phthalate esters and their metabolites on idiopathic infertile males. Andrologia 2020; 52:e13720. [PMID: 32567059 DOI: 10.1111/and.13720] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 05/24/2020] [Accepted: 05/28/2020] [Indexed: 11/28/2022] Open
Abstract
Phthalate plasticisers in medical, cosmetic and consumer products might pose serious health implications in humans including infertility. We sought to investigate the correlation, if any, between the phthalates and their metabolites and sperm quality parameters, and male infertility. Phthalate esters (15) and their metabolites (5) were estimated in the blood serum and urine samples from the age-matched 152 infertile and 75 fertile males using gas chromatography (GC) and high-performance liquid chromatography (HPLC). Finally, the data were analysed to correlate phthalate exposure and semen quality parameters in the infertility group. The estimated levels of DEHP, DBP, DIBP, BEHIP, BPBG, DPP, DIOP, DIHP, DMP, DINP, BIOP, DMOP and DICHP were significantly higher in the infertile males compared to the fertile males (p < .05 or p < .01). However, these were not found to be associated with the semen quality parameters (sperm count, motility and sperm morphology). Similarly, HPLC data revealed that the associations between semen parameters (sperm count, sperm motility and sperm morphology) and phthalate metabolite (MEHP and MBP) concentrations in urine samples from the infertile males were mostly unremarkable or statistically nonsignificant. Conclusively, environmental exposure to phthalates and their impacts on male infertility were statistically insignificant in our study groups.
Collapse
Affiliation(s)
- Rashmi Rana
- Department of Research, Sir Ganga Ram Hospital, New Delhi, India
| | - Shikha Joon
- Department of Research, Sir Ganga Ram Hospital, New Delhi, India
| | | | | |
Collapse
|
14
|
Zhang S, Liang Y, Wang X, Su Z, Chen Y. FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets. DNA Res 2019; 26:231-242. [PMID: 30957858 PMCID: PMC6589551 DOI: 10.1093/dnares/dsz004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Accepted: 03/05/2019] [Indexed: 11/14/2022] Open
Abstract
Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also has high accuracy when compared with multiple popular methods. By using FisherMP, we successfully detected combinatorial motifs of CTCF, YY1, MAZ, STAT3 and USF2 in chromosome X, suggesting that they are functional co-players in gene regulation and chromosomal organization. Integrative and statistical analysis of these TF-binding peaks clearly demonstrate that they are not only highly coordinated with each other, but that they are also correlated with histone modifications. FisherMP can be applied for integrative analysis of binding motifs and for predicting cis-regulatory modules from a large number of ChIP-seq datasets.
Collapse
Affiliation(s)
- Shaoqiang Zhang
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin, China
| | - Ying Liang
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin, China
| | - Xiangyun Wang
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin, China
| | - Zhengchang Su
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin, China
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, NC, USA
| | - Yong Chen
- Department of Biological Sciences, Center for Systems Biology, the University of Texas at Dallas, Richardson, TX, USA
| |
Collapse
|
15
|
Lukoseviciute M, Gavriouchkina D, Williams RM, Hochgreb-Hagele T, Senanayake U, Chong-Morrison V, Thongjuea S, Repapi E, Mead A, Sauka-Spengler T. From Pioneer to Repressor: Bimodal foxd3 Activity Dynamically Remodels Neural Crest Regulatory Landscape In Vivo. Dev Cell 2019; 47:608-628.e6. [PMID: 30513303 PMCID: PMC6286384 DOI: 10.1016/j.devcel.2018.11.009] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Revised: 08/15/2018] [Accepted: 10/31/2018] [Indexed: 02/06/2023]
Abstract
The neural crest (NC) is a transient embryonic stem cell-like population characterized by its multipotency and broad developmental potential. Here, we perform NC-specific transcriptional and epigenomic profiling of foxd3-mutant cells in vivo to define the gene regulatory circuits controlling NC specification. Together with global binding analysis obtained by foxd3 biotin-ChIP and single cell profiles of foxd3-expressing premigratory NC, our analysis shows that, during early steps of NC formation, foxd3 acts globally as a pioneer factor to prime the onset of genes regulating NC specification and migration by re-arranging the chromatin landscape, opening cis-regulatory elements and reshuffling nucleosomes. Strikingly, foxd3 then gradually switches from an activator to its well-described role as a transcriptional repressor and potentially uses differential partners for each role. Taken together, these results demonstrate that foxd3 acts bimodally in the neural crest as a switch from “permissive” to “repressive” nucleosome and chromatin organization to maintain multipotency and define cell fates. FoxD3 primes neural crest specification by modulating distal enhancers FoxD3 represses a number of neural crest migration and differentiation genes In neural crest, FoxD3 acts to switch chromatin from “permissive” to “repressive” Distinctive gene regulatory mechanisms underlie the bimodal action of FoxD3
Collapse
Affiliation(s)
- Martyna Lukoseviciute
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Daria Gavriouchkina
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Ruth M Williams
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Tatiana Hochgreb-Hagele
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Upeka Senanayake
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Vanessa Chong-Morrison
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Supat Thongjuea
- Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Emmanouela Repapi
- MRC WIMM Centre for Computational Biology Research Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Adam Mead
- Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Tatjana Sauka-Spengler
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK.
| |
Collapse
|
16
|
Grewal RK, Saraf S, Deb A, Kundu S. Differentially Expressed MicroRNAs Link Cellular Physiology to Phenotypic Changes in Rice Under Stress Conditions. PLANT & CELL PHYSIOLOGY 2018; 59:2143-2154. [PMID: 30010993 DOI: 10.1093/pcp/pcy136] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Indexed: 06/08/2023]
Abstract
Plant microRNAs (miRNAs) and their target genes have important functional roles in nutrition deficiency and stress response. However, the underlying mechanisms relating relative expression of miRNAs and target mRNAs to morphological adjustments are not well defined. By combining miRNA expression profiles, corresponding target genes and transcription factors that bind to computationally identified over-represented cis-regulatory elements (CREs) common in miRNAs and target gene promoters, we implement a strategy that identifies a set of differentially expressed regulatory interactions which, in turn, relate underlying cellular mechanisms to some of the phenotypic changes observed. Integration of experimentally reported individual interactions with identified regulatory interactions explains how (i) during mineral deficiency osa-miR167 inhibits shoot growth but activates adventitious root growth by influencing free auxin content; (ii) during sulfur deficiency osa-miR394 is involved in adventitious root growth inhibition, sulfur and iron homeostasis, and auxin-mediated regulation of sulfur homeostasis; (iii) osa-miR399 contributes to cross-talk between cytokinin and phosphorus deficiency signaling; and (iv) a feed-forward loop involving the osa-miR166, trihelix and HD-ZIP III transcription factors may regulate leaf senescence during drought. This strategy not only identifies various regulatory interactions connecting phenotypic changes with cellular or molecular events triggered by stress, but also provides a framework to deepen our understanding of stress cellular physiology.
Collapse
Affiliation(s)
- Rumdeep K Grewal
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
- Department of Botany, Bhairab Ganguly College, Kolkata, India
| | - Shradha Saraf
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
| | - Arindam Deb
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
| | - Sudip Kundu
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
| |
Collapse
|
17
|
Ma W, Yang L, Rohs R, Noble WS. DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding. Bioinformatics 2018; 33:3003-3010. [PMID: 28541376 PMCID: PMC5870879 DOI: 10.1093/bioinformatics/btx336] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Accepted: 05/23/2017] [Indexed: 01/07/2023] Open
Abstract
Motivation Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. Results We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. Availability and implementation The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wenxiu Ma
- Department of Statistics, University of California Riverside, Riverside, CA 92521, USA
| | - Lin Yang
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Remo Rohs
- Molecular and Computational Biology Program, Departments of Biological Sciences, Chemistry, Physics, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - William Stafford Noble
- Department of Genome Sciences, Department of Computer Science and Engineering, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
18
|
Node-independent elementary signaling modes: A measure of redundancy in Boolean signaling transduction networks. ACTA ACUST UNITED AC 2016. [DOI: 10.1017/nws.2016.4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
AbstractThe redundancy of a system denotes the amount of duplicate components or mechanisms in it. For a network, especially one in which mass or information is being transferred from an origin to a destination, redundancy is related to the robustness of the system. Existing network measures of redundancy rely on local connectivity (e.g. clustering coefficients) or the existence of multiple paths. As in many systems there are functional dependencies between components and paths, a measure that not only characterizes the topology of a network, but also takes into account these functional dependencies, becomes most desirable.We propose a network redundancy measure in a prototypical model that contains functionally dependent directed paths: a Boolean model of a signal transduction network. The functional dependencies are made explicit by using an expanded network and the concept of elementary signaling modes (ESMs). We define the redundancy of a Boolean signal transduction network as the maximum number of node-independent ESMs and develop a methodology for identifying all maximal node-independent ESM combinations. We apply our measure to a number of signal transduction network models and show that it successfully distills known properties of the systems and offers new functional insights. The concept can be easily extended to similar related forms, e.g. edge-independent ESMs.
Collapse
|
19
|
Abstract
Transcriptional control of gene expression requires interactions between the cis-regulatory elements (CREs) controlling gene promoters. We developed a sensitive computational method to identify CRE combinations with conserved spacing that does not require genome alignments. When applied to seven sensu stricto and sensu lato Saccharomyces species, 80% of the predicted interactions displayed some evidence of combinatorial transcriptional behavior in several existing datasets including: (1) chromatin immunoprecipitation data for colocalization of transcription factors, (2) gene expression data for coexpression of predicted regulatory targets, and (3) gene ontology databases for common pathway membership of predicted regulatory targets. We tested several predicted CRE interactions with chromatin immunoprecipitation experiments in a wild-type strain and strains in which a predicted cofactor was deleted. Our experiments confirmed that transcription factor (TF) occupancy at the promoters of the CRE combination target genes depends on the predicted cofactor while occupancy of other promoters is independent of the predicted cofactor. Our method has the additional advantage of identifying regulatory differences between species. By analyzing the S. cerevisiae and S. bayanus genomes, we identified differences in combinatorial cis-regulation between the species and showed that the predicted changes in gene regulation explain several of the species-specific differences seen in gene expression datasets. In some instances, the same CRE combinations appear to regulate genes involved in distinct biological processes in the two different species. The results of this research demonstrate that (1) combinatorial cis-regulation can be inferred by multi-genome analysis and (2) combinatorial cis-regulation can explain differences in gene expression between species.
Collapse
|
20
|
Deb A, Grewal RK, Kundu S. Regulatory Cross-Talks and Cascades in Rice Hormone Biosynthesis Pathways Contribute to Stress Signaling. FRONTIERS IN PLANT SCIENCE 2016; 7:1303. [PMID: 27617021 PMCID: PMC4999436 DOI: 10.3389/fpls.2016.01303] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 08/15/2016] [Indexed: 05/18/2023]
Abstract
Crosstalk among different hormone signaling pathways play an important role in modulating plant response to both biotic and abiotic stress. Hormone activity is controlled by its bio-availability, which is again influenced by its biosynthesis. Thus, independent hormone biosynthesis pathways must be regulated and co-ordinated to mount an integrated response. One of the possibilities is to use cis-regulatory elements to orchestrate expression of hormone biosynthesis genes. Analysis of CREs, associated with differentially expressed hormone biosynthesis related genes in rice leaf under Magnaporthe oryzae attack and drought stress enabled us to obtain insights about cross-talk among hormone biosynthesis pathways at the transcriptional level. We identified some master transcription regulators that co-ordinate different hormone biosynthesis pathways under stress. We found that Abscisic acid and Brassinosteroid regulate Cytokinin conjugation; conversely Brassinosteroid biosynthesis is affected by both Abscisic acid and Cytokinin. Jasmonic acid and Ethylene biosynthesis may be modulated by Abscisic acid through DREB transcription factors. Jasmonic acid or Salicylic acid biosynthesis pathways are co-regulated but they are unlikely to influence each others production directly. Thus, multiple hormones may modulate hormone biosynthesis pathways through a complex regulatory network, where biosynthesis of one hormone is affected by several other contributing hormones.
Collapse
Affiliation(s)
- Arindam Deb
- Department of Biophysics, Molecular Biology and Bioinformatics, University of CalcuttaKolkata, India
| | - Rumdeep K. Grewal
- Department of Biophysics, Molecular Biology and Bioinformatics, University of CalcuttaKolkata, India
- Computational Systems Biology Group, Center of Excellence in Systems Biology and Biomedical Engineering, University of CalcuttaKolkata, India
| | - Sudip Kundu
- Department of Biophysics, Molecular Biology and Bioinformatics, University of CalcuttaKolkata, India
- Computational Systems Biology Group, Center of Excellence in Systems Biology and Biomedical Engineering, University of CalcuttaKolkata, India
- *Correspondence: Sudip Kundu
| |
Collapse
|
21
|
Liu L, Zhao W, Zhou X. Modeling co-occupancy of transcription factors using chromatin features. Nucleic Acids Res 2015; 44:e49. [PMID: 26590261 PMCID: PMC4797273 DOI: 10.1093/nar/gkv1281] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2015] [Accepted: 11/04/2015] [Indexed: 12/11/2022] Open
Abstract
Regulation of gene expression requires both transcription factor (TFs) and epigenetic modifications, and interplays between the two types of factors have been discovered. However study of relationships between chromatin features and TF–TF co-occupancy remains limited. Here, we revealed the relationship by first illustrating distinct profile patterns of chromatin features related to different binding events, including single TF binding and TF–TF co-occupancy of 71 TFs from five human cell lines. We further implemented statistical analyses to demonstrate the relationship by accurately predicting co-occupancy genome-widely using chromatin features including DNase I hypersensitivity, 11 histone modifications (HMs) and GC content. Remarkably, our results showed that the combination of chromatin features enables accurate predictions across the five cells. For individual chromatin features, DNase I enables high and consistent predictions. H3K27ac, H3K4me 2, H3K4me3 and H3K9ac are more reliable predictors than other HMs. Although the combination of 11 HMs achieves accurate predictions, their predictive ability varies considerably when a model obtained from one cell is applied to others, indicating relationship between HMs and TF–TF co-occupancy is cell type dependent. GC content is not a reliable predictor, but the addition of GC content to any other features enhances their predictive ability. Together, our results elucidate a strong relationship between TF–TF co-occupancy and chromatin features.
Collapse
Affiliation(s)
- Liang Liu
- Center for Bioinformatics and Systems Biology, Department of Radiology, Wake Forest School of Medicine, Winston-Salem, NC 27157, USA
| | - Weiling Zhao
- Center for Bioinformatics and Systems Biology, Department of Radiology, Wake Forest School of Medicine, Winston-Salem, NC 27157, USA
| | - Xiaobo Zhou
- Center for Bioinformatics and Systems Biology, Department of Radiology, Wake Forest School of Medicine, Winston-Salem, NC 27157, USA
| |
Collapse
|
22
|
Deb A, Kundu S. Deciphering Cis-Regulatory Element Mediated Combinatorial Regulation in Rice under Blast Infected Condition. PLoS One 2015; 10:e0137295. [PMID: 26327607 PMCID: PMC4556519 DOI: 10.1371/journal.pone.0137295] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Accepted: 08/14/2015] [Indexed: 01/15/2023] Open
Abstract
Combinations of cis-regulatory elements (CREs) present at the promoters facilitate the binding of several transcription factors (TFs), thereby altering the consequent gene expressions. Due to the eminent complexity of the regulatory mechanism, the combinatorics of CRE-mediated transcriptional regulation has been elusive. In this work, we have developed a new methodology that quantifies the co-occurrence tendencies of CREs present in a set of promoter sequences; these co-occurrence scores are filtered in three consecutive steps to test their statistical significance; and the significantly co-occurring CRE pairs are presented as networks. These networks of co-occurring CREs are further transformed to derive higher order of regulatory combinatorics. We have further applied this methodology on the differentially up-regulated gene-sets of rice tissues under fungal (Magnaporthe) infected conditions to demonstrate how it helps to understand the CRE-mediated combinatorial gene regulation. Our analysis includes a wide spectrum of biologically important results. The CRE pairs having a strong tendency to co-occur often exhibit very similar joint distribution patterns at the promoters of rice. We couple the network approach with experimental results of plant gene regulation and defense mechanisms and find evidences of auto and cross regulation among TF families, cross-talk among multiple hormone signaling pathways, similarities and dissimilarities in regulatory combinatorics between different tissues, etc. Our analyses have pointed a highly distributed nature of the combinatorial gene regulation facilitating an efficient alteration in response to fungal attack. All together, our proposed methodology could be an important approach in understanding the combinatorial gene regulation. It can be further applied to unravel the tissue and/or condition specific combinatorial gene regulation in other eukaryotic systems with the availability of annotated genomic sequences and suitable experimental data.
Collapse
Affiliation(s)
- Arindam Deb
- Department of Biophysics Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, West Bengal, India
| | - Sudip Kundu
- Department of Biophysics Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, West Bengal, India
- Center of Excellence in Systems Biology and Biomedical Engineering (TEQIP Phase II), University of Calcutta, Kolkata, West Bengal, India
- * E-mail:
| |
Collapse
|
23
|
Wu WS. A Computational Method for Identifying Yeast Cell Cycle Transcription Factors. Methods Mol Biol 2015; 1342:209-19. [PMID: 26254926 DOI: 10.1007/978-1-4939-2957-3_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
The eukaryotic cell cycle is a complex process and is precisely regulated at many levels. Many genes specific to the cell cycle are regulated transcriptionally and are expressed just before they are needed. To understand the cell cycle process, it is important to identify the cell cycle transcription factors (TFs) that regulate the expression of cell cycle-regulated genes. Here, we describe a computational method to identify cell cycle TFs in yeast by integrating current ChIP-chip, mutant, transcription factor-binding site (TFBS), and cell cycle gene expression data. For each identified cell cycle TF, our method also assigned specific cell cycle phases in which the TF functions and identified the time lag for the TF to exert regulatory effects on its target genes. Moreover, our method can identify novel cell cycle-regulated genes as a by-product.
Collapse
Affiliation(s)
- Wei-Sheng Wu
- Department of Electrical Engineering, National Cheng Kung University, No. 1 Daxue Road, East District, Tainan City, 701, Taiwan,
| |
Collapse
|
24
|
Liu ZP. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data. Curr Genomics 2015; 16:3-22. [PMID: 25937810 PMCID: PMC4412962 DOI: 10.2174/1389202915666141110210634] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Revised: 09/05/2014] [Accepted: 09/05/2014] [Indexed: 12/17/2022] Open
Abstract
Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented.
Collapse
Affiliation(s)
- Zhi-Ping Liu
- Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| |
Collapse
|
25
|
Liu L, Zhang SW, Zhang YC, Liu H, Zhang L, Chen R, Huang Y, Meng J. Decomposition of RNA methylome reveals co-methylation patterns induced by latent enzymatic regulators of the epitranscriptome. MOLECULAR BIOSYSTEMS 2014; 11:262-74. [PMID: 25370990 DOI: 10.1039/c4mb00604f] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Biochemical modifications to mRNA, especially N6-methyladenosine (m6A) and 5-methylcytosine (m5C), have been recently shown to be associated with crucial biological functions. Despite the intriguing advancements, little is known so far about the dynamic landscape of RNA methylome across different cell types and how the epitranscriptome is regulated at the system level by enzymes, i.e., RNA methyltransferases and demethylases. To investigate this issue, a meta-analysis of m6A MeRIP-Seq datasets collected from 10 different experimental conditions (cell type/tissue or treatment) is performed, and the combinatorial epitranscriptome, which consists of 42 758 m6A sites, is extracted and divided into 3 clusters, in which the methylation sites are likely to be hyper- or hypo-methylated simultaneously (or co-methylated), indicating the sharing of a common methylation regulator. Four different clustering approaches are used, including K-means, hierarchical clustering (HC), Bayesian factor regression model (BFRM) and nonnegative matrix factorization (NMF) to unveil the co-methylation patterns. To validate whether the patterns are corresponding to enzymatic regulators, i.e., RNA methyltransferases or demethylases, the target sites of a known m6A regulator, fat mass and obesity-associated protein (FTO), are identified from an independent mouse MeRIP-Seq dataset and lifted to human. Our study shows that 3 out of the 4 clustering approaches used can successfully identify a group of methylation sites overlapping with FTO target sites at a significance level of 0.05 (after multiple hypothesis adjustment), among which, the result of NMF is the most significant (p-value 2.81×10(-06)). We defined a new approach evaluating the consistency between two clustering results which shows that clustering results of different methods are highly correlated strongly indicating the existence of co-methylation patterns. Consistent with recent studies, a number of cancer and neuronal disease-related bimolecular functions are enriched in the identified clusters, which are biological functions that can be regulated at the epitranscriptional level, indicating the pharmaceutical prospect of RNA N6-methyladenosine-related studies. This result successfully reveals the linkage between the global RNA co-methylation patterns embedded in the epitranscriptomic data under multiple experimental conditions and the latent enzymatic regulators, suggesting a promising direction towards a more comprehensive understanding of the epitranscriptome.
Collapse
Affiliation(s)
- Lian Liu
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi'an, Shaanxi 710027, China.
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Abstract
Nearly 20% of the budding yeast genome is transcribed periodically during the cell division cycle. The precise temporal execution of this large transcriptional program is controlled by a large interacting network of transcriptional regulators, kinases, and ubiquitin ligases. Historically, this network has been viewed as a collection of four coregulated gene clusters that are associated with each phase of the cell cycle. Although the broad outlines of these gene clusters were described nearly 20 years ago, new technologies have enabled major advances in our understanding of the genes comprising those clusters, their regulation, and the complex regulatory interplay between clusters. More recently, advances are being made in understanding the roles of chromatin in the control of the transcriptional program. We are also beginning to discover important regulatory interactions between the cell-cycle transcriptional program and other cell-cycle regulatory mechanisms such as checkpoints and metabolic networks. Here we review recent advances and contemporary models of the transcriptional network and consider these models in the context of eukaryotic cell-cycle controls.
Collapse
|
27
|
Chi SM, Seo YK, Park YK, Yoon S, Park CY, Kim YS, Kim SY, Nam D. REGNET: mining context-specific human transcription networks using composite genomic information. BMC Genomics 2014; 15:450. [PMID: 24912499 PMCID: PMC4070555 DOI: 10.1186/1471-2164-15-450] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Accepted: 05/27/2014] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Genome-wide expression profiles reflect the transcriptional networks specific to the given cell context. However, most statistical models try to estimate the average connectivity of the networks from a collection of gene expression data, and are unable to characterize the context-specific transcriptional regulations. We propose an approach for mining context-specific transcription networks from a large collection of gene expression fold-change profiles and composite gene-set information. RESULTS Using a composite gene-set analysis method, we combine the information of transcription factor binding sites, Gene Ontology or pathway gene sets and gene expression fold-change profiles for a variety of cell conditions. We then collected all the significant patterns and constructed a database of context-specific transcription networks for human (REGNET). As a result, context-specific roles of transcription factors as well as their functional targets are readily explored. To validate the approach, nine predicted targets of E2F1 in HeLa cells were tested using chromatin immunoprecipitation assay. Among them, five (Gadd45b, Dusp6, Mll5, Bmp2 and E2f3) were successfully bound by E2F1. c-JUN and the EMT transcription networks were also validated from literature. CONCLUSIONS REGNET is a useful tool for exploring the ternary relationships among the transcription factors, their functional targets and the corresponding cell conditions. It is able to provide useful clues for novel cell-specific transcriptional regulations. The REGNET database is available at http://mgrc.kribb.re.kr/regnet.
Collapse
Affiliation(s)
- Sang-Mun Chi
- />School of Computer Science and Engineering, Kyungsung University, Busan, Republic of Korea
| | - Young-Kyo Seo
- />School of Life Sciences, UNIST, Ulsan, Republic of Korea
| | - Young-Kyu Park
- />Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Sora Yoon
- />School of Life Sciences, UNIST, Ulsan, Republic of Korea
| | | | - Yong Sung Kim
- />Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Seon-Young Kim
- />Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Dougu Nam
- />School of Life Sciences, UNIST, Ulsan, Republic of Korea
- />Division of Mathematical Sciences, UNIST, Ulsan, Republic of Korea
| |
Collapse
|
28
|
Corrado G, Tebaldi T, Bertamini G, Costa F, Quattrone A, Viero G, Passerini A. PTRcombiner: mining combinatorial regulation of gene expression from post-transcriptional interaction maps. BMC Genomics 2014; 15:304. [PMID: 24758252 PMCID: PMC4234518 DOI: 10.1186/1471-2164-15-304] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 04/02/2014] [Indexed: 02/07/2023] Open
Abstract
Background The progress in mapping RNA-protein and RNA-RNA interactions at the transcriptome-wide level paves the way to decipher possible combinatorial patterns embedded in post-transcriptional regulation of gene expression. Results Here we propose an innovative computational tool to extract clusters of mRNA trans-acting co-regulators (RNA binding proteins and non-coding RNAs) from pairwise interaction annotations. In addition the tool allows to analyze the binding site similarity of co-regulators belonging to the same cluster, given their positional binding information. The tool has been tested on experimental collections of human and yeast interactions, identifying modules that coordinate functionally related messages. Conclusions This tool is an original attempt to uncover combinatorial patterns using all the post-transcriptional interaction data available so far. PTRcombiner is available at http://disi.unitn.it/~passerini/software/PTRcombiner/.
Collapse
Affiliation(s)
| | | | | | | | | | - Gabriella Viero
- Department of Information Engineering and Computer Science (DISI), University of Trento, 38123 Trento, Italy.
| | | |
Collapse
|
29
|
Liu G, Marras A, Nielsen J. The future of genome-scale modeling of yeast through integration of a transcriptional regulatory network. QUANTITATIVE BIOLOGY 2014. [DOI: 10.1007/s40484-014-0027-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
30
|
Hendriks GJ, Gaidatzis D, Aeschimann F, Großhans H. Extensive oscillatory gene expression during C. elegans larval development. Mol Cell 2014; 53:380-92. [PMID: 24440504 DOI: 10.1016/j.molcel.2013.12.013] [Citation(s) in RCA: 146] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Revised: 11/25/2013] [Accepted: 12/12/2013] [Indexed: 10/25/2022]
Abstract
Oscillations are a key to achieving dynamic behavior and thus occur in biological systems as diverse as the beating heart, defecating worms, and nascent somites. Here we report pervasive, large-amplitude, and phase-locked oscillations of gene expression in developing C. elegans larvae, caused by periodic transcription. Nearly one fifth of detectably expressed transcripts oscillate with an 8 hr period, and hundreds change >10-fold. Oscillations are important for molting but occur in all phases, implying additional functions. Ribosome profiling reveals that periodic mRNA accumulation causes rhythmic translation, potentially facilitating transient protein accumulation as well as coordinated production of stable, complex structures such as the cuticle. Finally, large-amplitude oscillations in RNA sampled from whole worms indicate robust synchronization of gene expression programs across cells and tissues, suggesting that these oscillations will be a powerful new model to study coordinated gene expression in an animal.
Collapse
Affiliation(s)
- Gert-Jan Hendriks
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, CH-4058 Basel, Switzerland; University of Basel, Petersplatz 1, CH-4003 Basel, Switzerland
| | - Dimos Gaidatzis
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, CH-4058 Basel, Switzerland; Swiss Institute of Bioinformatics, CH-4058 Basel, Switzerland
| | - Florian Aeschimann
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, CH-4058 Basel, Switzerland; University of Basel, Petersplatz 1, CH-4003 Basel, Switzerland
| | - Helge Großhans
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, CH-4058 Basel, Switzerland.
| |
Collapse
|
31
|
Korkuć P, Schippers JH, Walther D. Characterization and identification of cis-regulatory elements in Arabidopsis based on single-nucleotide polymorphism information. PLANT PHYSIOLOGY 2014; 164:181-200. [PMID: 24204023 PMCID: PMC3875800 DOI: 10.1104/pp.113.229716] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2013] [Accepted: 11/06/2013] [Indexed: 05/19/2023]
Abstract
Identifying regulatory elements and revealing their role in gene expression regulation remains a central goal of plant genome research. We exploited the detailed genomic sequencing information of a large number of Arabidopsis (Arabidopsis thaliana) accessions to characterize known and to identify novel cis-regulatory elements in gene promoter regions of Arabidopsis by relying on conservation as the hallmark signal of functional relevance. Based on the genomic layout and the obtained density profiles of single-nucleotide polymorphisms (SNPs) in sequence regions upstream of transcription start sites, the average length of promoter regions in Arabidopsis could be established at 500 bp. Genes associated with high degrees of variability of their respective upstream regions are preferentially involved in environmental response and signaling processes, while low levels of promoter SNP density are common among housekeeping genes. Known cis-elements were found to exhibit a decreased SNP density than sequence regions not associated with known motifs. For 15 known cis-element motifs, strong positional preferences relative to the transcription start site were detected based on their promoter SNP density profiles. Five novel candidate cis-element motifs were identified as consensus motifs of 17 sequence hexamers exhibiting increased sequence conservation combined with evidence of positional preferences, annotation information, and functional relevance for inducing correlated gene expression. Our study demonstrates that the currently available resolution of SNP data offers novel ways for the identification of functional genomic elements and the characterization of gene promoter sequences.
Collapse
|
32
|
Role of the repressor Oaf3p in the recruitment of transcription factors and chromatin dynamics during the oleate response. Biochem J 2013; 449:507-17. [PMID: 23088601 DOI: 10.1042/bj20121029] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Cellular responses to environmental stimuli are mediated by the co-ordinated activity of multiple control mechanisms, which result in the dynamics of cell function. Communication between different levels of regulation is central for this adaptability. The present study focuses on the interplay between transcriptional regulators and chromatin modifiers to co-operatively regulate transcription in response to a fatty acid stimulus. The genes involved in the β-oxidation of fatty acids are highly induced in response to fatty acid exposure by four gene-specific transcriptional regulators, Oaf (oleate-activated transcription factor) 1p, Pip2p (peroxisome induction pathway 2), Oaf3p and Adr1p (alcohol dehydrogenase regulator 1). In the present study, we examine the interplay of these factors with Htz1p (histone variant H2A.Z) in regulating POT1 (peroxisomal oxoacyl thiolase 1) encoding peroxisomal thiolase and PIP2 encoding the autoregulatory oleate-specific transcriptional activator. Temporal resolution of ChIP (chromatin immunoprecipitation) data indicates that Htz1p is required for the timely removal of the transcriptional repressor Oaf3p during oleate induction. Adr1p plays an important role in the assembly of Htz1p-containing nucleosomes on the POT1 and PIP2 promoters. We also investigated the function of the uncharacterized transcriptional inhibitor Oaf3p. Deletion of OAF3 led to faster POT1 mRNA accumulation than in the wild-type. Most impressively, a highly protected nucleosome structure on the POT1 promoter during activation was observed in the OAF3 mutant cells in response to oleate induction.
Collapse
|
33
|
Bean GJ, Ideker T. Differential analysis of high-throughput quantitative genetic interaction data. Genome Biol 2012; 13:R123. [PMID: 23268787 PMCID: PMC4056373 DOI: 10.1186/gb-2012-13-12-r123] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Revised: 11/15/2012] [Accepted: 12/26/2012] [Indexed: 11/10/2022] Open
Abstract
Synthetic genetic arrays have been very effective at measuring genetic interactions in yeast in a high-throughput manner and recently have been expanded to measure quantitative changes in interaction, termed 'differential interactions', across multiple conditions. Here, we present a strategy that leverages statistical information from the experimental design to produce a novel, quantitative differential interaction score, which performs favorably compared to previous differential scores. We also discuss the added utility of differential genetic-similarity in differential network analysis. Our approach is preferred for differential network analysis, and our implementation, written in MATLAB, can be found at http://chianti.ucsd.edu/~gbean/compute_differential_scores.m.
Collapse
Affiliation(s)
- Gordon J Bean
- Bioinformatics and Systems Biology Program, University of California, San Diego,
9500 Gilman Drive, Dept. 0419, La Jolla, CA 92093-0419, USA
| | - Trey Ideker
- Bioinformatics and Systems Biology Program, University of California, San Diego,
9500 Gilman Drive, Dept. 0419, La Jolla, CA 92093-0419, USA
- Department of Bioengineering, University of California, San Diego, 9500 Gilman
Drive MC 0412, La Jolla, CA 92093-0412, USA
- Institute for Genomic Medicine, University of California, San Diego, 9500 Gilman
Drive, 0642, La Jolla, CA 92093, USA
- Department of Medicine, University of California, San Diego, 9500 Gilman Drive, #
0671, La Jolla, CA 92093-0671, USA
| |
Collapse
|
34
|
Vandenbon A, Kumagai Y, Akira S, Standley DM. A novel unbiased measure for motif co-occurrence predicts combinatorial regulation of transcription. BMC Genomics 2012; 13 Suppl 7:S11. [PMID: 23282148 PMCID: PMC3521209 DOI: 10.1186/1471-2164-13-s7-s11] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Background Multiple transcription factors (TFs) are involved in the generation of gene expression patterns, such as tissue-specific gene expression and pleiotropic immune responses. However, how combinations of TFs orchestrate diverse gene expression patterns is poorly understood. Here we propose a new measure for regulatory motif co-occurrence and a new methodology to systematically identify TF pairs significantly co-occurring in a set of promoter sequences. Results Initial analyses suggest that non-CpG promoters have a higher potential for combinatorial regulation than CpG island-associated promoters, and that co-occurrences are strongly influenced by motif similarity. We applied our method to large-scale gene expression data from various tissues, and showed how our measure for motif co-occurrence is not biased by motif over-representation. Our method identified, amongst others, the binding motifs of HNF1 and FOXP1 to be significantly co-occurring in promoters of liver/kidney specific genes. Binding sites tend to be positioned proximally to each other, suggesting interactions exist between this pair of transcription factors. Moreover, the binding sites of several TFs were found to co-occur with NF-κB and IRF sites in sets of genes with similar expression patterns in dendritic cells after Toll-like receptor stimulation. Of these, we experimentally verified that CCAAT enhancer binding protein alpha positively regulates its target promoters synergistically with NF-κB. Conclusions Both computational and experimental results indicate that the proposed method can clarify TF interactions that could not be observed by currently available prediction methods.
Collapse
Affiliation(s)
- Alexis Vandenbon
- Laboratory of Systems Immunology, Immunology Frontier Research Center, Osaka University, 3-1 Yamada-oka, Suita, Osaka 565-0871, Japan.
| | | | | | | |
Collapse
|
35
|
From plant gene regulatory grids to network dynamics. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2012; 1819:454-65. [DOI: 10.1016/j.bbagrm.2012.02.016] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2011] [Revised: 02/15/2012] [Accepted: 02/16/2012] [Indexed: 11/19/2022]
|
36
|
Lee TY, Chang WC, Hsu JBK, Chang TH, Shien DM. GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group. BMC Genomics 2012; 13 Suppl 1:S3. [PMID: 22369687 PMCID: PMC3587379 DOI: 10.1186/1471-2164-13-s1-s3] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Background Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do not consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates the gene group that is associated with same expression pattern. To investigate gene transcriptional regulation, an integrated system that annotates regulatory features in a promoter sequence and detects co-regulation of TFs in a group of genes is needed. Results This work identifies TSSs and regulatory features in a promoter sequence, and recognizes co-occurrence of cis-regulatory elements in co-expressed genes using a novel system. Three well-known TSS prediction tools are incorporated with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the novel Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The proposed system allows users to input a group of gene names/symbols, enabling the co-occurrence of TFBSs to be determined. Additionally, an input sequence can also be analyzed for homogeneity of experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After identifying promoter regions, regulatory features are visualized graphically to facilitate gene promoter observations. Conclusions The GPMiner, which has a user-friendly input/output interface, has numerous benefits in analyzing human and mouse promoters. The proposed system is freely available at http://GPMiner.mbc.nctu.edu.tw/.
Collapse
Affiliation(s)
- Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan.
| | | | | | | | | |
Collapse
|
37
|
Chen MJM, Chou LC, Hsieh TT, Lee DD, Liu KW, Yu CY, Oyang YJ, Tsai HK, Chen CY. De novo motif discovery facilitates identification of interactions between transcription factors in Saccharomyces cerevisiae. ACTA ACUST UNITED AC 2012; 28:701-8. [PMID: 22238267 DOI: 10.1093/bioinformatics/bts002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
MOTIVATION Gene regulation involves complicated mechanisms such as cooperativity between a set of transcription factors (TFs). Previous studies have used target genes shared by two TFs as a clue to infer TF-TF interactions. However, this task remains challenging because the target genes with low binding affinity are frequently omitted by experimental data, especially when a single strict threshold is employed. This article aims at improving the accuracy of inferring TF-TF interactions by incorporating motif discovery as a fundamental step when detecting overlapping targets of TFs based on ChIP-chip data. RESULTS The proposed method, simTFBS, outperforms three naïve methods that adopt fixed thresholds when inferring TF-TF interactions based on ChIP-chip data. In addition, simTFBS is compared with two advanced methods and demonstrates its advantages in predicting TF-TF interactions. By comparing simTFBS with predictions based on the set of available annotated yeast TF binding motifs, we demonstrate that the good performance of simTFBS is indeed coming from the additional motifs found by the proposed procedures. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mei-Ju May Chen
- Department of Computer Science and Information Engineering, National Taiwan University and Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Cheng C, Shou C, Yip KY, Gerstein MB. Genome-wide analysis of chromatin features identifies histone modification sensitive and insensitive yeast transcription factors. Genome Biol 2011; 12:R111. [PMID: 22060676 PMCID: PMC3334597 DOI: 10.1186/gb-2011-12-11-r111] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Revised: 10/12/2011] [Accepted: 11/07/2011] [Indexed: 12/20/2022] Open
Abstract
We propose a method to predict yeast transcription factor targets by integrating histone modification profiles with transcription factor binding motif information. It shows improved predictive power compared to a binding motif-only method. We find that transcription factors cluster into histone-sensitive and -insensitive classes. The target genes of histone-sensitive transcription factors have stronger histone modification signals than those of histone-insensitive ones. The two classes also differ in tendency to interact with histone modifiers, degree of connectivity in protein-protein interaction networks, position in the transcriptional regulation hierarchy, and in a number of additional features, indicating possible differences in their transcriptional regulation mechanisms.
Collapse
Affiliation(s)
- Chao Cheng
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | | | | | | |
Collapse
|
39
|
Eser U, Falleur-Fettig M, Johnson A, Skotheim JM. Commitment to a cellular transition precedes genome-wide transcriptional change. Mol Cell 2011; 43:515-27. [PMID: 21855792 DOI: 10.1016/j.molcel.2011.06.024] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2010] [Revised: 04/13/2011] [Accepted: 06/17/2011] [Indexed: 01/13/2023]
Abstract
In budding yeast, commitment to cell division corresponds to activating the positive feedback loop of G1 cyclins controlled by the transcription factors SBF and MBF. This pair of transcription factors has over 200 targets, implying that cell-cycle commitment coincides with genome-wide changes in transcription. Here, we find that genes within this regulon have a well-defined distribution of transcriptional activation times. Combinatorial use of SBF and MBF results in a logical OR function for gene expression and partially explains activation timing. Activation of G1 cyclin expression precedes the activation of the bulk of the G1/S regulon, ensuring that commitment to cell division occurs before large-scale changes in transcription. Furthermore, we find similar positive feedback-first regulation in the yeasts S. bayanus and S. cerevisiae, as well as human cells. The widespread use of the feedback-first motif in eukaryotic cell-cycle control, implemented by nonorthologous proteins, suggests its frequent deployment at cellular transitions.
Collapse
Affiliation(s)
- Umut Eser
- Department of Applied Physics, Stanford University, Stanford CA 94305, USA
| | | | | | | |
Collapse
|
40
|
Hsu JT, Peng CH, Hsieh WP, Lan CY, Tang CY. A novel method to identify cooperative functional modules: study of module coordination in the Saccharomyces cerevisiae cell cycle. BMC Bioinformatics 2011; 12:281. [PMID: 21749690 PMCID: PMC3143111 DOI: 10.1186/1471-2105-12-281] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2011] [Accepted: 07/12/2011] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Identifying key components in biological processes and their associations is critical for deciphering cellular functions. Recently, numerous gene expression and molecular interaction experiments have been reported in Saccharomyces cerevisiae, and these have enabled systematic studies. Although a number of approaches have been used to predict gene functions and interactions, tools that analyze the essential coordination of functional components in cellular processes still need to be developed. RESULTS In this work, we present a new approach to study the cooperation of functional modules (sets of functionally related genes) in a specific cellular process. A cooperative module pair is defined as two modules that significantly cooperate with certain functional genes in a cellular process. This method identifies cooperative module pairs that significantly influence a cellular process and the correlated genes and interactions that are essential to that process. Using the yeast cell cycle as an example, we identified 101 cooperative module associations among 82 modules, and importantly, we established a cell cycle-specific cooperative module network. Most of the identified module pairs cover cooperative pathways and components essential to the cell cycle. We found that 14, 36, 18, 15, and 20 cooperative module pairs significantly cooperate with genes regulated in early G1, late G1, S, G2, and M phase, respectively. Fifty-nine module pairs that correlate with Cdc28 and other essential regulators were also identified. These results are consistent with previous studies and demonstrate that our methodology is effective for studying cooperative mechanisms in the cell cycle. CONCLUSIONS In this work, we propose a new approach to identifying condition-related cooperative interactions, and importantly, we establish a cell cycle-specific cooperation module network. These results provide a global view of the cell cycle and the method can be used to discover the dynamic coordination properties of functional components in other cellular processes.
Collapse
Affiliation(s)
- Jeh-Ting Hsu
- Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan
| | | | | | | | | |
Collapse
|
41
|
Schmeier S, Jankovic B, Bajic VB. Simplified method to predict mutual interactions of human transcription factors based on their primary structure. PLoS One 2011; 6:e21887. [PMID: 21750739 PMCID: PMC3130058 DOI: 10.1371/journal.pone.0021887] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2010] [Accepted: 06/14/2011] [Indexed: 11/18/2022] Open
Abstract
Background Physical interactions between transcription factors (TFs) are necessary for forming regulatory protein complexes and thus play a crucial role in gene regulation. Currently, knowledge about the mechanisms of these TF interactions is incomplete and the number of known TF interactions is limited. Computational prediction of such interactions can help identify potential new TF interactions as well as contribute to better understanding the complex machinery involved in gene regulation. Methodology We propose here such a method for the prediction of TF interactions. The method uses only the primary sequence information of the interacting TFs, resulting in a much greater simplicity of the prediction algorithm. Through an advanced feature selection process, we determined a subset of 97 model features that constitute the optimized model in the subset we considered. The model, based on quadratic discriminant analysis, achieves a prediction accuracy of 85.39% on a blind set of interactions. This result is achieved despite the selection for the negative data set of only those TF from the same type of proteins, i.e. TFs that function in the same cellular compartment (nucleus) and in the same type of molecular process (transcription initiation). Such selection poses significant challenges for developing models with high specificity, but at the same time better reflects real-world problems. Conclusions The performance of our predictor compares well to those of much more complex approaches for predicting TF and general protein-protein interactions, particularly when taking the reduced complexity of model utilisation into account.
Collapse
Affiliation(s)
- Sebastian Schmeier
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Boris Jankovic
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Vladimir B. Bajic
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- * E-mail:
| |
Collapse
|
42
|
Goudot C, Etchebest C, Devaux F, Lelandais G. The reconstruction of condition-specific transcriptional modules provides new insights in the evolution of yeast AP-1 proteins. PLoS One 2011; 6:e20924. [PMID: 21695268 PMCID: PMC3111461 DOI: 10.1371/journal.pone.0020924] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Accepted: 05/15/2011] [Indexed: 11/19/2022] Open
Abstract
AP-1 proteins are transcription factors (TFs) that belong to the basic leucine zipper family, one of the largest families of TFs in eukaryotic cells. Despite high homology between their DNA binding domains, these proteins are able to recognize diverse DNA motifs. In yeasts, these motifs are referred as YRE (Yap Response Element) and are either seven (YRE-Overlap) or eight (YRE-Adjacent) base pair long. It has been proposed that the AP-1 DNA binding motif preference relies on a single change in the amino acid sequence of the yeast AP-1 TFs (an arginine in the YRE-O binding factors being replaced by a lysine in the YRE-A binding Yaps). We developed a computational approach to infer condition-specific transcriptional modules associated to the orthologous AP-1 protein Yap1p, Cgap1p and Cap1p, in three yeast species: the model yeast Saccharomyces cerevisiae and two pathogenic species Candida glabrata and Candida albicans. Exploitation of these modules in terms of predictions of the protein/DNA regulatory interactions changed our vision of AP-1 protein evolution. Cis-regulatory motif analyses revealed the presence of a conserved adenine in 5' position of the canonical YRE sites. While Yap1p, Cgap1p and Cap1p shared a remarkably low number of target genes, an impressive conservation was observed in the YRE sequences identified by Yap1p and Cap1p. In Candida glabrata, we found that Cgap1p, unlike Yap1p and Cap1p, recognizes YRE-O and YRE-A motifs. These findings were supported by structural data available for the transcription factor Pap1p (Schizosaccharomyces pombe). Thus, whereas arginine and lysine substitutions in Cgap1p and Yap1p proteins were reported as responsible for a specific YRE-O or YRE-A preference, our analyses rather suggest that the ancestral yeast AP-1 protein could recognize both YRE-O and YRE-A motifs and that the arginine/lysine exchange is not the only determinant of the specialization of modern Yaps for one motif or another.
Collapse
Affiliation(s)
- Christel Goudot
- Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), INSERM, U665, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR-S665, Paris, France
- INTS, Paris, France
| | - Catherine Etchebest
- Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), INSERM, U665, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR-S665, Paris, France
- INTS, Paris, France
| | - Frédéric Devaux
- Laboratoire de Génomique des Microorganismes, UMR7238 CNRS, Université Pierre et Marie Curie, Paris, France
| | - Gaëlle Lelandais
- Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), INSERM, U665, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR-S665, Paris, France
- INTS, Paris, France
| |
Collapse
|
43
|
Wong KC, Peng C, Wong MH, Leung KS. Generalizing and learning protein-DNA binding sequence representations by an evolutionary algorithm. Soft comput 2011. [DOI: 10.1007/s00500-011-0692-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
44
|
|
45
|
Bickel PJ, Boley N, Brown JB, Huang H, Zhang NR. Subsampling methods for genomic inference. Ann Appl Stat 2010. [DOI: 10.1214/10-aoas363] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
46
|
Su N, Wang Y, Qian M, Deng M. Combinatorial regulation of transcription factors and microRNAs. BMC SYSTEMS BIOLOGY 2010; 4:150. [PMID: 21059252 PMCID: PMC3225826 DOI: 10.1186/1752-0509-4-150] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2010] [Accepted: 11/08/2010] [Indexed: 11/14/2022]
Abstract
Background Gene regulation is a key factor in gaining a full understanding of molecular biology. Cis-regulatory modules (CRMs), consisting of multiple transcription factor binding sites, have been confirmed as the main regulators in gene expression. In recent years, a novel regulator known as microRNA (miRNA) has been found to play an important role in gene regulation. Meanwhile, transcription factor and microRNA co-regulation has been widely identified. Thus, the relationships between CRMs and microRNAs have generated interest among biologists. Results We constructed new combinatorial regulatory modules based on CRMs and miRNAs. By analyzing their effect on gene expression profiles, we found that genes targeted by both CRMs and miRNAs express in a significantly similar way. Furthermore, we constructed a regulatory network composed of CRMs, miRNAs, and their target genes. Investigating its structure, we found that the feed forward loop is a significant network motif, which plays an important role in gene regulation. In addition, we further analyzed the effect of miRNAs in embryonic cells, and we found that mir-154, as well as some other miRNAs, have significant co-regulation effect with CRMs in embryonic development. Conclusions Based on the co-regulation of CRMs and miRNAs, we constructed a novel combinatorial regulatory network which was found to play an important role in gene regulation, particularly during embryonic development.
Collapse
Affiliation(s)
- Naifang Su
- LMAM, School of Mathematical Sciences, Peking University, Beijing 100871, China
| | | | | | | |
Collapse
|
47
|
Ferrezuelo F, Colomina N, Futcher B, Aldea M. The transcriptional network activated by Cln3 cyclin at the G1-to-S transition of the yeast cell cycle. Genome Biol 2010; 11:R67. [PMID: 20573214 PMCID: PMC2911115 DOI: 10.1186/gb-2010-11-6-r67] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2010] [Accepted: 06/23/2010] [Indexed: 12/25/2022] Open
Abstract
Background The G1-to-S transition of the cell cycle in the yeast Saccharomyces cerevisiae involves an extensive transcriptional program driven by transcription factors SBF (Swi4-Swi6) and MBF (Mbp1-Swi6). Activation of these factors ultimately depends on the G1 cyclin Cln3. Results To determine the transcriptional targets of Cln3 and their dependence on SBF or MBF, we first have used DNA microarrays to interrogate gene expression upon Cln3 overexpression in synchronized cultures of strains lacking components of SBF and/or MBF. Secondly, we have integrated this expression dataset together with other heterogeneous data sources into a single probabilistic model based on Bayesian statistics. Our analysis has produced more than 200 transcription factor-target assignments, validated by ChIP assays and by functional enrichment. Our predictions show higher internal coherence and predictive power than previous classifications. Our results support a model whereby SBF and MBF may be differentially activated by Cln3. Conclusions Integration of heterogeneous genome-wide datasets is key to building accurate transcriptional networks. By such integration, we provide here a reliable transcriptional network at the G1-to-S transition in the budding yeast cell cycle. Our results suggest that to improve the reliability of predictions we need to feed our models with more informative experimental data.
Collapse
Affiliation(s)
- Francisco Ferrezuelo
- Departament de Ciències Mèdiques Bàsiques, Institut de Recerca Biomèdica de Lleida, Universitat de Lleida, Montserrat Roig 2, 25008 Lleida, Spain.
| | | | | | | |
Collapse
|
48
|
Hu J, Li H, Zhang J. Analysis of transcriptional synergy between upstream regions and introns in ribosomal protein genes of yeast. Comput Biol Chem 2010; 34:106-14. [PMID: 20430699 DOI: 10.1016/j.compbiolchem.2010.03.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2009] [Revised: 01/13/2010] [Accepted: 03/26/2010] [Indexed: 10/19/2022]
Abstract
Transcriptional regulation in eukaryotic genes generally requires combinatorial binding on DNA of multiple transcription factors. Though many analyses have been performed for identification of combinatorial patterns in promoter sequences, there are few studies concerned with introns of genes. Here our study focuses on the transcriptional synergistic (cooperative) regulation between upstream promoters and introns of ribosomal protein (RP) genes in Saccharomyces cerevisiae yeast. We first extract some potential transcriptional regulatory motifs based on a statistical comparative analysis. 98% of these motifs are accordance with experimental analyses. Then by pairing these motifs each other, we identify some potential synergistic motif pairs between upstream regions and introns of yeast RP genes (RPGs). Among 48 detected motif pairs, 44 match the binding sites for interacting transcriptional factors known from experiments or predictions. Checking the positions of these motif pairs in yeast RPGs, it is found that both motifs of the detected motif pairs are enriched in specific regions of upstream regions and introns, respectively. Some motif pairs present distance and orientation preferences, which may be favorable for transcription factors to bind simultaneously to DNA. These results will be helpful to understand the mechanism of synergistic regulation in yeast RPGs.
Collapse
Affiliation(s)
- Jun Hu
- Laboratory for Conservation and Utilization of Bio-resources, Yunnan University, Kunming 650091, China
| | | | | |
Collapse
|
49
|
Yu CY, Chou LC, Chang DTH. Predicting protein-protein interactions in unbalanced data using the primary structure of proteins. BMC Bioinformatics 2010; 11:167. [PMID: 20361868 PMCID: PMC2868006 DOI: 10.1186/1471-2105-11-167] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2009] [Accepted: 04/02/2010] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Elucidating protein-protein interactions (PPIs) is essential to constructing protein interaction networks and facilitating our understanding of the general principles of biological systems. Previous studies have revealed that interacting protein pairs can be predicted by their primary structure. Most of these approaches have achieved satisfactory performance on datasets comprising equal number of interacting and non-interacting protein pairs. However, this ratio is highly unbalanced in nature, and these techniques have not been comprehensively evaluated with respect to the effect of the large number of non-interacting pairs in realistic datasets. Moreover, since highly unbalanced distributions usually lead to large datasets, more efficient predictors are desired when handling such challenging tasks. RESULTS This study presents a method for PPI prediction based only on sequence information, which contributes in three aspects. First, we propose a probability-based mechanism for transforming protein sequences into feature vectors. Second, the proposed predictor is designed with an efficient classification algorithm, where the efficiency is essential for handling highly unbalanced datasets. Third, the proposed PPI predictor is assessed with several unbalanced datasets with different positive-to-negative ratios (from 1:1 to 1:15). This analysis provides solid evidence that the degree of dataset imbalance is important to PPI predictors. CONCLUSIONS Dealing with data imbalance is a key issue in PPI prediction since there are far fewer interacting protein pairs than non-interacting ones. This article provides a comprehensive study on this issue and develops a practical tool that achieves both good prediction performance and efficiency using only protein sequence information.
Collapse
Affiliation(s)
- Chi-Yuan Yu
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 106, Taiwan
| | - Lih-Ching Chou
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei 106, Taiwan
| | - Darby Tien-Hao Chang
- Department of Electrical Engineering, National Cheng Kung University, Tainan 70101, Taiwan
| |
Collapse
|
50
|
Xie Y, Pan W, Jeong KS, Xiao G, Khodursky AB. A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data. Stat Med 2010; 29:489-503. [PMID: 20049751 DOI: 10.1002/sim.3815] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The genome-wide DNA-protein-binding data, DNA sequence data and gene expression data represent complementary means to deciphering global and local transcriptional regulatory circuits. Combining these different types of data can not only improve the statistical power, but also provide a more comprehensive picture of gene regulation. In this paper, we propose a novel statistical model to augment protein-DNA-binding data with gene expression and DNA sequence data when available. We specify a hierarchical Bayes model and use Markov chain Monte Carlo simulations to draw inferences. Both simulation studies and an analysis of an experimental data set show that the proposed joint modeling method can significantly improve the specificity and sensitivity of identifying target genes as compared with conventional approaches relying on a single data source.
Collapse
Affiliation(s)
- Yang Xie
- Division of Biostatistics, Department of Clinical Sciences, University of Texas Southwestern Medical Center at Dallas, Dallas, TX, USA.
| | | | | | | | | |
Collapse
|