1
|
Zhang Z, Ruf-Zamojski F, Zamojski M, Bernard D, Chen X, Troyanskaya O, Sealfon S. Peak-agnostic high-resolution cis-regulatory circuitry mapping using single cell multiome data. Nucleic Acids Res 2024; 52:572-582. [PMID: 38084892 PMCID: PMC10810203 DOI: 10.1093/nar/gkad1166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 11/15/2023] [Accepted: 11/27/2023] [Indexed: 01/26/2024] Open
Abstract
Single same cell RNAseq/ATACseq multiome data provide unparalleled potential to develop high resolution maps of the cell-type specific transcriptional regulatory circuitry underlying gene expression. We present CREMA, a framework that recovers the full cis-regulatory circuitry by modeling gene expression and chromatin activity in individual cells without peak-calling or cell type labeling constraints. We demonstrate that CREMA overcomes the limitations of existing methods that fail to identify about half of functional regulatory elements which are outside the called chromatin 'peaks'. These circuit sites outside called peaks are shown to be important cell type specific functional regulatory loci, sufficient to distinguish individual cell types. Analysis of mouse pituitary data identifies a Gata2-circuit for the gonadotrope-enriched disease-associated Pcsk1 gene, which is experimentally validated by reduced gonadotrope expression in a gonadotrope conditional Gata2-knockout model. We present a web accessible human immune cell regulatory circuit resource, and provide CREMA as an R package.
Collapse
Affiliation(s)
- Zidong Zhang
- Department of Neurology, Center for Advanced Research on Diagnostic Assays, Icahn School of Medicine at Mount Sinai (ISMMS), New York, NY, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Frederique Ruf-Zamojski
- Department of Neurology, Center for Advanced Research on Diagnostic Assays, Icahn School of Medicine at Mount Sinai (ISMMS), New York, NY, USA
| | - Michel Zamojski
- Department of Neurology, Center for Advanced Research on Diagnostic Assays, Icahn School of Medicine at Mount Sinai (ISMMS), New York, NY, USA
| | - Daniel J Bernard
- Department of Pharmacology and Therapeutics, McGill University, Montreal, QC H3G 1Y6, Canada
| | - Xi Chen
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Olga G Troyanskaya
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Stuart C Sealfon
- Department of Neurology, Center for Advanced Research on Diagnostic Assays, Icahn School of Medicine at Mount Sinai (ISMMS), New York, NY, USA
| |
Collapse
|
2
|
Liu X, Han W, Hu X. Post-transcriptional regulation of myeloid cell-mediated inflammatory responses. Adv Immunol 2023; 160:59-82. [PMID: 38042586 DOI: 10.1016/bs.ai.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2023]
Abstract
Myeloid cells, particularly macrophages, act as the frontline responders to infectious agents and initiate inflammation. While the molecular mechanisms driving inflammatory responses have primarily focused on pattern recognition by myeloid cells and subsequent transcriptional events, it is crucial to note that post-transcriptional regulation plays a pivotal role in this process. In addition to the transcriptional regulation of innate immune responses, additional layers of intricate network of post-transcriptional mechanisms critically determine the quantity and duration of key inflammatory products and thus the outcome of immune responses. A multitude of mechanisms governing post-transcriptional regulation in innate immunity have been uncovered, encompassing RNA alternative splicing, mRNA stability, and translational regulation. This review encapsulates the current insights into the post-transcriptional regulation of inflammatory genes within myeloid cells, with particular emphasis on translational regulation during inflammation. While acknowledging the advancements, we also shed light on the existing gaps in immunological research pertaining to post-transcriptional levels and propose perspectives that controlling post-transcriptional process may serve as potential targets for therapeutic interventions in inflammatory diseases.
Collapse
Affiliation(s)
- Xingxian Liu
- Institute for Immunology, Tsinghua University, Beijing, P.R. China; Department of Basic Medical Sciences, Tsinghua University, Beijing, P.R. China; Tsinghua-Peking Center for Life Sciences, Beijing, P.R. China; Beijing Key Laboratory for Immunological Research on Chronic Diseases, Beijing, P.R. China
| | - Weidong Han
- Department of Bio-therapeutic, the First Medical Center, Chinese PLA General Hospital, Beijing, P.R. China
| | - Xiaoyu Hu
- Institute for Immunology, Tsinghua University, Beijing, P.R. China; Department of Basic Medical Sciences, Tsinghua University, Beijing, P.R. China; Tsinghua-Peking Center for Life Sciences, Beijing, P.R. China; The State Key Laboratory of Membrane Biology, Beijing, P.R. China.
| |
Collapse
|
3
|
Ner-Gaon H, Peleg R, Gazit R, Reiner-Benaim A, Shay T. Mapping the splicing landscape of the human immune system. Front Immunol 2023; 14:1116392. [PMID: 37711610 PMCID: PMC10499523 DOI: 10.3389/fimmu.2023.1116392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 08/14/2023] [Indexed: 09/16/2023] Open
Abstract
Most human genes code for more than one transcript. Different ratios of transcripts of the same gene can be found in different cell types or states, indicating differential use of transcription start sites or differential splicing. Such differential transcript use (DTUs) events provide an additional layer of regulation and protein diversity. With the exceptions of PTPRC and CIITA, there are very few reported cases of DTU events in the immune system. To rigorously map DTUs between different human immune cell types, we leveraged four publicly available RNA sequencing datasets. We identified 282 DTU events between five human healthy immune cell types that appear in at least two datasets. The patterns of the DTU events were mostly cell-type-specific or lineage-specific, in the context of the five cell types tested. DTUs correlated with the expression pattern of potential regulators, namely, splicing factors and transcription factors. Of the several immune related conditions studied, only sepsis affected the splicing of more than a few genes and only in innate immune cells. Taken together, we map the DTUs landscape in human peripheral blood immune cell types, and present hundreds of genes whose transcript use changes between cell types or upon activation.
Collapse
Affiliation(s)
- Hadas Ner-Gaon
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Ronnie Peleg
- The Shraga Segal Department of Microbiology, Immunology and Genetics, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Roi Gazit
- The Shraga Segal Department of Microbiology, Immunology and Genetics, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Anat Reiner-Benaim
- Department of Epidemiology, Biostatistics and Community Health Sciences, School of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Tal Shay
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| |
Collapse
|
4
|
Przytycki PF. Uncovering the genetic circuits that drive diseases. NATURE COMPUTATIONAL SCIENCE 2023; 3:584-585. [PMID: 38177750 DOI: 10.1038/s43588-023-00475-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Affiliation(s)
- Pawel F Przytycki
- Faculty of Computing & Data Sciences, Boston University, Boston, MA, USA.
| |
Collapse
|
5
|
Chen X, Wang Y, Cappuccio A, Cheng WS, Zamojski FR, Nair VD, Miller CM, Rubenstein AB, Nudelman G, Tadych A, Theesfeld CL, Vornholt A, George MC, Ruffin F, Dagher M, Chawla DG, Soares-Schanoski A, Spurbeck RR, Ndhlovu LC, Sebra R, Kleinstein SH, Letizia AG, Ramos I, Fowler VG, Woods CW, Zaslavsky E, Troyanskaya OG, Sealfon SC. Mapping disease regulatory circuits at cell-type resolution from single-cell multiomics data. NATURE COMPUTATIONAL SCIENCE 2023; 3:644-657. [PMID: 37974651 PMCID: PMC10653299 DOI: 10.1038/s43588-023-00476-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 06/06/2023] [Indexed: 11/19/2023]
Abstract
Resolving chromatin-remodeling-linked gene expression changes at cell-type resolution is important for understanding disease states. Here we describe MAGICAL (Multiome Accessibility Gene Integration Calling and Looping), a hierarchical Bayesian approach that leverages paired single-cell RNA sequencing and single-cell transposase-accessible chromatin sequencing from different conditions to map disease-associated transcription factors, chromatin sites, and genes as regulatory circuits. By simultaneously modeling signal variation across cells and conditions in both omics data types, MAGICAL achieved high accuracy on circuit inference. We applied MAGICAL to study Staphylococcus aureus sepsis from peripheral blood mononuclear single-cell data that we generated from subjects with bloodstream infection and uninfected controls. MAGICAL identified sepsis-associated regulatory circuits predominantly in CD14 monocytes, known to be activated by bacterial sepsis. We addressed the challenging problem of distinguishing host regulatory circuit responses to methicillin-resistant and methicillin-susceptible S. aureus infections. Although differential expression analysis failed to show predictive value, MAGICAL identified epigenetic circuit biomarkers that distinguished methicillin-resistant from methicillin-susceptible S. aureus infections.
Collapse
Affiliation(s)
- Xi Chen
- Center for Computational Biology, Flatiron Institute, New York, NY, USA
- Lewis-Sigler Institute of Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Yuan Wang
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Antonio Cappuccio
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Wan-Sze Cheng
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Venugopalan D. Nair
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Clare M. Miller
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Aliza B. Rubenstein
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - German Nudelman
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alicja Tadych
- Lewis-Sigler Institute of Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Chandra L. Theesfeld
- Lewis-Sigler Institute of Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Alexandria Vornholt
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Felicia Ruffin
- Division of Infectious Diseases, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Michael Dagher
- Division of Infectious Diseases, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Daniel G. Chawla
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | | | | | - Lishomwa C. Ndhlovu
- Division of Infectious Diseases, Department of Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Robert Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Steven H. Kleinstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
- Department of Pathology and Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
| | | | - Irene Ramos
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Vance G. Fowler
- Division of Infectious Diseases, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Christopher W. Woods
- Division of Infectious Diseases, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Elena Zaslavsky
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- These authors jointly supervised this work: Elena Zaslavsky, Olga G. Troyanskaya, Stuart C. Sealfon
| | - Olga G. Troyanskaya
- Center for Computational Biology, Flatiron Institute, New York, NY, USA
- Lewis-Sigler Institute of Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Computer Science, Princeton University, Princeton, NJ, USA
- These authors jointly supervised this work: Elena Zaslavsky, Olga G. Troyanskaya, Stuart C. Sealfon
| | - Stuart C. Sealfon
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- These authors jointly supervised this work: Elena Zaslavsky, Olga G. Troyanskaya, Stuart C. Sealfon
| |
Collapse
|
6
|
“Structure”-function relationships in eukaryotic transcription factors: The role of intrinsically disordered regions in gene regulation. Mol Cell 2022; 82:3970-3984. [DOI: 10.1016/j.molcel.2022.09.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 08/19/2022] [Accepted: 09/21/2022] [Indexed: 11/06/2022]
|
7
|
Eagle K, Harada T, Kalfon J, Perez MW, Heshmati Y, Ewers J, Koren JV, Dempster JM, Kugener G, Paralkar VR, Lin CY, Dharia NV, Stegmaier K, Orkin SH, Pimkin M. Transcriptional Plasticity Drives Leukemia Immune Escape. Blood Cancer Discov 2022; 3:394-409. [PMID: 35709529 PMCID: PMC9897290 DOI: 10.1158/2643-3230.bcd-21-0207] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 04/21/2022] [Accepted: 06/08/2022] [Indexed: 11/16/2022] Open
Abstract
Relapse of acute myeloid leukemia (AML) after allogeneic bone marrow transplantation has been linked to immune evasion due to reduced expression of major histocompatibility complex class II (MHCII) genes through unknown mechanisms. In this work, we developed CORENODE, a computational algorithm for genome-wide transcription network decomposition that identified a transcription factor (TF) tetrad consisting of IRF8, MYB, MEF2C, and MEIS1, regulating MHCII expression in AML cells. We show that reduced MHCII expression at relapse is transcriptionally driven by combinatorial changes in the expression of these TFs, where MYB and IRF8 play major opposing roles, acting independently of the IFNγ/CIITA pathway. Beyond the MHCII genes, MYB and IRF8 antagonistically regulate a broad genetic program responsible for cytokine signaling and T-cell stimulation that displays reduced expression at relapse. A small number of cells with altered TF abundance and silenced MHCII expression are present at the time of initial leukemia diagnosis, likely contributing to eventual relapse. SIGNIFICANCE Our findings point to an adaptive transcriptional mechanism of AML evolution after allogeneic transplantation whereby combinatorial fluctuations of TF expression under immune pressure result in the selection of cells with a silenced T-cell stimulation program. This article is highlighted in the In This Issue feature, p. 369.
Collapse
Affiliation(s)
- Kenneth Eagle
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
- Ken Eagle Consulting, Houston, Texas
| | - Taku Harada
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Jérémie Kalfon
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | - Monika W. Perez
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Yaser Heshmati
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Jazmin Ewers
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Jošt Vrabič Koren
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | | | | | - Vikram R. Paralkar
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Charles Y. Lin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | - Neekesh V. Dharia
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | - Kimberly Stegmaier
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | - Stuart H. Orkin
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
- Howard Hughes Medical Institute, Boston, Massachusetts
| | - Maxim Pimkin
- Cancer and Blood Disorders Center, Dana-Farber Cancer Institute and Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| |
Collapse
|
8
|
A novel approach GRNTSTE to reconstruct gene regulatory interactions applied to a case study for rat pineal rhythm gene. Sci Rep 2022; 12:10227. [PMID: 35715583 PMCID: PMC9205975 DOI: 10.1038/s41598-022-14903-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 06/14/2022] [Indexed: 01/13/2023] Open
Abstract
Accurate inference and prediction of gene regulatory network are very important for understanding dynamic cellular processes. The large-scale time series genomics data are helpful to reveal the molecular dynamics and dynamic biological processes of complex biological systems. Firstly, we collected the time series data of the rat pineal gland tissue in the natural state according to a fixed sampling rate, and performed whole-genome sequencing. The large-scale time-series sequencing data set of rat pineal gland was constructed, which includes 480 time points, the time interval between adjacent time points is 3 min, and the sampling period is 24 h. Then, we proposed a new method of constructing gene expression regulatory network, named the gene regulatory network based on time series data and entropy transfer (GRNTSTE) method. The method is based on transfer entropy and large-scale time-series gene expression data to infer the causal regulatory relationship between genes in a data-driven mode. The comparative experiments prove that GRNTSTE has better performance than dynamical gene network inference with ensemble of trees (dynGENIE3) and SCRIBE, and has similar performance to TENET. Meanwhile, we proved that the performance of GRNTSTE is slightly lower than that of SINCERITIES method and better than other gene regulatory network construction methods in BEELINE framework, which is based on the BEELINE data set. Finally, the rat pineal rhythm gene expression regulatory network was constructed by us based on the GRNTSTE method, which provides an important reference for the study of the pineal rhythm mechanism, and is of great significance to the study of the pineal rhythm mechanism.
Collapse
|
9
|
Lagator M, Sarikas S, Steinrueck M, Toledo-Aparicio D, Bollback JP, Guet CC, Tkačik G. Predicting bacterial promoter function and evolution from random sequences. eLife 2022; 11:64543. [PMID: 35080492 PMCID: PMC8791639 DOI: 10.7554/elife.64543] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 01/09/2022] [Indexed: 12/12/2022] Open
Abstract
Predicting function from sequence is a central problem of biology. Currently, this is possible only locally in a narrow mutational neighborhood around a wildtype sequence rather than globally from any sequence. Using random mutant libraries, we developed a biophysical model that accounts for multiple features of σ70 binding bacterial promoters to predict constitutive gene expression levels from any sequence. We experimentally and theoretically estimated that 10–20% of random sequences lead to expression and ~80% of non-expressing sequences are one mutation away from a functional promoter. The potential for generating expression from random sequences is so pervasive that selection acts against σ70-RNA polymerase binding sites even within inter-genic, promoter-containing regions. This pervasiveness of σ70-binding sites implies that emergence of promoters is not the limiting step in gene regulatory evolution. Ultimately, the inclusion of novel features of promoter function into a mechanistic model enabled not only more accurate predictions of gene expression levels, but also identified that promoters evolve more rapidly than previously thought.
Collapse
Affiliation(s)
- Mato Lagator
- School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom.,Institute of Science and Technology Austria, Klosterneuburg, Austria
| | - Srdjan Sarikas
- Institute of Science and Technology Austria, Klosterneuburg, Austria.,Center for Physiology and Pharmacology, Medical University of Vienna, Klosterneuburg, Austria
| | | | | | - Jonathan P Bollback
- Institute of Integrative Biology, Functional and Comparative Genomics, University of Liverpool, Liverpool, United Kingdom
| | - Calin C Guet
- Institute of Science and Technology Austria, Klosterneuburg, Austria
| | - Gašper Tkačik
- Institute of Science and Technology Austria, Klosterneuburg, Austria
| |
Collapse
|
10
|
Zhang S, Knaack S, Roy S. Enabling Studies of Genome-Scale Regulatory Network Evolution in Large Phylogenies with MRTLE. Methods Mol Biol 2022; 2477:439-455. [PMID: 35524131 PMCID: PMC9794031 DOI: 10.1007/978-1-0716-2257-5_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Transcriptional regulatory networks specify context-specific patterns of genes and play a central role in how species evolve and adapt. Inferring genome-scale regulatory networks in non-model species is the first step for examining patterns of conservation and divergence of regulatory networks. Transcriptomic data obtained under varying environmental stimuli in multiple species are becoming increasingly available, which can be used to infer regulatory networks. However, inference and analysis of multiple gene regulatory networks in a phylogenetic setting remains challenging. We developed an algorithm, Multi-species Regulatory neTwork LEarning (MRTLE), to facilitate such studies of regulatory network evolution. MRTLE is a probabilistic graphical model-based algorithm that uses phylogenetic structure, transcriptomic data for multiple species, and sequence-specific motifs in each species to simultaneously infer genome-scale regulatory networks across multiple species. We applied MRTLE to study regulatory network evolution across six ascomycete yeasts using transcriptomic measurements collected across different stress conditions. MRTLE networks recapitulated experimentally derived interactions in the model organism S. cerevisiae as well as non-model species, and it was more beneficial for network inference than methods that do not use phylogenetic information. We examined the regulatory networks across species and found that regulators associated with significant expression and network changes are involved in stress-related processes. MTRLE and its associated downstream analysis provide a scalable and principled framework to examine evolutionary dynamics of transcriptional regulatory networks across multiple species in a large phylogeny.
Collapse
Affiliation(s)
- Shilu Zhang
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA
| | - Sara Knaack
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA.
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
11
|
Jiang T, Zhou W, Chang Z, Zou H, Bai J, Sun Q, Pan T, Xu J, Li Y, Li X. ImmReg: the regulon atlas of immune-related pathways across cancer types. Nucleic Acids Res 2021; 49:12106-12118. [PMID: 34755873 PMCID: PMC8643631 DOI: 10.1093/nar/gkab1041] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 10/10/2021] [Accepted: 10/14/2021] [Indexed: 01/05/2023] Open
Abstract
Immune system gene regulation perturbation has been found to be a major cause of the development of various types of cancer. Numbers of mechanisms contribute to gene expression regulation, thus, systematically identification of potential regulons of immune-related pathways is critical to cancer immunotherapy. Here, we comprehensively chart the landscape of transcription factors, microRNAs, RNA binding proteins and long noncoding RNAs regulation in 17 immune-related pathways across 33 cancers. The potential immunology regulons are likely to exhibit higher expressions in immune cells, show expression perturbations in cancer, and are significantly correlated with immune cell infiltrations. We also identify a panel of clinically relevant immunology regulons across cancers. Moreover, the regulon atlas of immune-related pathways helps prioritizing cancer-related genes (i.e. ETV7, miR-146a-5p, ZFP36 and HCP5). We further identified two molecular subtypes of glioma (cold and hot tumour phenotypes), which were characterized by differences in immune cell infiltrations, expression of checkpoints, and prognosis. Finally, we developed a user-friendly resource, ImmReg (http://bio-bigdata.hrbmu.edu.cn/ImmReg/), with multiple modules to visualize, browse, and download immunology regulation. Our study provides a comprehensive landscape of immunology regulons, which will shed light on future development of RNA-based cancer immunotherapies.
Collapse
Affiliation(s)
- Tiantongfei Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Weiwei Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Zhenghong Chang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Haozhe Zou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Jing Bai
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Qisen Sun
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Tao Pan
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Juan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Yongsheng Li
- Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China.,Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| |
Collapse
|
12
|
Abdel-Wahhab MA, El-Nekeety AA, Mohammed HE, El-Messery TM, Roby MH, Abdel-Aziem SH, Hassan NS. Synthesis of encapsulated fish oil using whey protein isolate to prevent the oxidative damage and cytotoxicity of titanium dioxide nanoparticles in rats. Heliyon 2021; 7:e08456. [PMID: 34901503 PMCID: PMC8640477 DOI: 10.1016/j.heliyon.2021.e08456] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 10/30/2021] [Accepted: 11/18/2021] [Indexed: 12/27/2022] Open
Abstract
Fish oil exhibited several beneficial effects on human health; however, its applications face several challenges such as its effects on the organoleptic properties of food and its susceptibility to oxidation. Titanium dioxide NPs (TiO2-NPs) are utilized widely in pharmaceutical and food applications although there are some reports about their oxidative damage to living organisms. The current work was undertaken to identify fatty acids content in mullet fish oil, encapsulation, and characterization of the oil, and to assess the protective efficiency of the encapsulated mullet fish oil (EMFO) against the oxidative damage and genotoxicity of TiO2-NPs in rats. Sixty female Sprague-Dawley rats were distributed to 6 groups and treated for 21 days included the control group; TiO2-NPs-treated group (50 mg/kg b.w); the groups treated with EMFO (50 or 100 mg/kg b.w) and the groups received TiO2-NPs plus EMFO at the low or high dose. Samples of blood, liver, and kidney were taken for different assays and histological studies. The GC-FID analysis showed that a total of 14 different fatty acids were found in Mullet fish oil included 41.4% polyunsaturated fatty acids (PUFAs), 31.1% monounsaturated fatty acids (MUFAs), and 25.1% saturated fatty acids (SFAs). The structure of EMFO was spherical with an average diameter of 234.5 nm and a zeta potential of -6.24 mV and was stable up to 10 days at 25 °C with EE of 81.08%. The PV of EMFO was decreased at 5 days then increased at 15 days; however, TBARS was increased throughout the storage time over 15 days. The biological evaluation showed that TiO2-NPs disturb the hepato-nephro functions, lipid profile, inflammatory cytokines, oxidative stress markers, antioxidant enzymes activity, and their corresponding gene expression along with severe pathological alterations in both hepatic and renal tissue. Co-administration of EMFO induced a strong antioxidant role, and the high level could normalize the majority of the parameters tested and the histological picture of the hepatic and renal tissues. These results pointed out that the encapsulation technology enhances the protective role of EMFO against oxidative stress and genotoxicity of TiO2-NPs through the prevention of ω-3 PUFAs oxidation and controlling their release.
Collapse
Affiliation(s)
- Mosaad A. Abdel-Wahhab
- Food Toxicology & Contaminants Department, National Research Centre, Dokki, Cairo, Egypt
| | - Aziza A. El-Nekeety
- Food Toxicology & Contaminants Department, National Research Centre, Dokki, Cairo, Egypt
| | - Hagar E. Mohammed
- Zoology Department, Faculty of Science, Al-Arish University, Al-Arish, Egypt
| | | | - Mohamed H. Roby
- Food Science and Technology Department, Faculty of Agriculture, Fayoum University, Fayoum, Egypt
| | | | - Nabila S. Hassan
- Pathology Department, National Research Centre, Dokki, Cairo, Egypt
| |
Collapse
|
13
|
Abdel-Wahhab MA, El-Nekeety AA, Mohammed HE, Elshafey OI, Abdel-Aziem SH, Hassan NS. Elimination of oxidative stress and genotoxicity of biosynthesized titanium dioxide nanoparticles in rats via supplementation with whey protein-coated thyme essential oil. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:57640-57656. [PMID: 34089164 DOI: 10.1007/s11356-021-14723-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 05/31/2021] [Indexed: 06/12/2023]
Abstract
The green synthesis of metal nanoparticles is growing dramatically; however, the toxicity of these biosynthesized particles against living organisms is not fully explored. Therefore, this study was designed to synthesize and characterize TiO2-NPs, encapsulation and characterization thyme essential oil (ETEO), and determination of the bioactive constituents of ETEO using GC-MS and evaluate their protective role against TiO2-NPs-induced oxidative damage and genotoxicity in rats. Six groups of rats were treated orally for 30 days including the control group, TiO2-NPs (300 mg/kg b.w)-treated group, ETEO at low (50 mg/kg b.w) or high dose (100 mg/kg b.w)-treated groups, and TiO2-NPs plus ETEO at the two doses-treated groups. Blood and tissues were collected for different assays. The GC-MS results indicated the presence of 21 compounds belonging to phenols, terpene derivatives, and heterocyclic compounds. The synthesized TiO2-NPs were 45 nm tetragonal particles with a zeta potential of -27.34 mV; however, ETEO were 119 nm round particles with a zeta potential of -28.33 mV. TiO2-NPs administration disturbs the liver and kidney markers, lipid profile, cytokines, oxidative stress parameters, the apoptotic and antioxidant hepatic mRNA expression, and induced histological alterations in the liver and kidney tissues. ETEO could improve all these parameters in a dose-dependent manner. It could be concluded that ETEO is a promising candidate for the protection against TiO2-NPs and can be applied safely in food applications.
Collapse
Affiliation(s)
- Mosaad A Abdel-Wahhab
- Food Toxicology & Contaminants Department, National Research Centre, Dokki, Cairo, Egypt.
| | - Aziza A El-Nekeety
- Food Toxicology & Contaminants Department, National Research Centre, Dokki, Cairo, Egypt
| | - Hagar E Mohammed
- Zoology Department, Faculty of Science, Arish University, Arish, Egypt
| | - Ola I Elshafey
- Physical Chemistry Department, National Research Centre, Dokki, Cairo, Egypt
| | | | - Nabila S Hassan
- Pathology Department, National Research Centre, Dokki, Cairo, Egypt
| |
Collapse
|
14
|
Chung H, Parkhurst CN, Magee EM, Phillips D, Habibi E, Chen F, Yeung BZ, Waldman J, Artis D, Regev A. Joint single-cell measurements of nuclear proteins and RNA in vivo. Nat Methods 2021; 18:1204-1212. [PMID: 34608310 PMCID: PMC8532076 DOI: 10.1038/s41592-021-01278-1] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 08/19/2021] [Indexed: 02/08/2023]
Abstract
Identifying gene-regulatory targets of nuclear proteins in tissues is a challenge. Here we describe intranuclear cellular indexing of transcriptomes and epitopes (inCITE-seq), a scalable method that measures multiplexed intranuclear protein levels and the transcriptome in parallel across thousands of nuclei, enabling joint analysis of transcription factor (TF) levels and gene expression in vivo. We apply inCITE-seq to characterize cell state-related changes upon pharmacological induction of neuronal activity in the mouse brain. Modeling gene expression as a linear combination of quantitative protein levels revealed genome-wide associations of each TF and recovered known gene targets. TF-associated genes were coexpressed as distinct modules that each reflected positive or negative TF levels, showing that our approach can disentangle relative putative contributions of TFs to gene expression and add interpretability to inferred gene networks. inCITE-seq can illuminate how combinations of nuclear proteins shape gene expression in native tissue contexts, with direct applications to solid or frozen tissues and clinical specimens.
Collapse
Affiliation(s)
- Hattie Chung
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| | - Christopher N Parkhurst
- Jill Roberts Institute for Research in Inflammatory Bowel Disease, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Emma M Magee
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Devan Phillips
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Genentech, South San Francisco, CA, USA
| | - Ehsan Habibi
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Fei Chen
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
| | | | - Julia Waldman
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - David Artis
- Jill Roberts Institute for Research in Inflammatory Bowel Disease, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Friedman Center for Nutrition and Inflammation, Joan and Sanford I. Weill Department of Medicine, Department of Microbiology and Immunology, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Aviv Regev
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Genentech, South San Francisco, CA, USA.
| |
Collapse
|
15
|
Wang N, Lefaudeux D, Mazumder A, Li JJ, Hoffmann A. Identifying the combinatorial control of signal-dependent transcription factors. PLoS Comput Biol 2021; 17:e1009095. [PMID: 34166361 PMCID: PMC8263068 DOI: 10.1371/journal.pcbi.1009095] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 07/07/2021] [Accepted: 05/18/2021] [Indexed: 12/13/2022] Open
Abstract
The effectiveness of immune responses depends on the precision of stimulus-responsive gene expression programs. Cells specify which genes to express by activating stimulus-specific combinations of stimulus-induced transcription factors (TFs). Their activities are decoded by a gene regulatory strategy (GRS) associated with each response gene. Here, we examined whether the GRSs of target genes may be inferred from stimulus-response (input-output) datasets, which remains an unresolved model-identifiability challenge. We developed a mechanistic modeling framework and computational workflow to determine the identifiability of all possible combinations of synergistic (AND) or non-synergistic (OR) GRSs involving three transcription factors. Considering different sets of perturbations for stimulus-response studies, we found that two thirds of GRSs are easily distinguishable but that substantially more quantitative data is required to distinguish the remaining third. To enhance the accuracy of the inference with timecourse experimental data, we developed an advanced error model that avoids error overestimates by distinguishing between value and temporal error. Incorporating this error model into a Bayesian framework, we show that GRS models can be identified for individual genes by considering multiple datasets. Our analysis rationalizes the allocation of experimental resources by identifying most informative TF stimulation conditions. Applying this computational workflow to experimental data of immune response genes in macrophages, we found that a much greater fraction of genes are combinatorially controlled than previously reported by considering compensation among transcription factors. Specifically, we revealed that a group of known NFκB target genes may also be regulated by IRF3, which is supported by chromatin immuno-precipitation analysis. Our study provides a computational workflow for designing and interpreting stimulus-response gene expression studies to identify underlying gene regulatory strategies and further a mechanistic understanding.
Collapse
Affiliation(s)
- Ning Wang
- Institute for Quantitative and Computational Biosciences (QCBio), University of California, Los Angeles, California, United States of America
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, California, United States of America
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, California, United States of America
| | - Diane Lefaudeux
- Institute for Quantitative and Computational Biosciences (QCBio), University of California, Los Angeles, California, United States of America
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, California, United States of America
| | - Anup Mazumder
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, California, United States of America
| | - Jingyi Jessica Li
- Institute for Quantitative and Computational Biosciences (QCBio), University of California, Los Angeles, California, United States of America
- Department of Statistics, University of California, Los Angeles, California, United States of America
| | - Alexander Hoffmann
- Institute for Quantitative and Computational Biosciences (QCBio), University of California, Los Angeles, California, United States of America
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
16
|
Abstract
BACKGROUND Systems biology is a rapidly advancing field of science that allows us to look into disease mechanisms, patient diagnosis and stratification, and drug development in a completely new light. It is based on the utilization of unbiased computational systems free of the traditional experimental approaches based on personal choices of what is important and what select experiments should be performed to obtain the expected results. METHODS Systems biology can be applied to inflammatory bowel disease (IBD) by learning basic concepts of omes and omics and how omics-derived "big data" can be integrated to discover the biological networks underlying highly complex diseases like IBD. Once these biological networks (interactomes) are identified, then the molecules controlling the disease network can be singled out and specific blockers developed. RESULTS The field of systems biology in IBD is just emerging, and there is still limited information on how to best utilize its power to advance our understanding of Crohn disease and ulcerative colitis to develop novel therapeutic strategies. Few centers have embraced systems biology in IBD, but the creation of international consortia and large biobanks will make biosamples available to basic and clinical IBD investigators for further research studies. CONCLUSIONS The implementation of systems biology is indispensable and unavoidable, and the patient and medical communities will both benefit immensely from what it will offer in the near future.
Collapse
Affiliation(s)
- Claudio Fiocchi
- Department of Inflammation & Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA.,Department of Gastroenterology, Hepatology and Nutrition, Digestive Disease and Surgery Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | | |
Collapse
|
17
|
Baur B, Shin J, Zhang S, Roy S. Data integration for inferring context-specific gene regulatory networks. CURRENT OPINION IN SYSTEMS BIOLOGY 2020; 23:38-46. [PMID: 33225112 PMCID: PMC7676633 DOI: 10.1016/j.coisb.2020.09.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Transcriptional regulatory networks control context-specific gene expression patterns and play important roles in normal and disease processes. Advances in genomics are rapidly increasing our ability to measure different components of the regulation machinery at the single-cell and bulk population level. An important challenge is to combine different types of regulatory genomic measurements to construct a more complete picture of gene regulatory networks across different disease, environmental, and developmental contexts. In this review, we focus on recent computational methods that integrate regulatory genomic data sets to infer context specificity and dynamics in regulatory networks.
Collapse
Affiliation(s)
- Brittany Baur
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, 53715, USA
| | - Junha Shin
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, 53715, USA
| | - Shilu Zhang
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, 53715, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, 53715, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53715, USA
| |
Collapse
|
18
|
Malekpour SA, Alizad-Rahvar AR, Sadeghi M. LogicNet: probabilistic continuous logics in reconstructing gene regulatory networks. BMC Bioinformatics 2020; 21:318. [PMID: 32690031 PMCID: PMC7372900 DOI: 10.1186/s12859-020-03651-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 07/10/2020] [Indexed: 11/10/2022] Open
Abstract
Background Gene Regulatory Networks (GRNs) have been previously studied by using Boolean/multi-state logics. While the gene expression values are usually scaled into the range [0, 1], these GRN inference methods apply a threshold to discretize the data, resulting in missing information. Most of studies apply fuzzy logics to infer the logical gene-gene interactions from continuous data. However, all these approaches require an a priori known network structure. Results Here, by introducing a new probabilistic logic for continuous data, we propose a novel logic-based approach (called the LogicNet) for the simultaneous reconstruction of the GRN structure and identification of the logics among the regulatory genes, from the continuous gene expression data. In contrast to the previous approaches, the LogicNet does not require an a priori known network structure to infer the logics. The proposed probabilistic logic is superior to the existing fuzzy logics and is more relevant to the biological contexts than the fuzzy logics. The performance of the LogicNet is superior to that of several Mutual Information-based and regression-based tools for reconstructing GRNs. Conclusions The LogicNet reconstructs GRNs and logic functions without requiring prior knowledge of the network structure. Moreover, in another application, the LogicNet can be applied for logic function detection from the known regulatory genes-target interactions. We also conclude that computational modeling of the logical interactions among the regulatory genes significantly improves the GRN reconstruction accuracy.
Collapse
Affiliation(s)
- Seyed Amir Malekpour
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.
| | - Amir Reza Alizad-Rahvar
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
| | - Mehdi Sadeghi
- National Institute of Genetic Engineering and Biotechnology, Tehran, Iran
| |
Collapse
|
19
|
Kang X, Hajek B, Hanzawa Y. From graph topology to ODE models for gene regulatory networks. PLoS One 2020; 15:e0235070. [PMID: 32603340 PMCID: PMC7326199 DOI: 10.1371/journal.pone.0235070] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 06/08/2020] [Indexed: 11/28/2022] Open
Abstract
A gene regulatory network can be described at a high level by a directed graph with signed edges, and at a more detailed level by a system of ordinary differential equations (ODEs). The former qualitatively models the causal regulatory interactions between ordered pairs of genes, while the latter quantitatively models the time-varying concentrations of mRNA and proteins. This paper clarifies the connection between the two types of models. We propose a property, called the constant sign property, for a general class of ODE models. The constant sign property characterizes the set of conditions (system parameters, external signals, or internal states) under which an ODE model is consistent with a signed, directed graph. If the constant sign property for an ODE model holds globally for all conditions, then the ODE model has a single signed, directed graph. If the constant sign property for an ODE model only holds locally, which may be more typical, then the ODE model corresponds to different graphs under different sets of conditions. In addition, two versions of constant sign property are given and a relationship between them is proved. As an example, the ODE models that capture the effect of cis-regulatory elements involving protein complex binding, based on the model in the GeneNetWeaver source code, are described in detail and shown to satisfy the global constant sign property with a unique consistent gene regulatory graph. Even a single gene regulatory graph is shown to have many ODE models of GeneNetWeaver type consistent with it due to combinatorial complexity and continuous parameters. Finally the question of how closely data generated by one ODE model can be fit by another ODE model is explored. It is observed that the fit is better if the two models come from the same graph.
Collapse
Affiliation(s)
- Xiaohan Kang
- Department of Electrical and Computer Engineering, and Coordinated Science Laboratory, University of Illinois at Urbana–Champaign, Urbana, Illinois, United States of America
| | - Bruce Hajek
- Department of Electrical and Computer Engineering, and Coordinated Science Laboratory, University of Illinois at Urbana–Champaign, Urbana, Illinois, United States of America
| | - Yoshie Hanzawa
- Department of Biology, California State University, Northridge, Northridge, California, United States of America
| |
Collapse
|
20
|
Bhatt D, Stan RC, Pinhata R, Machado M, Maity S, Cunningham‐Rundles C, Vogel C, de Camargo MM. Chemical chaperones reverse early suppression of regulatory circuits during unfolded protein response in B cells from common variable immunodeficiency patients. Clin Exp Immunol 2020; 200:73-86. [PMID: 31859362 PMCID: PMC7066380 DOI: 10.1111/cei.13410] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/15/2019] [Indexed: 12/19/2022] Open
Abstract
B cells orchestrate pro-survival and pro-apoptotic inputs during unfolded protein response (UPR) to translate, fold, sort, secrete and recycle immunoglobulins. In common variable immunodeficiency (CVID) patients, activated B cells are predisposed to an overload of abnormally processed, misfolded immunoglobulins. Using highly accurate transcript measurements, we show that expression of UPR genes and immunoglobulin chains differs qualitatively and quantitatively during the first 4 h of chemically induced UPR in B cells from CVID patients and a healthy subject. We tested thapsigargin or tunicamycin as stressors and 4-phenylbutyrate, dimethyl sulfoxide and tauroursodeoxycholic acid as chemical chaperones. We found an early and robust decrease of the UPR upon endoplasmic reticulum (ER) stress in CVID patient cells compared to the healthy control consistent with the disease phenotype. The chemical chaperones increased the UPR in the CVID patient cells in response to the stressors, suggesting that misfolded immunoglobulins were stabilized. We suggest that the AMP-dependent transcription factor alpha branch of the UPR is disturbed in CVID patients, underlying the observed expression behavior.
Collapse
Affiliation(s)
- D. Bhatt
- Department of ImmunologyUniversity of São PauloSão PauloBrazil
| | - R. C. Stan
- Department of ImmunologyUniversity of São PauloSão PauloBrazil
- Department of Proteomics and Structural BiologyCantacuzino Military Medical Research Development National InstituteBucharestRomania
| | - R. Pinhata
- Department of ImmunologyUniversity of São PauloSão PauloBrazil
| | - M. Machado
- Department of ImmunologyUniversity of São PauloSão PauloBrazil
| | - S. Maity
- Center for Genomics and Systems BiologyNew York UniversityNew YorkNYUSA
| | - C. Cunningham‐Rundles
- Department of Medicine, Allergy & ImmunologyMount Sinai Medicine SchoolNew YorkNYUSA
| | - C. Vogel
- Center for Genomics and Systems BiologyNew York UniversityNew YorkNYUSA
| | | |
Collapse
|
21
|
Dynamic network inference and association computation discover gene modules regulating virulence, mycotoxin and sexual reproduction in Fusarium graminearum. BMC Genomics 2020; 21:179. [PMID: 32093656 PMCID: PMC7041293 DOI: 10.1186/s12864-020-6596-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 02/19/2020] [Indexed: 11/21/2022] Open
Abstract
Background The filamentous fungus Fusarium graminearum causes devastating crop diseases and produces harmful mycotoxins worldwide. Understanding the complex F. graminearum transcriptional regulatory networks (TRNs) is vital for effective disease management. Reconstructing F. graminearum dynamic TRNs, an NP (non-deterministic polynomial) -hard problem, remains unsolved using commonly adopted reductionist or co-expression based approaches. Multi-omic data such as fungal genomic, transcriptomic data and phenomic data are vital to but so far have been largely isolated and untapped for unraveling phenotype-specific TRNs. Results Here for the first time, we harnessed these resources to infer global TRNs for F. graminearum using a Bayesian network based algorithm called “Module Networks”. The inferred TRNs contain 49 regulatory modules that show condition-specific gene regulation. Through a thorough validation based on prior biological knowledge including functional annotations and TF binding site enrichment, our network prediction displayed high accuracy and concordance with existing knowledge. One regulatory module was partially validated using network perturbations caused by Tri6 and Tri10 gene disruptions, as well as using Tri6 Chip-seq data. We then developed a novel computational method to calculate the associations between modules and phenotypes, and identified major module groups regulating different phenotypes. As a result, we identified TRN subnetworks responsible for F. graminearum virulence, sexual reproduction and mycotoxin production, pinpointing phenotype-associated modules and key regulators. Finally, we found a clear compartmentalization of TRN modules in core and lineage-specific genomic regions in F. graminearum, reflecting the evolution of the TRNs in fungal speciation. Conclusions This system-level reconstruction of filamentous fungal TRNs provides novel insights into the intricate networks of gene regulation that underlie key processes in F. graminearum pathobiology and offers promise for the development of improved disease control strategies.
Collapse
|
22
|
Verd B, Monk NAM, Jaeger J. Modularity, criticality, and evolvability of a developmental gene regulatory network. eLife 2019; 8:e42832. [PMID: 31169494 PMCID: PMC6645726 DOI: 10.7554/elife.42832] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Accepted: 06/05/2019] [Indexed: 01/16/2023] Open
Abstract
The existence of discrete phenotypic traits suggests that the complex regulatory processes which produce them are functionally modular. These processes are usually represented by networks. Only modular networks can be partitioned into intelligible subcircuits able to evolve relatively independently. Traditionally, functional modularity is approximated by detection of modularity in network structure. However, the correlation between structure and function is loose. Many regulatory networks exhibit modular behaviour without structural modularity. Here we partition an experimentally tractable regulatory network-the gap gene system of dipteran insects-using an alternative approach. We show that this system, although not structurally modular, is composed of dynamical modules driving different aspects of whole-network behaviour. All these subcircuits share the same regulatory structure, but differ in components and sensitivity to regulatory interactions. Some subcircuits are in a state of criticality, while others are not, which explains the observed differential evolvability of the various expression features in the system.
Collapse
Affiliation(s)
- Berta Verd
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Konrad Lorenz Institute for Evolution and Cognition Research (KLI)KlosterneuburgAustria
- Department of GeneticsUniversity of CambridgeCambridgeUnited Kingdom
| | - Nicholas AM Monk
- School of Mathematics and StatisticsUniversity of SheffieldSheffieldUnited States
| | - Johannes Jaeger
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Konrad Lorenz Institute for Evolution and Cognition Research (KLI)KlosterneuburgAustria
- School of Mathematics and StatisticsUniversity of SheffieldSheffieldUnited States
- Wissenschaftskolleg zu BerlinBerlinGermany
- Center for Systems Biology Dresden (CSBD)DresdenGermany
- Complexity Science Hub (CSH)ViennaAustria
- Centre de Recherches Interdisciplinaires (CRI)ParisFrance
| |
Collapse
|
23
|
Castro JC, Valdés I, Gonzalez-García LN, Danies G, Cañas S, Winck FV, Ñústez CE, Restrepo S, Riaño-Pachón DM. Gene regulatory networks on transfer entropy (GRNTE): a novel approach to reconstruct gene regulatory interactions applied to a case study for the plant pathogen Phytophthora infestans. Theor Biol Med Model 2019; 16:7. [PMID: 30961611 PMCID: PMC6454757 DOI: 10.1186/s12976-019-0103-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 03/07/2019] [Indexed: 11/10/2022] Open
Abstract
Background The increasing amounts of genomics data have helped in the understanding of the molecular dynamics of complex systems such as plant and animal diseases. However, transcriptional regulation, although playing a central role in the decision-making process of cellular systems, is still poorly understood. In this study, we linked expression data with mathematical models to infer gene regulatory networks (GRN). We present a simple yet effective method to estimate transcription factors’ GRNs from transcriptional data. Method We defined interactions between pairs of genes (edges in the GRN) as the partial mutual information between these genes that takes into account time and possible lags in time from one gene in relation to another. We call this method Gene Regulatory Networks on Transfer Entropy (GRNTE) and it corresponds to Granger causality for Gaussian variables in an autoregressive model. To evaluate the reconstruction accuracy of our method, we generated several sub-networks from the GRN of the eukaryotic yeast model, Saccharomyces cerevisae. Then, we applied this method using experimental data of the plant pathogen Phytophthora infestans. We evaluated the transcriptional expression levels of 48 transcription factors of P. infestans during its interaction with one moderately resistant and one susceptible cultivar of yellow potato (Solanum tuberosum group Phureja), using RT-qPCR. With these data, we reconstructed the regulatory network of P. infestans during its interaction with these hosts. Results We first evaluated the performance of our method, based on the transfer entropy (GRNTE), on eukaryotic datasets from the GRNs of the yeast S. cerevisae. Results suggest that GRNTE is comparable with the state-of-the-art methods when the parameters for edge detection are properly tuned. In the case of P. infestans, most of the genes considered in this study, showed a significant change in expression from the onset of the interaction (0 h post inoculum - hpi) to the later time-points post inoculation. Hierarchical clustering of the expression data discriminated two distinct periods during the infection: from 12 to 36 hpi and from 48 to 72 hpi for both the moderately resistant and susceptible cultivars. These distinct periods could be associated with two phases of the life cycle of the pathogen when infecting the host plant: the biotrophic and necrotrophic phases. Conclusions Here we presented an algorithmic solution to the problem of network reconstruction in time series data. This analytical perspective makes use of the dynamic nature of time series data as it relates to intrinsically dynamic processes such as transcription regulation, were multiple elements of the cell (e.g., transcription factors) act simultaneously and change over time. We applied the algorithm to study the regulatory network of P. infestans during its interaction with two hosts which differ in their level of resistance to the pathogen. Although the gene expression analysis did not show differences between the two hosts, the results of the GRN analyses evidenced rewiring of the genes’ interactions according to the resistance level of the host. This suggests that different regulatory processes are activated in response to different environmental cues. Applications of our methodology showed that it could reliably predict where to place edges in the transcriptional networks and sub-networks. The experimental approach used here can help provide insights on the biological role of these interactions on complex processes such as pathogenicity. The code used is available at https://github.com/jccastrog/GRNTE under GNU general public license 3.0. Electronic supplementary material The online version of this article (10.1186/s12976-019-0103-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Juan Camilo Castro
- Department of Biological Sciences, Universidad de los Andes, Bogotá D.C, Colombia
| | - Ivan Valdés
- Department of Biological Sciences, Universidad de los Andes, Bogotá D.C, Colombia
| | | | - Giovanna Danies
- Department of Design, Universidad de los Andes, Bogotá D.C, Colombia
| | - Silvia Cañas
- Department of Biological Sciences, Universidad de los Andes, Bogotá D.C, Colombia
| | - Flavia Vischi Winck
- Regulatory Systems Biology Laboratory, Department of Biochemistry, Institute of Chemistry, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Carlos Eduardo Ñústez
- School of Agricultural Sciences, Universidad Nacional de Colombia, Bogotá D.C, Colombia
| | - Silvia Restrepo
- Department of Biological Sciences, Universidad de los Andes, Bogotá D.C, Colombia
| | - Diego Mauricio Riaño-Pachón
- Computational, Evolutionary and Systems Biology Laboratory, Center for Nuclear Energy in Agriculture, Universidade de São Paulo, Piracicaba, SP, Brazil.
| |
Collapse
|
24
|
Emerging Principles of Gene Expression Programs and Their Regulation. Mol Cell 2019; 71:389-397. [PMID: 30075140 DOI: 10.1016/j.molcel.2018.07.017] [Citation(s) in RCA: 101] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Revised: 07/15/2018] [Accepted: 07/16/2018] [Indexed: 12/30/2022]
Abstract
Many mechanisms contribute to regulation of gene expression to ensure coordinated cellular behaviors and fate decisions. Transcriptional responses to external signals can consist of many hundreds of genes that can be parsed into different categories based on kinetics of induction, cell-type and signal specificity, and duration of the response. Here we discuss the structure of transcription programs and suggest a basic framework to categorize gene expression programs based on characteristics related to their control mechanisms. We also discuss possible evolutionary implications of this framework.
Collapse
|
25
|
Siahpirani AF, Chasman D, Roy S. Integrative Approaches for Inference of Genome-Scale Gene Regulatory Networks. Methods Mol Biol 2019; 1883:161-194. [PMID: 30547400 DOI: 10.1007/978-1-4939-8882-2_7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Transcriptional regulatory networks specify the regulatory proteins of target genes that control the context-specific expression levels of genes. With our ability to profile the different types of molecular components of cells under different conditions, we are now uniquely positioned to infer regulatory networks in diverse biological contexts such as different cell types, tissues, and time points. In this chapter, we cover two main classes of computational methods to integrate different types of information to infer genome-scale transcriptional regulatory networks. The first class of methods focuses on integrative methods for specifically inferring connections between transcription factors and target genes by combining gene expression data with regulatory edge-specific knowledge. The second class of methods integrates upstream signaling networks with transcriptional regulatory networks by combining gene expression data with protein-protein interaction networks and proteomic datasets. We conclude with a section on practical applications of a network inference algorithm to infer a genome-scale regulatory network.
Collapse
Affiliation(s)
- Alireza Fotuhi Siahpirani
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA.,Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, USA
| | - Deborah Chasman
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA. .,Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
26
|
Eetemadi A, Tagkopoulos I. Genetic Neural Networks: an artificial neural network architecture for capturing gene expression relationships. Bioinformatics 2018; 35:2226-2234. [DOI: 10.1093/bioinformatics/bty945] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Revised: 10/27/2018] [Accepted: 11/16/2018] [Indexed: 01/16/2023] Open
Abstract
Abstract
Motivation
Gene expression prediction is one of the grand challenges in computational biology. The availability of transcriptomics data combined with recent advances in artificial neural networks provide an unprecedented opportunity to create predictive models of gene expression with far reaching applications.
Results
We present the Genetic Neural Network (GNN), an artificial neural network for predicting genome-wide gene expression given gene knockouts and master regulator perturbations. In its core, the GNN maps existing gene regulatory information in its architecture and it uses cell nodes that have been specifically designed to capture the dependencies and non-linear dynamics that exist in gene networks. These two key features make the GNN architecture capable to capture complex relationships without the need of large training datasets. As a result, GNNs were 40% more accurate on average than competing architectures (MLP, RNN, BiRNN) when compared on hundreds of curated and inferred transcription modules. Our results argue that GNNs can become the architecture of choice when building predictors of gene expression from exponentially growing corpus of genome-wide transcriptomics data.
Availability and implementation
https://github.com/IBPA/GNN
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ameen Eetemadi
- Department of Computer Science, University of California, Davis, CA, USA
- Genome Center, University of California, Davis, CA, USA
| | - Ilias Tagkopoulos
- Department of Computer Science, University of California, Davis, CA, USA
- Genome Center, University of California, Davis, CA, USA
| |
Collapse
|
27
|
Wang A, Shu X, Niu X, Zhao W, Ai P, Li P, Zheng A. Comparison of gene co-networks analysis provide a systems view of rice (Oryza sativa L.) response to Tilletia horrida infection. PLoS One 2018; 13:e0202309. [PMID: 30372430 PMCID: PMC6205584 DOI: 10.1371/journal.pone.0202309] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 10/09/2018] [Indexed: 01/29/2023] Open
Abstract
The biotrophic soil-borne fungus Tilletia horrida causes rice kernel smut, an important disease affecting the production of rice male sterile lines in most hybrid rice growing regions of the world. There are no successful ways of controlling this disease and there has been little study of mechanisms of resistance to T. horrida. Based on transcriptional data of different infection time points, we found 23, 782 and 23, 718 differentially expressed genes (fragments per kilobase of transcript sequence per million, FPKM >1) in Jiangcheng 3A (resistant to T. horrida) and 9311A (susceptible to T. horrida), respectively. In order to illuminate the differential responses of the two rice male sterile lines to T. horrida, we identified gene co-expression modules using the method of weighted gene co-expression network analysis (WGCNA) and compared the different biological functions of gene co-expression networks in key modules at different infection time points. The results indicated that gene co-expression networks in the two rice genotypes were different and that genes contained in some modules of the two groups may play important roles in resistance to T. horrida, such as DTH8 and OsHop/Sti1a. Furthermore, these results provide a global view of the responses of two different phenotypes to T. horrida, and assist our understanding of the regulation of expression changes after T. horrida infection.
Collapse
Affiliation(s)
- Aijun Wang
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| | - Xinyue Shu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| | - Xianyu Niu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| | - Wenjuan Zhao
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| | - Peng Ai
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ping Li
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| | - Aiping Zheng
- Rice Research Institute of Sichuan Agricultural University, Chengdu, Sichuan, China
- Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Ya’ an, Sichuan, China
| |
Collapse
|
28
|
Siahpirani AF, Roy S. A prior-based integrative framework for functional transcriptional regulatory network inference. Nucleic Acids Res 2018; 45:e21. [PMID: 27794550 PMCID: PMC5389674 DOI: 10.1093/nar/gkw963] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2015] [Accepted: 10/12/2016] [Indexed: 12/16/2022] Open
Abstract
Transcriptional regulatory networks specify regulatory proteins controlling the context-specific expression levels of genes. Inference of genome-wide regulatory networks is central to understanding gene regulation, but remains an open challenge. Expression-based network inference is among the most popular methods to infer regulatory networks, however, networks inferred from such methods have low overlap with experimentally derived (e.g. ChIP-chip and transcription factor (TF) knockouts) networks. Currently we have a limited understanding of this discrepancy. To address this gap, we first develop a regulatory network inference algorithm, based on probabilistic graphical models, to integrate expression with auxiliary datasets supporting a regulatory edge. Second, we comprehensively analyze our and other state-of-the-art methods on different expression perturbation datasets. Networks inferred by integrating sequence-specific motifs with expression have substantially greater agreement with experimentally derived networks, while remaining more predictive of expression than motif-based networks. Our analysis suggests natural genetic variation as the most informative perturbation for network inference, and, identifies core TFs whose targets are predictable from expression. Multiple reasons make the identification of targets of other TFs difficult, including network architecture and insufficient variation of TF mRNA level. Finally, we demonstrate the utility of our inference algorithm to infer stress-specific regulatory networks and for regulator prioritization.
Collapse
Affiliation(s)
- Alireza F Siahpirani
- Department of Computer Sciences, University of Wisconsin-Madison, 1210 W. Dayton St. Madison, WI 53706-1613, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Discovery Building 330 North Orchard St. Madison, WI 53715, USA.,Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, K6/446 Clinical Sciences Center 600 Highland Avenue Madison, WI 53792-4675, USA
| |
Collapse
|
29
|
Vincent BJ, Staller MV, Lopez-Rivera F, Bragdon MDJ, Pym ECG, Biette KM, Wunderlich Z, Harden TT, Estrada J, DePace AH. Hunchback is counter-repressed to regulate even-skipped stripe 2 expression in Drosophila embryos. PLoS Genet 2018; 14:e1007644. [PMID: 30192762 PMCID: PMC6145585 DOI: 10.1371/journal.pgen.1007644] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 09/19/2018] [Accepted: 08/17/2018] [Indexed: 01/18/2023] Open
Abstract
Hunchback is a bifunctional transcription factor that can activate and repress gene expression in Drosophila development. We investigated the regulatory DNA sequence features that control Hunchback function by perturbing enhancers for one of its target genes, even-skipped (eve). While Hunchback directly represses the eve stripe 3+7 enhancer, we found that in the eve stripe 2+7 enhancer, Hunchback repression is prevented by nearby sequences-this phenomenon is called counter-repression. We also found evidence that Caudal binding sites are responsible for counter-repression, and that this interaction may be a conserved feature of eve stripe 2 enhancers. Our results alter the textbook view of eve stripe 2 regulation wherein Hb is described as a direct activator. Instead, to generate stripe 2, Hunchback repression must be counteracted. We discuss how counter-repression may influence eve stripe 2 regulation and evolution.
Collapse
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Max V. Staller
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Francheska Lopez-Rivera
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Meghan D. J. Bragdon
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Edward C. G. Pym
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kelly M. Biette
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zeba Wunderlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Timothy T. Harden
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
30
|
Zhang J, Zhao W, Fu R, Fu C, Wang L, Liu H, Li S, Deng Q, Wang S, Zhu J, Liang Y, Li P, Zheng A. Comparison of gene co-networks reveals the molecular mechanisms of the rice (Oryza sativa L.) response to Rhizoctonia solani AG1 IA infection. Funct Integr Genomics 2018; 18:545-557. [PMID: 29730773 PMCID: PMC6097106 DOI: 10.1007/s10142-018-0607-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2017] [Revised: 03/12/2018] [Accepted: 03/20/2018] [Indexed: 12/16/2022]
Abstract
Rhizoctonia solani causes rice sheath blight, an important disease affecting the growth of rice (Oryza sativa L.). Attempts to control the disease have met with little success. Based on transcriptional profiling, we previously identified more than 11,947 common differentially expressed genes (TPM > 10) between the rice genotypes TeQing and Lemont. In the current study, we extended these findings by focusing on an analysis of gene co-expression in response to R. solani AG1 IA and identified gene modules within the networks through weighted gene co-expression network analysis (WGCNA). We compared the different genes assigned to each module and the biological interpretations of gene co-expression networks at early and later modules in the two rice genotypes to reveal differential responses to AG1 IA. Our results show that different changes occurred in the two rice genotypes and that the modules in the two groups contain a number of candidate genes possibly involved in pathogenesis, such as the VQ protein. Furthermore, these gene co-expression networks provide comprehensive transcriptional information regarding gene expression in rice in response to AG1 IA. The co-expression networks derived from our data offer ideas for follow-up experimentation that will help advance our understanding of the translational regulation of rice gene expression changes in response to AG1 IA.
Collapse
Affiliation(s)
- Jinfeng Zhang
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Wenjuan Zhao
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Rong Fu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Chenglin Fu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Lingxia Wang
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Huainian Liu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Shuangcheng Li
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Qiming Deng
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Shiquan Wang
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Jun Zhu
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Yueyang Liang
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Ping Li
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| | - Aiping Zheng
- Rice Research Institute of Sichuan Agricultural University, Chengdu, 611130 China
- State Key Laboratory of Hybrid Rice, Sichuan Agricultural University, Chengdu, 611130 China
- Key Laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Chengdu, 611130 China
| |
Collapse
|
31
|
Diedrichs DR, Gomez JA, Huang CS, Rutkowski DT, Curtu R. A data-entrained computational model for testing the regulatory logic of the vertebrate unfolded protein response. Mol Biol Cell 2018; 29:1502-1517. [PMID: 29668363 PMCID: PMC6014097 DOI: 10.1091/mbc.e17-09-0565] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The vertebrate unfolded protein response (UPR) is characterized by multiple interacting nodes among its three pathways, yet the logic underlying this regulatory complexity is unclear. To begin to address this issue, we created a computational model of the vertebrate UPR that was entrained upon and then validated against experimental data. As part of this validation, the model successfully predicted the phenotypes of cells with lesions in UPR signaling, including a surprising and previously unreported differential role for the eIF2α phosphatase GADD34 in exacerbating severe stress but ameliorating mild stress. We then used the model to test the functional importance of a feedforward circuit within the PERK/CHOP axis and of cross-regulatory control of BiP and CHOP expression. We found that the wiring structure of the UPR appears to balance the ability of the response to remain sensitive to endoplasmic reticulum stress and to be deactivated rapidly by improved protein-folding conditions. This model should serve as a valuable resource for further exploring the regulatory logic of the UPR.
Collapse
Affiliation(s)
- Danilo R Diedrichs
- Department of Mathematics, College of Liberal Arts and Sciences, University of Iowa, Iowa City, IA 52242
| | - Javier A Gomez
- Department of Anatomy and Cell Biology, University of Iowa, Iowa City, IA 52242
| | - Chun-Sing Huang
- Department of Anatomy and Cell Biology, University of Iowa, Iowa City, IA 52242
| | - D Thomas Rutkowski
- Department of Anatomy and Cell Biology, University of Iowa, Iowa City, IA 52242.,Department of Internal Medicine, Carver College of Medicine, University of Iowa, Iowa City, IA 52242
| | - Rodica Curtu
- Department of Mathematics, College of Liberal Arts and Sciences, University of Iowa, Iowa City, IA 52242
| |
Collapse
|
32
|
Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding. Proc Natl Acad Sci U S A 2018; 115:E3702-E3711. [PMID: 29588420 PMCID: PMC5910820 DOI: 10.1073/pnas.1715888115] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF-DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo and shed light on how transcriptional networks are rewired throughout evolution. Here, we present a sequencing-based TF binding assay and analysis pipeline (BET-seq, for Binding Energy Topography by sequencing) capable of providing quantitative estimates of binding energies for more than one million DNA sequences in parallel at high energetic resolution. Using this platform, we measured the binding energies associated with all possible combinations of 10 nucleotides flanking the known consensus DNA target interacting with two model yeast TFs, Pho4 and Cbf1. A large fraction of these flanking mutations change overall binding energies by an amount equal to or greater than consensus site mutations, suggesting that current definitions of TF binding sites may be too restrictive. By systematically comparing estimates of binding energies output by deep neural networks (NNs) and biophysical models trained on these data, we establish that dinucleotide (DN) specificities are sufficient to explain essentially all variance in observed binding behavior, with Cbf1 binding exhibiting significantly more nonadditivity than Pho4. NN-derived binding energies agree with orthogonal biochemical measurements and reveal that dynamically occupied sites in vivo are both energetically and mutationally distant from the highest affinity sites.
Collapse
|
33
|
Sharma S, Petsalaki E. Application of CRISPR-Cas9 Based Genome-Wide Screening Approaches to Study Cellular Signalling Mechanisms. Int J Mol Sci 2018; 19:E933. [PMID: 29561791 PMCID: PMC5979383 DOI: 10.3390/ijms19040933] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 03/15/2018] [Accepted: 03/18/2018] [Indexed: 12/26/2022] Open
Abstract
The cellular signalling process is a highly complex mechanism, involving multiple players, which together orchestrate the cell's response to environmental changes and perturbations. Given the multitude of genes that participate in the process of cellular signalling, its study in a genome-wide manner has proven challenging. Recent advances in gene editing technologies, including clustered regularly-interspaced short palindromic repeats/Cas9 (CRISPR/Cas9) approaches, have opened new opportunities to investigate global regulatory signalling programs of cells in an unbiased manner. In this review, we focus on how the application of pooled genetic screening approaches using the CRISPR/Cas9 system has contributed to a systematic understanding of cellular signalling processes in normal and disease contexts.
Collapse
Affiliation(s)
- Sumana Sharma
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
- Cell Surface Signalling Laboratory, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | - Evangelia Petsalaki
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| |
Collapse
|
34
|
Bentovim L, Harden TT, DePace AH. Transcriptional precision and accuracy in development: from measurements to models and mechanisms. Development 2017; 144:3855-3866. [PMID: 29089359 PMCID: PMC5702068 DOI: 10.1242/dev.146563] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
During development, genes are transcribed at specific times, locations and levels. In recent years, the emergence of quantitative tools has significantly advanced our ability to measure transcription with high spatiotemporal resolution in vivo. Here, we highlight recent studies that have used these tools to characterize transcription during development, and discuss the mechanisms that contribute to the precision and accuracy of the timing, location and level of transcription. We attempt to disentangle the discrepancies in how physicists and biologists use the term ‘precision' to facilitate interactions using a common language. We also highlight selected examples in which the coupling of mathematical modeling with experimental approaches has provided important mechanistic insights, and call for a more expansive use of mathematical modeling to exploit the wealth of quantitative data and advance our understanding of animal transcription. Summary: This Review highlights how high-resolution quantitative tools and theoretical models have formed our current view of the mechanisms determining precision and accuracy in the timing, location and level of transcription in the Drosophila embryo.
Collapse
Affiliation(s)
- Lital Bentovim
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Timothy T Harden
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Angela H DePace
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
35
|
Shao B, Yuan H, Zhang R, Wang X, Zhang S, Ouyang Q, Hao N, Luo C. Reconstructing the regulatory circuit of cell fate determination in yeast mating response. PLoS Comput Biol 2017; 13:e1005671. [PMID: 28742153 PMCID: PMC5546706 DOI: 10.1371/journal.pcbi.1005671] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Revised: 08/07/2017] [Accepted: 07/09/2017] [Indexed: 12/22/2022] Open
Abstract
Massive technological advances enabled high-throughput measurements of proteomic changes in biological processes. However, retrieving biological insights from large-scale protein dynamics data remains a challenging task. Here we used the mating differentiation in yeast Saccharomyces cerevisiae as a model and developed integrated experimental and computational approaches to analyze the proteomic dynamics during the process of cell fate determination. When exposed to a high dose of mating pheromone, the yeast cell undergoes growth arrest and forms a shmoo-like morphology; however, at intermediate doses, chemotropic elongated growth is initialized. To understand the gene regulatory networks that control this differentiation switch, we employed a high-throughput microfluidic imaging system that allows real-time and simultaneous measurements of cell growth and protein expression. Using kinetic modeling of protein dynamics, we classified the stimulus-dependent changes in protein abundance into two sources: global changes due to physiological alterations and gene-specific changes. A quantitative framework was proposed to decouple gene-specific regulatory modes from the growth-dependent global modulation of protein abundance. Based on the temporal patterns of gene-specific regulation, we established the network architectures underlying distinct cell fates using a reverse engineering method and uncovered the dose-dependent rewiring of gene regulatory network during mating differentiation. Furthermore, our results suggested a potential crosstalk between the pheromone response pathway and the target of rapamycin (TOR)-regulated ribosomal biogenesis pathway, which might underlie a cell differentiation switch in yeast mating response. In summary, our modeling approach addresses the distinct impacts of the global and gene-specific regulation on the control of protein dynamics and provides new insights into the mechanisms of cell fate determination. We anticipate that our integrated experimental and modeling strategies could be widely applicable to other biological systems. A systematic characterization of the proteomic changes during the process of cell differentiation is critical for understanding the underlying molecular mechanisms. However, protein expression can be largely affected by changes in cell physiological state, which hampers the detection of regulatory interactions. Here we proposed an integrated experimental and computational framework to reconstruct regulatory circuits in mating differentiation of budding yeast Saccharomyces cerevisiae, in which distinct cell fates are triggered by alteration in pheromone concentration. A modeling approach was developed to decouple gene-specific regulation from growth-dependent global regulation of protein expression, allowing us to reverse engineering the gene regulatory circuits underlying distinct cell fates. Our work highlights the importance of model-based analysis of proteomic data and delivers new insight into dose-dependent differentiation behavior of budding yeast.
Collapse
Affiliation(s)
- Bin Shao
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
| | - Haiyu Yuan
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
| | - Rongfei Zhang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Xuan Wang
- School of Informatics and Computing, Indiana University, Bloomington, Indiana, United States of America
| | - Shuwen Zhang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Qi Ouyang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Nan Hao
- Section of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America
- * E-mail: (CL); (NH)
| | - Chunxiong Luo
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
- * E-mail: (CL); (NH)
| |
Collapse
|
36
|
Wagner A, Regev A, Yosef N. Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol 2017; 34:1145-1160. [PMID: 27824854 DOI: 10.1038/nbt.3711] [Citation(s) in RCA: 401] [Impact Index Per Article: 50.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Single-cell genomics has now made it possible to create a comprehensive atlas of human cells. At the same time, it has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry. Emerging computational analysis methods, especially in single-cell RNA sequencing (scRNA-seq), have already begun to reveal, in a data-driven way, the diverse simultaneous facets of a cell's identity, from discrete cell types to continuous dynamic transitions and spatial locations. These developments will eventually allow a cell to be represented as a superposition of 'basis vectors', each determining a different (but possibly dependent) aspect of cellular organization and function. However, computational methods must also overcome considerable challenges-from handling technical noise and data scale to forming new abstractions of biology. As the scale of single-cell experiments continues to increase, new computational approaches will be essential for constructing and characterizing a reference map of cell identities.
Collapse
Affiliation(s)
- Allon Wagner
- Department of Electrical Engineering and Computer Science and the Center for Computational Biology, University of California, Berkeley, California, USA
| | - Aviv Regev
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Nir Yosef
- Department of Electrical Engineering and Computer Science and the Center for Computational Biology, University of California, Berkeley, California, USA.,Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard University, Boston, Massachusetts, USA
| |
Collapse
|
37
|
Han N, Noyes HA, Brass A. TIGERi: modeling and visualizing the responses to perturbation of a transcription factor network. BMC Bioinformatics 2017; 18:260. [PMID: 28617232 PMCID: PMC5471961 DOI: 10.1186/s12859-017-1636-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Transcription factor (TF) networks play a key role in controlling the transfer of genetic information from gene to mRNA. Much progress has been made on understanding and reverse-engineering TF network topologies using a range of experimental and theoretical methodologies. Less work has focused on using these models to examine how TF networks respond to changes in the cellular environment. METHODS In this paper, we have developed a simple, pragmatic methodology, TIGERi (Transcription-factor-activity Illustrator for Global Explanation of Regulatory interaction), to model the response of an inferred TF network to changes in cellular environment. The methodology was tested using publicly available data comparing gene expression profiles of a mouse p38α (Mapk14) knock-out line to the original wild-type. RESULTS Using the model, we have examined changes in the TF network resulting from the presence or absence of p38α. A part of this network was confirmed by experimental work in the original paper. Additional relationships were identified by our analysis, for example between p38α and HNF3, and between p38α and SOX9, and these are strongly supported by published evidence. FXR and MYC were also discovered in our analysis as two novel links of p38α. To provide a computational methodology to the biomedical communities that has more user-friendly interface, we also developed a standalone GUI (graphical user interface) software for TIGERi and it is freely available at https://github.com/namshik/tigeri/ . CONCLUSIONS We therefore believe that our computational approach can identify new members of networks and new interactions between members that are supported by published data but have not been integrated into the existing network models. Moreover, ones who want to analyze their own data with TIGERi could use the software without any command line experience. This work could therefore accelerate researches in transcriptional gene regulation in higher eukaryotes.
Collapse
Affiliation(s)
- Namshik Han
- Gurdon Institute, University of Cambridge, Cambridge, UK. .,School of Computer Science and School of Health Sciences, University of Manchester, Manchester, UK.
| | - Harry A Noyes
- School of Biological Sciences, University of Liverpool, Liverpool, UK
| | - Andy Brass
- School of Computer Science and School of Health Sciences, University of Manchester, Manchester, UK.
| |
Collapse
|
38
|
Koch C, Konieczka J, Delorey T, Lyons A, Socha A, Davis K, Knaack SA, Thompson D, O'Shea EK, Regev A, Roy S. Inference and Evolutionary Analysis of Genome-Scale Regulatory Networks in Large Phylogenies. Cell Syst 2017; 4:543-558.e8. [PMID: 28544882 PMCID: PMC5515301 DOI: 10.1016/j.cels.2017.04.010] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2016] [Revised: 02/20/2017] [Accepted: 04/26/2017] [Indexed: 11/22/2022]
Abstract
Changes in transcriptional regulatory networks can significantly contribute to species evolution and adaptation. However, identification of genome-scale regulatory networks is an open challenge, especially in non-model organisms. Here, we introduce multi-species regulatory network learning (MRTLE), a computational approach that uses phylogenetic structure, sequence-specific motifs, and transcriptomic data, to infer the regulatory networks in different species. Using simulated data from known networks and transcriptomic data from six divergent yeasts, we demonstrate that MRTLE predicts networks with greater accuracy than existing methods because it incorporates phylogenetic information. We used MRTLE to infer the structure of the transcriptional networks that control the osmotic stress responses of divergent, non-model yeast species and then validated our predictions experimentally. Interrogating these networks reveals that gene duplication promotes network divergence across evolution. Taken together, our approach facilitates study of regulatory network evolutionary dynamics across multiple poorly studied species.
Collapse
Affiliation(s)
- Christopher Koch
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, Wl, USA
| | - Jay Konieczka
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Toni Delorey
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Ana Lyons
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Amanda Socha
- Dartmouth College, Biology department, Hanover, NH 03755, USA
| | - Kathleen Davis
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Sara A Knaack
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, Wl, USA
| | - Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Erin K O'Shea
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA
- Howard Hughes Medical Institute, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
- Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
- Department of Molecular and Cellular Biology, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
| | - Aviv Regev
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, Wl, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wl, USA
| |
Collapse
|
39
|
Simicevic J, Deplancke B. Transcription factor proteomics-Tools, applications, and challenges. Proteomics 2017; 17. [DOI: 10.1002/pmic.201600317] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 10/18/2016] [Accepted: 11/11/2016] [Indexed: 11/12/2022]
Affiliation(s)
- Jovan Simicevic
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences; Ecole Polytechnique Fédérale de Lausanne (EPFL), and Swiss Institute of Bioinformatics; Lausanne Switzerland
- LimmaTech Biologics AG; Schlieren Switzerland
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences; Ecole Polytechnique Fédérale de Lausanne (EPFL), and Swiss Institute of Bioinformatics; Lausanne Switzerland
| |
Collapse
|
40
|
Iyer S, Park BR, Kim M. Absolute quantitative measurement of transcriptional kinetic parameters in vivo. Nucleic Acids Res 2016; 44:e142. [PMID: 27378780 PMCID: PMC5062976 DOI: 10.1093/nar/gkw596] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 06/20/2016] [Indexed: 11/24/2022] Open
Abstract
mRNA expression involves transcription initiation, elongation and degradation. In cells, these dynamic processes are highly regulated. However, experimental characterization of the dynamic processes in vivo is difficult due to the paucity of methods capable of direct measurements. We present a highly sensitive and versatile method enabling direct characterization of the dynamic processes. Our method is based on single-molecule fluorescence in situ hybridization (smFISH) and quantitative analyses of hybridization signals. We hybridized multiple probes labelled with spectrally distinct fluorophores to multiple sub-regions of single mRNAs, and visualized the kinetics of synthesis and degradation of the sub-regions. Quantitative analyses of the data lead to absolute quantification of the lag time of mRNA induction (the time it takes for external signals to activate transcription initiation), transcription initiation rate, transcription elongation speed (i.e. mRNA chain-growth speed), the rate of premature termination of transcripts and degradation rates. Applying our method to three different biological problems, we demonstrated how our method may be applicable to reveal dynamics of mRNA expression that was difficult to study previously. We expect such absolute quantification can greatly facilitate understanding of gene expression and its regulation working at the levels of transcriptional initiation, elongation and degradation.
Collapse
Affiliation(s)
- Sukanya Iyer
- Department of Physics, Emory University, Atlanta, GA 30322, USA
| | - Bo Ryoung Park
- Department of Physics, Emory University, Atlanta, GA 30322, USA
| | - Minsu Kim
- Department of Physics, Emory University, Atlanta, GA 30322, USA Graduate Division of Biological and Biomedical Sciences, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|
41
|
Guo L, Zhao G, Xu J, Kistler HC, Gao L, Ma L. Compartmentalized gene regulatory network of the pathogenic fungus Fusarium graminearum. THE NEW PHYTOLOGIST 2016; 211:527-41. [PMID: 26990214 PMCID: PMC5069591 DOI: 10.1111/nph.13912] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 01/25/2016] [Indexed: 05/09/2023]
Abstract
Head blight caused by Fusarium graminearum threatens world-wide wheat production, resulting in both yield loss and mycotoxin contamination. We reconstructed the global F. graminearum gene regulatory network (GRN) from a large collection of transcriptomic data using Bayesian network inference, a machine-learning algorithm. This GRN reveals connectivity between key regulators and their target genes. Focusing on key regulators, this network contains eight distinct but interwoven modules. Enriched for unique functions, such as cell cycle, DNA replication, transcription, translation and stress responses, each module exhibits distinct expression profiles. Evolutionarily, the F. graminearum genome can be divided into core regions shared with closely related species and variable regions harboring genes that are unique to F. graminearum and perform species-specific functions. Interestingly, the inferred top regulators regulate genes that are significantly enriched from the same genomic regions (P < 0.05), revealing a compartmentalized network structure that may reflect network rewiring related to specific adaptation of this plant pathogen. This first-ever reconstructed filamentous fungal GRN primes our understanding of pathogenicity at the systems biology level and provides enticing prospects for novel disease control strategies involving the targeting of master regulators in pathogens. The program can be used to construct GRNs of other plant pathogens.
Collapse
Affiliation(s)
- Li Guo
- Department of Biochemistry and Molecular BiologyUniversity of Massachusetts AmherstAmherstMA01003USA
| | - Guoyi Zhao
- Department of Electrical & Computer EngineeringUniversity of Massachusetts AmherstAmherstMA01003USA
| | - Jin‐Rong Xu
- Department of Botany and Plant PathologyPurdue UniversityWest LafayetteIN47907USA
| | - H. Corby Kistler
- USDA‐ARSCereal Disease LaboratoryUniversity of MinnesotaSt PaulMN55108USA
| | - Lixin Gao
- Department of Electrical & Computer EngineeringUniversity of Massachusetts AmherstAmherstMA01003USA
| | - Li‐Jun Ma
- Department of Biochemistry and Molecular BiologyUniversity of Massachusetts AmherstAmherstMA01003USA
| |
Collapse
|
42
|
Chasman D, Fotuhi Siahpirani A, Roy S. Network-based approaches for analysis of complex biological systems. Curr Opin Biotechnol 2016; 39:157-166. [PMID: 27115495 DOI: 10.1016/j.copbio.2016.04.007] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2015] [Revised: 04/04/2016] [Accepted: 04/05/2016] [Indexed: 12/22/2022]
Abstract
Cells function and respond to changes in their environment by the coordinated activity of their molecular components, including mRNAs, proteins and metabolites. At the heart of proper cellular function are molecular networks connecting these components to process extra-cellular environmental signals and drive dynamic, context-specific cellular responses. Network-based computational approaches aim to systematically integrate measurements from high-throughput experiments to gain a global understanding of cellular function under changing environmental conditions. We provide an overview of recent methodological developments toward solving two major computational problems within this field in the past two years (2013-2015): network reconstruction and network-based interpretation. Looking forward, we envision development of methods that can predict phenotypes with high accuracy as well as provide biologically plausible mechanistic hypotheses.
Collapse
Affiliation(s)
- Deborah Chasman
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States
| | - Alireza Fotuhi Siahpirani
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, United States; Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, United States
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, United States; Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, United States.
| |
Collapse
|
43
|
Vincent BJ, Estrada J, DePace AH. The appeasement of Doug: a synthetic approach to enhancer biology. Integr Biol (Camb) 2016; 8:475-84. [DOI: 10.1039/c5ib00321k] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Ben J. Vincent
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Javier Estrada
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | - Angela H. DePace
- Department of Systems Biology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| |
Collapse
|
44
|
Siwo G, Rider A, Tan A, Pinapati R, Emrich S, Chawla N, Ferdig M. Prediction of fine-tuned promoter activity from DNA sequence. F1000Res 2016; 5:158. [PMID: 27347373 PMCID: PMC4916984 DOI: 10.12688/f1000research.7485.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/08/2016] [Indexed: 12/16/2022] Open
Abstract
The quantitative prediction of transcriptional activity of genes using promoter sequence is fundamental to the engineering of biological systems for industrial purposes and understanding the natural variation in gene expression. To catalyze the development of new algorithms for this purpose, the Dialogue on Reverse Engineering Assessment and Methods (DREAM) organized a community challenge seeking predictive models of promoter activity given normalized promoter activity data for 90 ribosomal protein promoters driving expression of a fluorescent reporter gene. By developing an unbiased modeling approach that performs an iterative search for predictive DNA sequence features using the frequencies of various k-mers, inferred DNA mechanical properties and spatial positions of promoter sequences, we achieved the best performer status in this challenge. The specific predictive features used in the model included the frequency of the nucleotide G, the length of polymeric tracts of T and TA, the frequencies of 6 distinct trinucleotides and 12 tetranucleotides, and the predicted protein deformability of the DNA sequence. Our method accurately predicted the activity of 20 natural variants of ribosomal protein promoters (Spearman correlation r = 0.73) as compared to 33 laboratory-mutated variants of the promoters (r = 0.57) in a test set that was hidden from participants. Notably, our model differed substantially from the rest in 2 main ways: i) it did not explicitly utilize transcription factor binding information implying that subtle DNA sequence features are highly associated with gene expression, and ii) it was entirely based on features extracted exclusively from the 100 bp region upstream from the translational start site demonstrating that this region encodes much of the overall promoter activity. The findings from this study have important implications for the engineering of predictable gene expression systems and the evolution of gene expression in naturally occurring biological systems.
Collapse
Affiliation(s)
- Geoffrey Siwo
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA; IBM TJ Watson Research Center, NY, USA; IBM Research-Africa, Johannesberg, South Africa
| | - Andrew Rider
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA
| | - Asako Tan
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA; Epicentre, Madison, WI, USA
| | - Richard Pinapati
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA
| | - Scott Emrich
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA
| | - Nitesh Chawla
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA
| | - Michael Ferdig
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA; Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, USA
| |
Collapse
|
45
|
Yugi K, Kubota H, Hatano A, Kuroda S. Trans-Omics: How To Reconstruct Biochemical Networks Across Multiple 'Omic' Layers. Trends Biotechnol 2016; 34:276-290. [PMID: 26806111 DOI: 10.1016/j.tibtech.2015.12.013] [Citation(s) in RCA: 159] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 12/14/2015] [Accepted: 12/16/2015] [Indexed: 12/17/2022]
Abstract
We propose 'trans-omic' analysis for reconstructing global biochemical networks across multiple omic layers by use of both multi-omic measurements and computational data integration. We introduce technologies for connecting multi-omic data based on prior knowledge of biochemical interactions and characterize a biochemical trans-omic network by concepts of a static and dynamic nature. We introduce case studies of metabolism-centric trans-omic studies to show how to reconstruct a biochemical trans-omic network by connecting multi-omic data and how to analyze it in terms of the static and dynamic nature. We propose a trans-ome-wide association study (trans-OWAS) connecting phenotypes with trans-omic networks that reflect both genetic and environmental factors, which can characterize several complex lifestyle diseases as breakdowns in the trans-omic system.
Collapse
Affiliation(s)
- Katsuyuki Yugi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan; PRESTO, Japan Science and Technology Agency, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan.
| | - Hiroyuki Kubota
- Division of Integrated Omics, Research Center for Transomics Medicine, Medical Institute of Bioregulation, Kyushu University, 3-1-1 Maidashi, Higashi-ku, Fukuoka, Fukuoka 812-8582, Japan; PRESTO, Japan Science and Technology Agency, Higashi-ku, Fukuoka, Fukuoka 812-8582, Japan
| | - Atsushi Hatano
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Shinya Kuroda
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan; Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan; CREST, Japan Science and Technology Agency, Bunkyo-ku, Tokyo 113-0033, Japan.
| |
Collapse
|
46
|
Nucleosomes Are Essential for Proper Regulation of a Multigated Promoter in Saccharomyces cerevisiae. Genetics 2015; 202:551-63. [PMID: 26627840 DOI: 10.1534/genetics.115.183715] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 11/29/2015] [Indexed: 12/22/2022] Open
Abstract
Nucleosome-depleted regions (NDRs) are present immediately adjacent to the transcription start site in most eukaryotic promoters. Here we show that NDRs in the upstream promoter region can profoundly affect gene regulation. Chromatin at the yeast HO promoter is highly repressive and numerous coactivators are required for expression. We modified the HO promoter with segments from the well-studied CLN2 NDR, creating chimeric promoters differing in nucleosome occupancy but with binding sites for the same activator, SBF. Nucleosome depletion resulted in substantial increases in both factor binding and gene expression and allowed activation from a much longer distance, probably by allowing recruited coactivators to act further downstream. Nucleosome depletion also affected sequential activation of the HO promoter; HO activation typically requires the ordered recruitment of activators first to URS1, second to the left-half of URS2 (URS2-L), and finally to the right-half of URS2 (URS2-R), with each region representing distinct gates that must be unlocked to achieve activation. The absence of nucleosomes at URS2-L resulted in promoters no longer requiring both the URS1 and URS2-L gates, as either gate alone is now sufficient to promote binding of the SBF factor to URS2-R. Furthermore, nucleosome depletion at URS2 altered the timing of HO expression and bypassed the regulation that restricts expression to mother cells. Our results reveal insight into how nucleosomes can create a requirement for ordered recruitment of factors to facilitate complex transcriptional regulation.
Collapse
|
47
|
Mahajan G, Mande SC. From System-Wide Differential Gene Expression to Perturbed Regulatory Factors: A Combinatorial Approach. PLoS One 2015; 10:e0142147. [PMID: 26562430 PMCID: PMC4642966 DOI: 10.1371/journal.pone.0142147] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Accepted: 10/19/2015] [Indexed: 11/19/2022] Open
Abstract
High-throughput experiments such as microarrays and deep sequencing provide large scale information on the pattern of gene expression, which undergoes extensive remodeling as the cell dynamically responds to varying environmental cues or has its function disrupted under pathological conditions. An important initial step in the systematic analysis and interpretation of genome-scale expression alteration involves identification of a set of perturbed transcriptional regulators whose differential activity can provide a proximate hypothesis to account for these transcriptomic changes. In the present work, we propose an unbiased and logically natural approach to transcription factor enrichment. It involves overlaying a list of experimentally determined differentially expressed genes on a background regulatory network coming from e.g. literature curation or computational motif scanning, and identifying that subset of regulators whose aggregated target set best discriminates between the altered and the unaffected genes. In other words, our methodology entails testing of all possible regulatory subnetworks, rather than just the target sets of individual regulators as is followed in most standard approaches. We have proposed an iterative search method to efficiently find such a combination, and benchmarked it on E. coli microarray and regulatory network data available in the public domain. Comparative analysis carried out on artificially generated differential expression profiles, as well as empirical factor overexpression data for M. tuberculosis, shows that our methodology provides marked improvement in accuracy of regulatory inference relative to the standard method that involves evaluating factor enrichment in an individual manner.
Collapse
|
48
|
Thompson D, Regev A, Roy S. Comparative analysis of gene regulatory networks: from network reconstruction to evolution. Annu Rev Cell Dev Biol 2015; 31:399-428. [PMID: 26355593 DOI: 10.1146/annurev-cellbio-100913-012908] [Citation(s) in RCA: 95] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Collapse
Affiliation(s)
- Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | | | | |
Collapse
|
49
|
Pemberton-Ross PJ, Pachkov M, van Nimwegen E. ARMADA: Using motif activity dynamics to infer gene regulatory networks from gene expression data. Methods 2015; 85:62-74. [PMID: 26164700 DOI: 10.1016/j.ymeth.2015.06.024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Revised: 06/22/2015] [Accepted: 06/23/2015] [Indexed: 11/19/2022] Open
Abstract
Analysis of gene expression data remains one of the most promising avenues toward reconstructing genome-wide gene regulatory networks. However, the large dimensionality of the problem prohibits the fitting of explicit dynamical models of gene regulatory networks, whereas machine learning methods for dimensionality reduction such as clustering or principal component analysis typically fail to provide mechanistic interpretations of the reduced descriptions. To address this, we recently developed a general methodology called motif activity response analysis (MARA) that, by modeling gene expression patterns in terms of the activities of concrete regulators, accomplishes dramatic dimensionality reduction while retaining mechanistic biological interpretations of its predictions (Balwierz, 2014). Here we extend MARA by presenting ARMADA, which models the activity dynamics of regulators across a time course, and infers the causal interactions between the regulators that drive the dynamics of their activities across time. We have implemented ARMADA as part of our ISMARA webserver, ismara.unibas.ch, allowing any researcher to automatically apply it to any gene expression time course. To illustrate the method, we apply ARMADA to a time course of human umbilical vein endothelial cells treated with TNF. Remarkably, ARMADA is able to reproduce the complex observed motif activity dynamics using a relatively small set of interactions between the key regulators in this system. In addition, we show that ARMADA successfully infers many of the key regulatory interactions known to drive this inflammatory response and discuss several novel interactions that ARMADA predicts. In combination with ISMARA, ARMADA provides a powerful approach to generating plausible hypotheses for the key interactions between regulators that control gene expression in any system for which time course measurements are available.
Collapse
Affiliation(s)
- Peter J Pemberton-Ross
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Mikhail Pachkov
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Erik van Nimwegen
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| |
Collapse
|
50
|
Function does not follow form in gene regulatory circuits. Sci Rep 2015; 5:13015. [PMID: 26290154 PMCID: PMC4542331 DOI: 10.1038/srep13015] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 07/06/2015] [Indexed: 11/08/2022] Open
Abstract
Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.
Collapse
|