1
|
Naz S, Chatha AMM. Transforming growth factor-beta gene family in Labeo rohita: Genome-wide molecular characterization and phylogenetic relationship with other vertebrates. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY. PART D, GENOMICS & PROTEOMICS 2025; 55:101511. [PMID: 40267860 DOI: 10.1016/j.cbd.2025.101511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2024] [Revised: 04/11/2025] [Accepted: 04/12/2025] [Indexed: 04/25/2025]
Abstract
Recent advancements in animal genetics, biotechnology, and bioinformatics can potentially contribute to the required improvement in fish production and quality. Current study explored for the first-time, various genes from Transforming growth factor-beta (TGF-β) gene family which could play vital role in growth and development of Labeo rohita, an economically important freshwater fish. A number of tools and analysis like characterizing the selected genes, multiple alignment and clade-wise phylogeny, dual Synteny analysis and chromosomal distribution, characterization of proteins, secondary structure and 3D models of proteins, PROSITE scan analysis, and transcription factor binding sites (TFBSs) of selected genes were performed to explore the relationship between selected genes from Labeo rohita and referenced species (Homo sapiens, Danio rerio, Oreochromis niloticus, Chelonia mydas, and Parus major). The study evaluated 27 genes from TGF-β gene family. The results suggested that physicochemical characteristics of studied genes exhibit a basic nature with the few exceptions. Two main clades (BPM like and GDF like) were obtained by the phylogenetic analysis across six vertebrate species. Homogeneity was observed in the gene structure for all selected genes of TGF-β gene superfamily. TGF-β family and TGF-β propeptide family were dominant in domain regions of studied genes. Gene structure comparisons suggested that the TGF-β gene superfamily has arisen by gene duplications events. The study identified five pairs of duplicated gene (segmental duplications) with the Ka/Ks < 1, indicating the negative selection pressure of these genes. The length of selected TFBSs (GATA, HOXD, STAT, HNF-1A, and YY1) ranged from 5 to 9 bp, with the HOXD having maximum bp. This study explored the molecular characterization for TGF-β gene family and related proteins in L. rohita and will potentially serve as basis of development of novel strategies in improvement of fish culture.
Collapse
Affiliation(s)
- Saima Naz
- Department of Zoology, Government Sadiq College Women University, Bahawalpur, Pakistan.
| | - Ahmad Manan Mustafa Chatha
- Department of Entomology, Faculty of Agriculture and Environment, The Islamia University of Bahawalpur, Pakistan.
| |
Collapse
|
2
|
Znaidi S. When HSFs bring the heat-mapping the transcriptional circuitries of HSF-type regulators in Candida albicans. mSphere 2025; 10:e0064423. [PMID: 39704513 PMCID: PMC11774045 DOI: 10.1128/msphere.00644-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2024] Open
Abstract
Heat shock factor (HSF)-type regulators are stress-responsive transcription factors widely distributed among eukaryotes, including fungi. They carry a four-stranded winged helix-turn-helix DNA-binding domain considered as the signature domain for HSFs. The genome of the opportunistic yeast Candida albicans encodes four HSF members, namely, Sfl1, Sfl2, Skn7, and the essential regulator, Hsf1. C. albicans HSFs do not only respond to heat shock and/or temperature variation but also to CO2 levels, oxidative stress, and quorum sensing, acting this way as central decision makers. In this minireview, I follow on the heels of my mSphere of Influence commentary (2020) to provide an overview of the repertoire of HSF regulators in Saccharomyces cerevisiae and C. albicans and describe how their genetic perturbation in C. albicans, coupled with genome-wide expression and location analyses, allow to map their transcriptional circuitry. I highlight how they can regulate, in common, a crucial developmental program: filamentous growth.
Collapse
Affiliation(s)
- Sadri Znaidi
- Institut Pasteur de Tunis, University of Tunis El Manar, Laboratoire de Microbiologie Moléculaire, Vaccinologie et Développement Biotechnologique, Tunis, Tunisia
- Institut Pasteur, INRA, Département Mycologie, Unité Biologie et Pathogénicité Fongiques, Paris, France
| |
Collapse
|
3
|
Sultana M, Tayyab M, Sunil, Parveen S, Hussain M, Saeed S, Riaz Z, Shabbir S. In silico molecular characterization of TGF-β gene family in Bufo bufo: genome-wide analysis. J Biomol Struct Dyn 2024:1-15. [PMID: 38345010 DOI: 10.1080/07391102.2024.2313168] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 01/27/2024] [Indexed: 03/08/2025]
Abstract
Bufo bufo is a living example of evolutionary processes due to its numerous physiological and ecological adaptations. This is the first study to genetically characterize the TGF-β gene family in B. bufo at the genome-wide level, and a total of 28 TGF-β gene family homologs are identified. Physicochemical characteristics of TGF-β homologs exhibit a basic nature except for BMP1, BMP4, BMP10, BMP15, AMH, INHA, NODAL Modulator and TGFB1. Phylogenetic analysis divided the TGF-β gene family homologs into 2 major clades along with other vertebrate species. In domain and motif composition analysis, the gene structure for all TGF-β homologs exhibited homogeneity except BMP1. We have identified the TGF-β propeptide domain together with the TGF-β in all family homologs of TGF-β superfamily. Gene structure comparisons indicated that the TGF-β gene family have arisen by gene duplications. We also identified 10 duplicated gene pairs, all of which were detected to be segmental duplications. The Ka/Ks test ratio findings for every pair of genes revealed that none of the ratios surpassed 1 except for one gene pair (INHA/BMP1), indicating that these proteins are under positive selection. Circos analysis showed that TGF-β gene family homologs are arranged in 11 dispersed clusters and all were segmentally arrayed in the genome. This study provides a molecular basis for TGF-β ligand protein functional analysis and may serve as a reference for in-depth phylogenomics and may promote the development of novel strategies.
Collapse
Affiliation(s)
- Mehwish Sultana
- Department of Zoology, The Government Sadiq College Women University, Bahawalpur, Punjab, Pakistan
| | | | - Sunil
- University of Agriculture, Faisalabad, Punjab, Pakistan
| | - Shakeela Parveen
- Department of Zoology, The Government Sadiq College Women University, Bahawalpur, Punjab, Pakistan
| | | | - Saba Saeed
- Department of Zoology, The Government Sadiq College Women University, Bahawalpur, Punjab, Pakistan
| | - Zainab Riaz
- Department of Zoology, The Government Sadiq College Women University, Bahawalpur, Punjab, Pakistan
| | - Saman Shabbir
- Department of Zoology, The Government Sadiq College Women University, Bahawalpur, Punjab, Pakistan
| |
Collapse
|
4
|
Freyre-González JA, Escorcia-Rodríguez JM, Gutiérrez-Mondragón LF, Martí-Vértiz J, Torres-Franco CN, Zorro-Aranda A. System Principles Governing the Organization, Architecture, Dynamics, and Evolution of Gene Regulatory Networks. Front Bioeng Biotechnol 2022; 10:888732. [PMID: 35646858 PMCID: PMC9135355 DOI: 10.3389/fbioe.2022.888732] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 04/27/2022] [Indexed: 11/21/2022] Open
Abstract
Synthetic biology aims to apply engineering principles for the rational, systematical design and construction of biological systems displaying functions that do not exist in nature or even building a cell from scratch. Understanding how molecular entities interconnect, work, and evolve in an organism is pivotal to this aim. Here, we summarize and discuss some historical organizing principles identified in bacterial gene regulatory networks. We propose a new layer, the concilion, which is the group of structural genes and their local regulators responsible for a single function that, organized hierarchically, coordinate a response in a way reminiscent of the deliberation and negotiation that take place in a council. We then highlight the importance that the network structure has, and discuss that the natural decomposition approach has unveiled the system-level elements shaping a common functional architecture governing bacterial regulatory networks. We discuss the incompleteness of gene regulatory networks and the need for network inference and benchmarking standardization. We point out the importance that using the network structural properties showed to improve network inference. We discuss the advances and controversies regarding the consistency between reconstructions of regulatory networks and expression data. We then discuss some perspectives on the necessity of studying regulatory networks, considering the interactions’ strength distribution, the challenges to studying these interactions’ strength, and the corresponding effects on network structure and dynamics. Finally, we explore the ability of evolutionary systems biology studies to provide insights into how evolution shapes functional architecture despite the high evolutionary plasticity of regulatory networks.
Collapse
Affiliation(s)
- Julio A Freyre-González
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
| | - Juan M Escorcia-Rodríguez
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
| | - Luis F Gutiérrez-Mondragón
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
- Undergraduate Program in Genomic Sciences, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
| | - Jerónimo Martí-Vértiz
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
| | - Camila N Torres-Franco
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
| | - Andrea Zorro-Aranda
- Regulatory Systems Biology Research Group, Program of Systems Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Cuernavaca, México
- Department of Chemical Engineering, Universidad de Antioquia, Medellín, Colombia
| |
Collapse
|
5
|
Wang D, Wang T, Gill A, Hilliard T, Chen F, Karamyshev AL, Zhang F. Uncovering the cellular capacity for intensive and specific feedback self-control of the argonautes and MicroRNA targeting activity. Nucleic Acids Res 2020; 48:4681-4697. [PMID: 32297952 PMCID: PMC7229836 DOI: 10.1093/nar/gkaa209] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 03/13/2020] [Accepted: 03/24/2020] [Indexed: 11/13/2022] Open
Abstract
The miRNA pathway has three segments—biogenesis, targeting and downstream regulatory effectors. We aimed to better understand their cellular control by exploring the miRNA-mRNA-targeting relationships. We first used human evolutionarily conserved sites. Strikingly, AGOs 1–3 are all among the top 14 mRNAs with the highest miRNA site counts, along with ANKRD52, the phosphatase regulatory subunit of the recently identified AGO phosphorylation cycle; and the AGO phosphorylation cycle mRNAs share much more than expected miRNA sites. The mRNAs for TNRC6, which acts with AGOs to channel miRNA-mediated regulatory actions onto specific mRNAs, are also heavily miRNA-targeted. In contrast, upstream miRNA biogenesis mRNAs are not, and neither are downstream regulatory effectors. In short, binding site enrichment in miRNA targeting machinery mRNAs, but neither upstream biogenesis nor downstream effector mRNAs, was observed, endowing a cellular capacity for intensive and specific feedback control of the targeting activity. The pattern was confirmed with experimentally determined miRNA-mRNA target relationships. Moreover, genetic experiments demonstrated cellular utilization of this capacity. Thus, we uncovered a capacity for intensive, and specific, feedback-regulation of miRNA targeting activity directly by miRNAs themselves, i.e. segment-specific feedback auto-regulation of miRNA pathway, complementing miRNAs pairing with transcription factors to form hybrid feedback-loop.
Collapse
Affiliation(s)
- Degeng Wang
- Department of Environmental Toxicology, Lubbock, TX 79409, USA.,The Institute of Environmental and Human Health (TIEHH), Lubbock, TX 79409, USA
| | - Tingzeng Wang
- Department of Environmental Toxicology, Lubbock, TX 79409, USA.,The Institute of Environmental and Human Health (TIEHH), Lubbock, TX 79409, USA
| | - Audrey Gill
- Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA
| | - Terrell Hilliard
- Department of Environmental Toxicology, Lubbock, TX 79409, USA.,The Institute of Environmental and Human Health (TIEHH), Lubbock, TX 79409, USA
| | - Fengqian Chen
- Department of Environmental Toxicology, Lubbock, TX 79409, USA.,The Institute of Environmental and Human Health (TIEHH), Lubbock, TX 79409, USA
| | - Andrey L Karamyshev
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences Center, 3601 4th Street, Lubbock TX 79430, USA
| | - Fangyuan Zhang
- Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA
| |
Collapse
|
6
|
Inference of Bacterial Small RNA Regulatory Networks and Integration with Transcription Factor-Driven Regulatory Networks. mSystems 2020; 5:5/3/e00057-20. [PMID: 32487739 PMCID: PMC8534726 DOI: 10.1128/msystems.00057-20] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Small noncoding RNAs (sRNAs) are key regulators of bacterial gene expression. Through complementary base pairing, sRNAs affect mRNA stability and translation efficiency. Here, we describe a network inference approach designed to identify sRNA-mediated regulation of transcript levels. We use existing transcriptional data sets and prior knowledge to infer sRNA regulons using our network inference tool, the Inferelator. This approach produces genome-wide gene regulatory networks that include contributions by both transcription factors and sRNAs. We show the benefits of estimating and incorporating sRNA activities into network inference pipelines using available experimental data. We also demonstrate how these estimated sRNA regulatory activities can be mined to identify the experimental conditions where sRNAs are most active. We uncover 45 novel experimentally supported sRNA-mRNA interactions in Escherichia coli, outperforming previous network-based efforts. Additionally, our pipeline complements sequence-based sRNA-mRNA interaction prediction methods by adding a data-driven filtering step. Finally, we show the general applicability of our approach by identifying 24 novel, experimentally supported, sRNA-mRNA interactions in Pseudomonas aeruginosa, Staphylococcus aureus, and Bacillus subtilis. Overall, our strategy generates novel insights into the functional context of sRNA regulation in multiple bacterial species. IMPORTANCE Individual bacterial genomes can have dozens of small noncoding RNAs with largely unexplored regulatory functions. Although bacterial sRNAs influence a wide range of biological processes, including antibiotic resistance and pathogenicity, our current understanding of sRNA-mediated regulation is far from complete. Most of the available information is restricted to a few well-studied bacterial species; and even in those species, only partial sets of sRNA targets have been characterized in detail. To close this information gap, we developed a computational strategy that takes advantage of available transcriptional data and knowledge about validated and putative sRNA-mRNA interactions for inferring expanded sRNA regulons. Our approach facilitates the identification of experimentally supported novel interactions while filtering out false-positive results. Due to its data-driven nature, our method prioritizes biologically relevant interactions among lists of candidate sRNA-target pairs predicted in silico from sequence analysis or derived from sRNA-mRNA binding experiments.
Collapse
|
7
|
Barco B, Clay NK. Hierarchical and Dynamic Regulation of Defense-Responsive Specialized Metabolism by WRKY and MYB Transcription Factors. FRONTIERS IN PLANT SCIENCE 2020; 10:1775. [PMID: 32082343 PMCID: PMC7005594 DOI: 10.3389/fpls.2019.01775] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 12/19/2019] [Indexed: 05/07/2023]
Abstract
The plant kingdom produces hundreds of thousands of specialized bioactive metabolites, some with pharmaceutical and biotechnological importance. Their biosynthesis and function have been studied for decades, but comparatively less is known about how transcription factors with overlapping functions and contrasting regulatory activities coordinately control the dynamics and output of plant specialized metabolism. Here, we performed temporal studies on pathogen-infected intact host plants with perturbed transcription factors. We identified WRKY33 as the condition-dependent master regulator and MYB51 as the dual functional regulator in a hierarchical gene network likely responsible for the gene expression dynamics and metabolic fluxes in the camalexin and 4-hydroxy-indole-3-carbonylnitrile (4OH-ICN) pathways. This network may have also facilitated the regulatory capture of the newly evolved 4OH-ICN pathway in Arabidopsis thaliana by the more-conserved transcription factor MYB51. It has long been held that the plasticity of plant specialized metabolism and the canalization of development should be differently regulated; our findings imply a common hierarchical regulatory architecture orchestrated by transcription factors for specialized metabolism and development, making it an attractive target for metabolic engineering.
Collapse
Affiliation(s)
| | - Nicole K. Clay
- Department of Molecular, Cellular & Developmental Biology, Yale University, New Haven, CT, United States
| |
Collapse
|
8
|
Ferro E, Enrico Bena C, Grigolon S, Bosia C. From Endogenous to Synthetic microRNA-Mediated Regulatory Circuits: An Overview. Cells 2019; 8:E1540. [PMID: 31795372 PMCID: PMC6952906 DOI: 10.3390/cells8121540] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Revised: 11/22/2019] [Accepted: 11/25/2019] [Indexed: 12/13/2022] Open
Abstract
MicroRNAs are short non-coding RNAs that are evolutionarily conserved and are pivotal post-transcriptional mediators of gene regulation. Together with transcription factors and epigenetic regulators, they form a highly interconnected network whose building blocks can be classified depending on the number of molecular species involved and the type of interactions amongst them. Depending on their topology, these molecular circuits may carry out specific functions that years of studies have related to the processing of gene expression noise. In this review, we first present the different over-represented network motifs involving microRNAs and their specific role in implementing relevant biological functions, reviewing both theoretical and experimental studies. We then illustrate the recent advances in synthetic biology, such as the construction of artificially synthesised circuits, which provide a controlled tool to test experimentally the possible microRNA regulatory tasks and constitute a starting point for clinical applications.
Collapse
Affiliation(s)
- Elsi Ferro
- IIGM—Italian Institute for Genomic Medicine, c/o IRCCS, 10060 Candiolo (Torino), Italy
- Candiolo Cancer Institute, FPO-IRCCS, 10060 Candiolo (Torino), Italy
| | - Chiara Enrico Bena
- IIGM—Italian Institute for Genomic Medicine, c/o IRCCS, 10060 Candiolo (Torino), Italy
- Candiolo Cancer Institute, FPO-IRCCS, 10060 Candiolo (Torino), Italy
| | - Silvia Grigolon
- The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK
| | - Carla Bosia
- IIGM—Italian Institute for Genomic Medicine, c/o IRCCS, 10060 Candiolo (Torino), Italy
- Department of Applied Science and Technology, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Torino, Italy
| |
Collapse
|
9
|
Kulyté A, Kwok KHM, de Hoon M, Carninci P, Hayashizaki Y, Arner P, Arner E. MicroRNA-27a/b-3p and PPARG regulate SCAMP3 through a feed-forward loop during adipogenesis. Sci Rep 2019; 9:13891. [PMID: 31554889 PMCID: PMC6761119 DOI: 10.1038/s41598-019-50210-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 09/06/2019] [Indexed: 12/12/2022] Open
Abstract
MicroRNAs (miRNA) modulate gene expression through feed-back and forward loops. Previous studies identified miRNAs that regulate transcription factors, including Peroxisome Proliferator Activated Receptor Gamma (PPARG), in adipocytes, but whether they influence adipogenesis via such regulatory loops remain elusive. Here we predicted and validated a novel feed-forward loop regulating adipogenesis and involved miR-27a/b-3p, PPARG and Secretory Carrier Membrane Protein 3 (SCAMP3). In this loop, expression of both PPARG and SCAMP3 was independently suppressed by miR-27a/b-3p overexpression. Knockdown of PPARG downregulated SCAMP3 expression at the late phase of adipogenesis, whereas reduction of SCAMP3 mRNA levels increased PPARG expression at early phase in differentiation. The latter was accompanied with upregulation of adipocyte-enriched genes, including ADIPOQ and FABP4, suggesting an anti-adipogenic role for SCAMP3. PPARG and SCAMP3 exhibited opposite behaviors regarding correlations with clinical phenotypes, including body mass index, body fat mass, adipocyte size, lipolytic and lipogenic capacity, and secretion of pro-inflammatory cytokines. While adipose PPARG expression was associated with more favorable metabolic phenotypes, SCAMP3 expression was linked to increased fat mass and insulin resistance. Together, we identified a feed-forward loop through which miR-27a/b-3p, PPARG and SCAMP3 cooperatively fine tune the regulation of adipogenesis, which potentially may impact whole body metabolism.
Collapse
Affiliation(s)
- Agné Kulyté
- Lipid laboratory, Department of Medicine H7, Karolinska Institutet, Huddinge, Sweden.
| | - Kelvin Ho Man Kwok
- Lipid laboratory, Department of Medicine H7, Karolinska Institutet, Huddinge, Sweden.,Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
| | - Michiel de Hoon
- RIKEN Center for Integrative Medical Sciences (IMS), Yokohama, Kanagawa, 230-0045, Japan
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences (IMS), Yokohama, Kanagawa, 230-0045, Japan
| | - Yoshihide Hayashizaki
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Yokohama, Kanagawa, 230-0045, Japan
| | - Peter Arner
- Lipid laboratory, Department of Medicine H7, Karolinska Institutet, Huddinge, Sweden
| | - Erik Arner
- RIKEN Center for Integrative Medical Sciences (IMS), Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
10
|
Condition-Specific Modeling of Biophysical Parameters Advances Inference of Regulatory Networks. Cell Rep 2019; 23:376-388. [PMID: 29641998 PMCID: PMC5987223 DOI: 10.1016/j.celrep.2018.03.048] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Revised: 01/12/2018] [Accepted: 03/12/2018] [Indexed: 12/31/2022] Open
Abstract
Large-scale inference of eukaryotic transcription-regulatory networks remains challenging. One underlying reason is that existing algorithms typically ignore crucial regulatory mechanisms, such as RNA degradation and post-transcriptional processing. Here, we describe InfereCLaDR, which incorporates such elements and advances prediction in Saccharomyces cerevisiae. First, InfereCLaDR employs a high-quality Gold Standard dataset that we use separately as prior information and for model validation. Second, InfereCLaDR explicitly models transcription factor activity and RNA half-lives. Third, it introduces expression subspaces to derive condition-responsive regulatory networks for every gene. InfereCLaDR’s final network is validated by known data and trends and results in multiple insights. For example, it predicts long half-lives for transcripts of the nucleic acid metabolism genes and members of the cytosolic chaperonin complex as targets of the proteasome regulator Rpn4p. InfereCLaDR demonstrates that more biophysically realistic modeling of regulatory networks advances prediction accuracy both in eukaryotes and prokaryotes.
Collapse
|
11
|
Defoort J, Van de Peer Y, Vermeirssen V. Function, dynamics and evolution of network motif modules in integrated gene regulatory networks of worm and plant. Nucleic Acids Res 2019; 46:6480-6503. [PMID: 29873777 PMCID: PMC6061849 DOI: 10.1093/nar/gky468] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 05/14/2018] [Indexed: 12/29/2022] Open
Abstract
Gene regulatory networks (GRNs) consist of different molecular interactions that closely work together to establish proper gene expression in time and space. Especially in higher eukaryotes, many questions remain on how these interactions collectively coordinate gene regulation. We study high quality GRNs consisting of undirected protein–protein, genetic and homologous interactions, and directed protein–DNA, regulatory and miRNA–mRNA interactions in the worm Caenorhabditis elegans and the plant Arabidopsis thaliana. Our data-integration framework integrates interactions in composite network motifs, clusters these in biologically relevant, higher-order topological network motif modules, overlays these with gene expression profiles and discovers novel connections between modules and regulators. Similar modules exist in the integrated GRNs of worm and plant. We show how experimental or computational methodologies underlying a certain data type impact network topology. Through phylogenetic decomposition, we found that proteins of worm and plant tend to functionally interact with proteins of a similar age, while at the regulatory level TFs favor same age, but also older target genes. Despite some influence of the duplication mode difference, we also observe at the motif and module level for both species a preference for age homogeneity for undirected and age heterogeneity for directed interactions. This leads to a model where novel genes are added together to the GRNs in a specific biological functional context, regulated by one or more TFs that also target older genes in the GRNs. Overall, we detected topological, functional and evolutionary properties of GRNs that are potentially universal in all species.
Collapse
Affiliation(s)
- Jonas Defoort
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium.,Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa
| | - Vanessa Vermeirssen
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| |
Collapse
|
12
|
John JP, Thirunavukkarasu P, Ishizuka K, Parekh P, Sawa A. An in-silico approach for discovery of microRNA-TF regulation of DISC1 interactome mediating neuronal migration. NPJ Syst Biol Appl 2019; 5:17. [PMID: 31098296 PMCID: PMC6504871 DOI: 10.1038/s41540-019-0094-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Accepted: 04/15/2019] [Indexed: 11/25/2022] Open
Abstract
Neuronal migration constitutes an important step in corticogenesis; dysregulation of the molecular mechanisms mediating this crucial step in neurodevelopment may result in various neuropsychiatric disorders. By curating experimental data from published literature, we identified eight functional modules involving Disrupted-in-schizophrenia 1 (DISC1) and its interacting proteins that regulate neuronal migration. We then identified miRNAs and transcription factors (TFs) that form functional feedback loops and regulate gene expression of the DISC1 interactome. Using this curated data, we conducted in-silico modeling of the DISC1 interactome involved in neuronal migration and identified the proteins that either facilitate or inhibit neuronal migrational processes. We also studied the effect of perturbation of miRNAs and TFs in feedback loops on the DISC1 interactome. From these analyses, we discovered that STAT3, TCF3, and TAL1 (through feedback loop with miRNAs) play a critical role in the transcriptional control of DISC1 interactome thereby regulating neuronal migration. To the best of our knowledge, regulation of the DISC1 interactome mediating neuronal migration by these TFs has not been previously reported. These potentially important TFs can serve as targets for undertaking validation studies, which in turn can reveal the molecular processes that cause neuronal migration defects underlying neurodevelopmental disorders. This underscores the importance of the use of in-silico techniques in aiding the discovery of mechanistic evidence governing important molecular and cellular processes. The present work is one such step towards the discovery of regulatory factors of the DISC1 interactome that mediates neuronal migration.
Collapse
Affiliation(s)
- John P. John
- Multimodal Brain Image Analysis Laboratory (MBIAL), National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
- Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
| | - Priyadarshini Thirunavukkarasu
- Multimodal Brain Image Analysis Laboratory (MBIAL), National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
- Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
| | - Koko Ishizuka
- Department of Psychiatry and Behavioral Sciences, School of Medicine, Johns Hopkins University, Baltimore, MD 21287 USA
| | - Pravesh Parekh
- Multimodal Brain Image Analysis Laboratory (MBIAL), National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
- Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, 560029 India
| | - Akira Sawa
- Departments of Psychiatry, Mental Health, Neuroscience, and Biomedical Engineering, School of Medicine, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21287 USA
| |
Collapse
|
13
|
Zhao X, Yin H, Li N, Zhu Y, Shen W, Qian S, He G, Li J, Wang X. An Integrated Regulatory Network Based on Comprehensive Analysis of mRNA Expression, Gene Methylation and Expression of Long Non-coding RNAs (lncRNAs) in Myelodysplastic Syndromes. Front Oncol 2019; 9:200. [PMID: 30984623 PMCID: PMC6450213 DOI: 10.3389/fonc.2019.00200] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 03/08/2019] [Indexed: 12/12/2022] Open
Abstract
Myelodysplastic syndromes (MDS) are a heterogeneous group of disorders characterized by ineffective hematopoiesis, defective differentiation of hematopoietic precursors, and expansion of the abnormal clones. The prevalence of MDS has raised great concerns worldwide, but its pathogenetic mechanisms remain elusive. To provide insights on novel biomarkers for the diagnosis and therapy of MDS, we performed high-throughput genome-wide mRNA expression profiling, DNA methylation analysis, and long non-coding RNAs (lncRNA) analysis on bone marrows from four MDS patients and four age-matched healthy controls. We identified 1,937 differentially expressed genes (DEGs), 515 methylated genes, and 214 lncRNA that showed statistically significant differences. As the most significant module-related DEGs, TCL1A, PTGS2, and MME were revealed to be enriched in regulation of cell differentiation and cell death pathways. In addition, the GeneGo pathway maps identified by top DEGs were shown to converge on cancer, immunoregulation, apoptosis and regulation of actin cytoskeleton, most of which are known contributors in MDS etiology and pathogenesis. Notably, as potential biomarkers for diagnosis of MDS, four specific genes (ABAT, FADD, DAPP1, and SMPD3) were further subjected to detailed pathway analysis. Our integrative analysis on mRNA expression, gene methylation and lncRNAs profiling facilitates further understanding of the pathogenesis of MDS, and may promote the diagnosis and novel therapeutics for this disease.
Collapse
Affiliation(s)
- Xiaoli Zhao
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China.,Department of Haematology, Huashan Hospital, Fudan University, Shanghai, China
| | - Hua Yin
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Nianyi Li
- Department of Haematology, Huashan Hospital, Fudan University, Shanghai, China
| | - Yu Zhu
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Wenyi Shen
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Sixuan Qian
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Guangsheng He
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Jianyong Li
- Key Laboratory of Hematology, Department of Hematology, Collaborative Innovation Center for Cancer Personalized Medicine, The First Affiliated Hospital of Nanjing Medical University, Jiangsu Province Hospital, Nanjing Medical University, Nanjing, China
| | - Xiaoqin Wang
- Department of Haematology, Huashan Hospital, Fudan University, Shanghai, China
| |
Collapse
|
14
|
Chen H, Wang JP, Liu H, Li H, Lin YCJ, Shi R, Yang C, Gao J, Zhou C, Li Q, Sederoff RR, Li W, Chiang VL. Hierarchical Transcription Factor and Chromatin Binding Network for Wood Formation in Black Cottonwood ( Populus trichocarpa). THE PLANT CELL 2019; 31:602-626. [PMID: 30755461 PMCID: PMC6482634 DOI: 10.1105/tpc.18.00620] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Revised: 01/15/2019] [Accepted: 02/07/2019] [Indexed: 05/18/2023]
Abstract
Wood remains the world's most abundant and renewable resource for timber and pulp and is an alternative to fossil fuels. Understanding the molecular regulation of wood formation can advance the engineering of wood for more efficient material and energy productions. We integrated a black cottonwood (Populus trichocarpa) wood-forming cell system with quantitative transcriptomics and chromatin binding assays to construct a transcriptional regulatory network (TRN) directed by a key transcription factor (TF), PtrSND1-B1 (secondary wall-associated NAC-domain protein). The network consists of four layers of TF-target gene interactions with quantitative regulatory effects, describing the specificity of how the regulation is transduced through these interactions to activate cell wall genes (effector genes) for wood formation. PtrSND1-B1 directs 57 TF-DNA interactions through 17 TFs transregulating 27 effector genes. Of the 57 interactions, 55 are novel. We tested 42 of these 57 interactions in 30 genotypes of transgenic P. trichocarpa and verified that ∼90% of the tested interactions function in vivo. The TRN reveals common transregulatory targets for distinct TFs, leading to the discovery of nine TF protein complexes (dimers and trimers) implicated in regulating the biosynthesis of specific types of lignin. Our work suggests that wood formation may involve regulatory homeostasis determined by combinations of TF-DNA and TF-TF (protein-protein) regulations.
Collapse
Affiliation(s)
- Hao Chen
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
- Department of Genetics, North Carolina State University, Raleigh, North Carolina 27695
| | - Jack P. Wang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
| | - Huizi Liu
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
| | - Huiyu Li
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
| | - Ying-Chung Jimmy Lin
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Department of Life Sciences, College of Life Science, National Taiwan University, Taipei, 10617, Taiwan
- Institute of Plant Biology, College of Life Science, National Taiwan University, Taipei 10617, Taiwan
| | - Rui Shi
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
| | - Chenmin Yang
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
| | - Jinghui Gao
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
| | - Chenguang Zhou
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
| | - Quanzi Li
- State Key Laboratory of Tree Genetics and Breeding, Chinese Academy of Forestry, Beijing 100091, China
| | - Ronald R. Sederoff
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
| | - Wei Li
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
| | - Vincent L. Chiang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, North Carolina 27695
- Department of Forest Biomaterials, North Carolina State University, Raleigh, North Carolina 27695
| |
Collapse
|
15
|
Dragomir A, Vrahatis AG, Bezerianos A. A Network-Based Perspective in Alzheimer's Disease: Current State and an Integrative Framework. IEEE J Biomed Health Inform 2018; 23:14-25. [PMID: 30080151 DOI: 10.1109/jbhi.2018.2863202] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A major rise in the prevalence and impact of Alzheimer's disease (AD) is projected in the coming decades, resulting from increasing life expectancy, thus leading to substantially increased healthcare costs. While brain disfunctions at the time of diagnosis are irreversible, it is widely accepted that AD pathology develops decades before clinical symptoms onset. If incipient processes can be detected early in the disease progression, prospective intervention for preventing or slowing the disease can be designed. Currently, there is no noninvasive biomarker available to detect and monitor early stages of disease progression. The complex etiology of AD warrants a systems-based approach supporting the integration of multimodal and multilevel data, while network-based modeling provides the scaffolding for methods revealing complex systems-level disruptions initiated by the disease. In this work, we review current state-of-the-art, focusing on network-based biomarkers at molecular and brain functional connectivity levels. Particular emphasis is placed on outlining recent trends, which highlight the functional importance of modular substructures in molecular and connectivity networks and their potential biomarker value. Our perspective is rooted in network medicine and summarizes the pipelines for identifying network-based biomarkers, as well as the benefits of integrating genotype and brain phenotype information for a comprehensively noninvasive approach in the early diagnosis of AD. Finally, we propose a framework for integrating knowledge from molecular and brain connectivity levels, which has the potential to enable noninvasive diagnosis, provide support for monitoring therapies, and help understand heretofore unexamined deep level relations between genotype and brain phenotype.
Collapse
|
16
|
Liu ZP. Towards precise reconstruction of gene regulatory networks by data integration. QUANTITATIVE BIOLOGY 2018. [DOI: 10.1007/s40484-018-0139-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
17
|
Inukai S, Pincus Z, de Lencastre A, Slack FJ. A microRNA feedback loop regulates global microRNA abundance during aging. RNA (NEW YORK, N.Y.) 2018; 24:159-172. [PMID: 29114017 PMCID: PMC5769744 DOI: 10.1261/rna.062190.117] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 10/29/2017] [Indexed: 06/07/2023]
Abstract
Expression levels of many microRNAs (miRNAs) change during aging, notably declining globally in a number of organisms and tissues across taxa. However, little is known about the mechanisms or the biological relevance for this change. We investigated the network of genes that controls miRNA transcription and processing during C. elegans aging. We found that miRNA biogenesis genes are highly networked with transcription factors and aging-associated miRNAs. In particular, miR-71, known to influence life span and itself up-regulated during aging, represses alg-1/Argonaute expression post-transcriptionally during aging. Increased ALG-1 abundance in mir-71 loss-of-function mutants led to globally increased miRNA expression. Interestingly, these mutants demonstrated widespread mRNA expression dysregulation and diminished levels of variability both in gene expression and in overall life span. Thus, the progressive molecular decline often thought to be the result of accumulated damage over an organism's life may be partially explained by a miRNA-directed mechanism of age-associated decline.
Collapse
Affiliation(s)
- Sachi Inukai
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, Connecticut 06520, USA
- Institute for RNA Medicine, Department of Pathology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts 02215, USA
| | - Zachary Pincus
- Department of Developmental Biology
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Alexandre de Lencastre
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, Connecticut 06520, USA
| | - Frank J Slack
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, Connecticut 06520, USA
- Institute for RNA Medicine, Department of Pathology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts 02215, USA
| |
Collapse
|
18
|
Li YF, Altman RB. Systematic target function annotation of human transcription factors. BMC Biol 2018; 16:4. [PMID: 29325558 PMCID: PMC5795274 DOI: 10.1186/s12915-017-0469-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 12/06/2017] [Indexed: 01/03/2023] Open
Abstract
Background Transcription factors (TFs), the key players in transcriptional regulation, have attracted great experimental attention, yet the functions of most human TFs remain poorly understood. Recent capabilities in genome-wide protein binding profiling have stimulated systematic studies of the hierarchical organization of human gene regulatory network and DNA-binding specificity of TFs, shedding light on combinatorial gene regulation. We show here that these data also enable a systematic annotation of the biological functions and functional diversity of TFs. Result We compiled a human gene regulatory network for 384 TFs covering the 146,096 TF–target gene (TF–TG) relationships, extracted from over 850 ChIP-seq experiments as well as the literature. By integrating this network of TF–TF and TF–TG relationships with 3715 functional concepts from six sources of gene function annotations, we obtained over 9000 confident functional annotations for 279 TFs. We observe extensive connectivity between TFs and Mendelian diseases, GWAS phenotypes, and pharmacogenetic pathways. Further, we show that TFs link apparently unrelated functions, even when the two functions do not share common genes. Finally, we analyze the pleiotropic functions of TFs and suggest that the increased number of upstream regulators contributes to the functional pleiotropy of TFs. Conclusion Our computational approach is complementary to focused experimental studies on TF functions, and the resulting knowledge can guide experimental design for the discovery of unknown roles of TFs in human disease and drug response. Electronic supplementary material The online version of this article (doi:10.1186/s12915-017-0469-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yong Fuga Li
- Stanford Genome Technology Center, Stanford, CA, USA. .,Department of Bioengineering, Stanford University, Stanford, CA, USA. .,Present address: Department of Bioinformatics, Illumina Inc., San Diego, CA, USA.
| | - Russ B Altman
- Department of Bioengineering, Stanford University, Stanford, CA, USA. .,Department of Genetics, Stanford University, Stanford, CA, USA.
| |
Collapse
|
19
|
Lichtblau Y, Zimmermann K, Haldemann B, Lenze D, Hummel M, Leser U. Comparative assessment of differential network analysis methods. Brief Bioinform 2017; 18:837-850. [PMID: 27473063 DOI: 10.1093/bib/bbw061] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Indexed: 12/31/2022] Open
Abstract
Differential network analysis (DiNA) denotes a recent class of network-based Bioinformatics algorithms which focus on the differences in network topologies between two states of a cell, such as healthy and disease, to identify key players in the discriminating biological processes. In contrast to conventional differential analysis, DiNA identifies changes in the interplay between molecules, rather than changes in single molecules. This ability is especially important in cases where effectors are changed, e.g. mutated, but their expression is not. A number of different DiNA approaches have been proposed, yet a comparative assessment of their performance in different settings is still lacking. In this paper, we evaluate 10 different DiNA algorithms regarding their ability to recover genetic key players from transcriptome data. We construct high-quality regulatory networks and enrich them with co-expression data from four different types of cancer. Next, we assess the results of applying DiNA algorithms on these data sets using a gold standard list (GSL). We find that local DiNA algorithms are generally superior to global algorithms, and that all DiNA algorithms outperform conventional differential expression analysis. We also assess the ability of DiNA methods to exploit additional knowledge in the underlying cellular networks. To this end, we enrich the cancer-type specific networks with known regulatory miRNAs and compare the algorithms performance in networks with and without miRNA. We find that including miRNAs consistently and considerably improves the performance of almost all tested algorithms. Our results underline the advantages of comprehensive cell models for the analysis of -omics data.
Collapse
|
20
|
Kato M, Kashem MA, Cheng C. An intestinal microRNA modulates the homeostatic adaptation to chronic oxidative stress in C. elegans. Aging (Albany NY) 2017; 8:1979-2005. [PMID: 27623524 PMCID: PMC5076448 DOI: 10.18632/aging.101029] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2016] [Accepted: 08/19/2016] [Indexed: 12/22/2022]
Abstract
Adaptation to an environmental or metabolic perturbation is a feature of the evolutionary process. Recent insights into microRNA function suggest that microRNAs serve as key players in a robust adaptive response against stress in animals through their capacity to fine-tune gene expression. However, it remains largely unclear how a microRNA-modulated downstream mechanism contributes to the process of homeostatic adaptation. Here we show that loss of an intestinally expressed microRNA gene, mir-60, in the nematode C. elegans promotes an adaptive response to chronic - a mild and long-term - oxidative stress exposure. The pathway involved appears to be unique since the canonical stress-responsive factors, such as DAF-16/FOXO, are dispensable for mir-60 loss to enhance oxidative stress resistance. Gene expression profiles revealed that genes encoding lysosomal proteases and those involved in xenobiotic metabolism and pathogen defense responses are up-regulated by the loss of mir-60. Detailed genetic studies and computational microRNA target prediction suggest that endocytosis components and a bZip transcription factor gene zip-10, which functions in innate immune response, are directly modulated by miR-60 in the intestine. Our findings suggest that the mir-60 loss facilitates adaptive response against chronic oxidative stress by ensuring the maintenance of cellular homeostasis.
Collapse
Affiliation(s)
- Masaomi Kato
- The Laboratory of Ageing, Centenary Institute, Camperdown, NSW 2050, Australia.,Sydney Medical School, The University of Sydney, Camperdown, NSW 2050, Australia
| | - Mohammed Abul Kashem
- The Laboratory of Ageing, Centenary Institute, Camperdown, NSW 2050, Australia.,Sydney Medical School, The University of Sydney, Camperdown, NSW 2050, Australia
| | - Chao Cheng
- Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Lebanon, NH 03756, USA.,Department of Molecular and Systems Biology, Geisel School of Medicine, Dartmouth College, Hanover, NH 03755, USA
| |
Collapse
|
21
|
Wang B, Singh R, Qi Y. A constrained $$\ell $$1 minimization approach for estimating multiple sparse Gaussian or nonparanormal graphical models. Mach Learn 2017. [DOI: 10.1007/s10994-017-5635-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
22
|
Cora' D, Re A, Caselle M, Bussolino F. MicroRNA-mediated regulatory circuits: outlook and perspectives. Phys Biol 2017; 14:045001. [PMID: 28586314 DOI: 10.1088/1478-3975/aa6f21] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
MicroRNAs have been found to be necessary for regulating genes implicated in almost all signaling pathways, and consequently their dysfunction influences many diseases, including cancer. Understanding of the complexity of the microRNA-mediated regulatory network has grown in terms of size, connectivity and dynamics with the development of computational and, more recently, experimental high-throughput approaches for microRNA target identification. Newly developed studies on recurrent microRNA-mediated circuits in regulatory networks, also known as network motifs, have substantially contributed to addressing this complexity, and therefore to helping understand the ways by which microRNAs achieve their regulatory role. This review provides a summarizing view of the state-of-the-art, and perspectives of research efforts on microRNA-mediated regulatory motifs. In this review, we discuss the topological properties characterizing different types of circuits, and the regulatory features theoretically enabled by such properties, with a special emphasis on examples of circuits typifying their biological significance in experimentally validated contexts. Finally, we will consider possible future developments, in particular regarding microRNA-mediated circuits involving long non-coding RNAs and epigenetic regulators.
Collapse
Affiliation(s)
- Davide Cora'
- Department of Oncology, University of Torino, Str. Prov. 142 Km 3.95, I-10060 Candiolo, Italy. Candiolo Cancer Institute-FPO, IRCCS, Str. Prov. 142 Km 3.95, I-10060 Candiolo, Italy. Center for Molecular Systems Biology, University of Torino, Regione Gonzole 10, I-10043 Orbassano, Italy. Current address: Department of Translational Medicine, Piemonte Orientale University 'Amedeo Avogadro', Via Solaroli 17, I-28100 Novara, Italy
| | | | | | | |
Collapse
|
23
|
Greene CS, Himmelstein DS. Genetic Association-Guided Analysis of Gene Networks for the Study of Complex Traits. ACTA ACUST UNITED AC 2017; 9:179-84. [PMID: 27094199 DOI: 10.1161/circgenetics.115.001181] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 03/08/2016] [Indexed: 12/29/2022]
Affiliation(s)
- Casey S Greene
- From the Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia (C.S.G.); and Biological and Medical Informatics, University of California, San Francisco (D.S.H.).
| | - Daniel S Himmelstein
- From the Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia (C.S.G.); and Biological and Medical Informatics, University of California, San Francisco (D.S.H.)
| |
Collapse
|
24
|
Zuo C, Chen K, Hewitt KJ, Bresnick EH, Keleş S. A Hierarchical Framework for State-Space Matrix Inference and Clustering. Ann Appl Stat 2016; 10:1348-1372. [PMID: 29910842 DOI: 10.1214/16-aoas938] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
In recent years, a large number of genomic and epigenomic studies have been focusing on the integrative analysis of multiple experimental datasets measured over a large number of observational units. The objectives of such studies include not only inferring a hidden state of activity for each unit over individual experiments, but also detecting highly associated clusters of units based on their inferred states. Although there are a number of methods tailored for specific datasets, there is currently no state-of-the-art modeling framework for this general class of problems. In this paper, we develop the MBASIC (Matrix Based Analysis for State-space Inference and Clustering) framework. MBASIC consists of two parts: state-space mapping and state-space clustering. In state-space mapping, it maps observations onto a finite state-space, representing the activation states of units across conditions. In state-space clustering, MBASIC incorporates a finite mixture model to cluster the units based on their inferred state-space profiles across all conditions. Both the state-space mapping and clustering can be simultaneously estimated through an Expectation-Maximization algorithm. MBASIC flexibly adapts to a large number of parametric distributions for the observed data, as well as the heterogeneity in replicate experiments. It allows for imposing structural assumptions on each cluster, and enables model selection using information criterion. In our data-driven simulation studies, MBASIC showed significant accuracy in recovering both the underlying state-space variables and clustering structures. We applied MBASIC to two genome research problems using large numbers of datasets from the ENCODE project. The first application grouped genes based on transcription factor occupancy profiles of their promoter regions in two different cell types. The second application focused on identifying groups of loci that are similar to a GATA2 binding site that is functional at its endogenous locus by utilizing transcription factor occupancy data and illustrated applicability of MBASIC in a wide variety of problems. In both studies, MBASIC showed higher levels of raw data fidelity than analyzing these data with a two-step approach using ENCODE results on transcription factor occupancy data.
Collapse
Affiliation(s)
- Chandler Zuo
- Department of Statistics, University of Wisconsin, Madison, WI, U.S.A.,Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, U.S.A
| | - Kailei Chen
- Department of Statistics, University of Wisconsin, Madison, WI, U.S.A.,Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, U.S.A
| | - Kyle J Hewitt
- Department of Cell and Regenerative Biology, University of Wisconsin, Madison, WI, U.S.A
| | - Emery H Bresnick
- Department of Cell and Regenerative Biology, University of Wisconsin, Madison, WI, U.S.A
| | - Sündüz Keleş
- Department of Statistics, University of Wisconsin, Madison, WI, U.S.A.,Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, U.S.A
| |
Collapse
|
25
|
Kim D, Kang M, Biswas A, Liu C, Gao J. Integrative approach for inference of gene regulatory networks using lasso-based random featuring and application to psychiatric disorders. BMC Med Genomics 2016; 9 Suppl 2:50. [PMID: 27510319 PMCID: PMC4980788 DOI: 10.1186/s12920-016-0202-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Inferring gene regulatory networks is one of the most interesting research areas in the systems biology. Many inference methods have been developed by using a variety of computational models and approaches. However, there are two issues to solve. First, depending on the structural or computational model of inference method, the results tend to be inconsistent due to innately different advantages and limitations of the methods. Therefore the combination of dissimilar approaches is demanded as an alternative way in order to overcome the limitations of standalone methods through complementary integration. Second, sparse linear regression that is penalized by the regularization parameter (lasso) and bootstrapping-based sparse linear regression methods were suggested in state of the art methods for network inference but they are not effective for a small sample size data and also a true regulator could be missed if the target gene is strongly affected by an indirect regulator with high correlation or another true regulator. RESULTS We present two novel network inference methods based on the integration of three different criteria, (i) z-score to measure the variation of gene expression from knockout data, (ii) mutual information for the dependency between two genes, and (iii) linear regression-based feature selection. Based on these criterion, we propose a lasso-based random feature selection algorithm (LARF) to achieve better performance overcoming the limitations of bootstrapping as mentioned above. CONCLUSIONS In this work, there are three main contributions. First, our z score-based method to measure gene expression variations from knockout data is more effective than similar criteria of related works. Second, we confirmed that the true regulator selection can be effectively improved by LARF. Lastly, we verified that an integrative approach can clearly outperform a single method when two different methods are effectively jointed. In the experiments, our methods were validated by outperforming the state of the art methods on DREAM challenge data, and then LARF was applied to inferences of gene regulatory network associated with psychiatric disorders.
Collapse
Affiliation(s)
- Dongchul Kim
- Department of Computer Science, University of Texas Rio Grande Valley, Edinburg, 78541 TX US
| | - Mingon Kang
- Department of Computer Science, Kennesaw State University, Marietta, 30144 GA US
| | - Ashis Biswas
- Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, 76019 TX US
| | - Chunyu Liu
- Department of Psychiatry, University of Illinois at Chicago, Chicago, 60607 IL US
| | - Jean Gao
- Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, 76019 TX US
| |
Collapse
|
26
|
Zhao H, Zhang G, Pang L, Lan Y, Wang L, Yu F, Hu J, Li F, Zhao T, Xiao Y, Li X. ‘Traffic light rules’: Chromatin states direct miRNA-mediated network motifs running by integrating epigenome and regulatome. Biochim Biophys Acta Gen Subj 2016; 1860:1475-88. [DOI: 10.1016/j.bbagen.2016.04.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Revised: 03/11/2016] [Accepted: 04/13/2016] [Indexed: 12/14/2022]
|
27
|
Ung MH, Liu CC, Cheng C. Integrative analysis of cancer genes in a functional interactome. Sci Rep 2016; 6:29228. [PMID: 27356765 PMCID: PMC4928112 DOI: 10.1038/srep29228] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Accepted: 06/16/2016] [Indexed: 11/09/2022] Open
Abstract
The post-genomic era has resulted in the accumulation of high-throughput cancer data from a vast array of genomic technologies including next-generation sequencing and microarray. As such, the large amounts of germline variant and somatic mutation data that have been generated from GWAS and sequencing projects, respectively, show great promise in providing a systems-level view of these genetic aberrations. In this study, we analyze publicly available GWAS, somatic mutation, and drug target data derived from large databanks using a network-based approach that incorporates directed edge information under a randomized network hypothesis testing procedure. We show that these three classes of disease-associated nodes exhibit non-random topological characteristics in the context of a functional interactome. Specifically, we show that drug targets tend to lie upstream of somatic mutations and disease susceptibility germline variants. In addition, we introduce a new approach to measuring hierarchy between drug targets, somatic mutants, and disease susceptibility genes by utilizing directionality and path length information. Overall, our results provide new insight into the intrinsic relationships between these node classes that broaden our understanding of cancer. In addition, our results align with current knowledge on the therapeutic actionability of GWAS and somatic mutant nodes, while demonstrating relationships between node classes from a global network perspective.
Collapse
Affiliation(s)
- Matthew H Ung
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA
| | - Chun-Chi Liu
- Institute of Genomics and Bioinformatics, National Chung Hsing University, Taiwan
| | - Chao Cheng
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA.,Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03766 USA
| |
Collapse
|
28
|
Chaitankar V, Karakülah G, Ratnapriya R, Giuste FO, Brooks MJ, Swaroop A. Next generation sequencing technology and genomewide data analysis: Perspectives for retinal research. Prog Retin Eye Res 2016; 55:1-31. [PMID: 27297499 DOI: 10.1016/j.preteyeres.2016.06.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Revised: 06/06/2016] [Accepted: 06/08/2016] [Indexed: 02/08/2023]
Abstract
The advent of high throughput next generation sequencing (NGS) has accelerated the pace of discovery of disease-associated genetic variants and genomewide profiling of expressed sequences and epigenetic marks, thereby permitting systems-based analyses of ocular development and disease. Rapid evolution of NGS and associated methodologies presents significant challenges in acquisition, management, and analysis of large data sets and for extracting biologically or clinically relevant information. Here we illustrate the basic design of commonly used NGS-based methods, specifically whole exome sequencing, transcriptome, and epigenome profiling, and provide recommendations for data analyses. We briefly discuss systems biology approaches for integrating multiple data sets to elucidate gene regulatory or disease networks. While we provide examples from the retina, the NGS guidelines reviewed here are applicable to other tissues/cell types as well.
Collapse
Affiliation(s)
- Vijender Chaitankar
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA
| | - Gökhan Karakülah
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA
| | - Rinki Ratnapriya
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA
| | - Felipe O Giuste
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA
| | - Matthew J Brooks
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA
| | - Anand Swaroop
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, 6 Center Drive, Bethesda, MD, 20892-0610, USA.
| |
Collapse
|
29
|
LIN XIAOCONG, XU YONG, SUN GUOPING, WEN JINLI, LI NING, ZHANG YUMING, YANG ZHIGANG, ZHANG HAITAO, DAI YONG. Molecular dysfunctions in acute myeloid leukemia revealed by integrated analysis of microRNA and transcription factor. Int J Oncol 2016; 48:2367-80. [DOI: 10.3892/ijo.2016.3489] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2015] [Accepted: 01/19/2016] [Indexed: 11/05/2022] Open
|
30
|
Abstract
Biological systems are complex. In particular, the interactions between molecular components often form dense networks that, more often than not, are criticized for being inscrutable 'hairballs'. We argue that one way of untangling these hairballs is through cross-disciplinary network comparison-leveraging advances in other disciplines to obtain new biological insights. In some cases, such comparisons enable the direct transfer of mathematical formalism between disciplines, precisely describing the abstract associations between entities and allowing us to apply a variety of sophisticated formalisms to biology. In cases where the detailed structure of the network does not permit the transfer of complete formalisms between disciplines, comparison of mechanistic interactions in systems for which we have significant day-to-day experience can provide analogies for interpreting relatively more abstruse biological networks. Here, we illustrate how these comparisons benefit the field with a few specific examples related to network growth, organizational hierarchies, and the evolution of adaptive systems.
Collapse
|
31
|
Construction and analysis of dynamic transcription factor regulatory networks in the progression of glioma. Sci Rep 2015; 5:15953. [PMID: 26526635 PMCID: PMC4630656 DOI: 10.1038/srep15953] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2015] [Accepted: 10/02/2015] [Indexed: 12/21/2022] Open
Abstract
The combinatorial cross-regulation of transcription factors (TFs) plays an important role in cellular identity and function; however, the dynamic regulation of TFs during glioma progression remains largely unknown. Here, we used the genome-wide expression of TFs to construct an extensive human TF network comprising interactions among 513 TFs and to analyse the dynamics of the TF-TF network during glioma progression. We found that the TF regulatory networks share a common architecture and that the topological structures are conserved. Strikingly, despite the conservation of the network architecture, TF regulatory networks are highly grade specific, and TF circuitry motifs are dynamically rewired during glioma progression. In addition, the most frequently observed structure in the grade-specific TF networks was the feedforward loop (FFL). We described applications that show how investigating the behaviour of FFLs in glioblastoma can reveal FFLs (such as RARG-NR1I2-CDX2) that are associated with prognosis. We constructed comprehensive TF-TF networks and systematically analysed the circuitry, dynamics, and topological principles of the networks during glioma progression, which will further enhance our understanding of the functions of TFs in glioma.
Collapse
|
32
|
Roy S, Siahpirani AF, Chasman D, Knaack S, Ay F, Stewart R, Wilson M, Sridharan R. A predictive modeling approach for cell line-specific long-range regulatory interactions. Nucleic Acids Res 2015; 43:8694-712. [PMID: 26338778 PMCID: PMC4605315 DOI: 10.1093/nar/gkv865] [Citation(s) in RCA: 85] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2015] [Revised: 08/16/2015] [Accepted: 08/17/2015] [Indexed: 01/28/2023] Open
Abstract
Long range regulatory interactions among distal enhancers and target genes are important for tissue-specific gene expression. Genome-scale identification of these interactions in a cell line-specific manner, especially using the fewest possible datasets, is a significant challenge. We develop a novel computational approach, Regulatory Interaction Prediction for Promoters and Long-range Enhancers (RIPPLE), that integrates published Chromosome Conformation Capture (3C) data sets with a minimal set of regulatory genomic data sets to predict enhancer-promoter interactions in a cell line-specific manner. Our results suggest that CTCF, RAD21, a general transcription factor (TBP) and activating chromatin marks are important determinants of enhancer-promoter interactions. To predict interactions in a new cell line and to generate genome-wide interaction maps, we develop an ensemble version of RIPPLE and apply it to generate interactions in five human cell lines. Computational validation of these predictions using existing ChIA-PET and Hi-C data sets showed that RIPPLE accurately predicts interactions among enhancers and promoters. Enhancer-promoter interactions tend to be organized into subnetworks representing coordinately regulated sets of genes that are enriched for specific biological processes and cis-regulatory elements. Overall, our work provides a systematic approach to predict and interpret enhancer-promoter interactions in a genome-wide cell-type specific manner using a few experimentally tractable measurements.
Collapse
Affiliation(s)
- Sushmita Roy
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, WI, USA
| | | | - Deborah Chasman
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, WI, USA
| | - Sara Knaack
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, WI, USA
| | - Ferhat Ay
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Ron Stewart
- Morgridge Institute for Research, Madison, WI 53715, USA
| | - Michael Wilson
- Genetics & Genome Biology Program, Hospital for Sick Children (SickKids) and Department of Molecular Genetics, University of Toronto,Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, ON, Canada
| | - Rupa Sridharan
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA Department of Cell and Regenerative biology, University of Wisconsin, Madison, WI 53715, USA
| |
Collapse
|
33
|
Narang V, Ramli MA, Singhal A, Kumar P, de Libero G, Poidinger M, Monterola C. Automated Identification of Core Regulatory Genes in Human Gene Regulatory Networks. PLoS Comput Biol 2015; 11:e1004504. [PMID: 26393364 PMCID: PMC4578944 DOI: 10.1371/journal.pcbi.1004504] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Accepted: 08/11/2015] [Indexed: 12/20/2022] Open
Abstract
Human gene regulatory networks (GRN) can be difficult to interpret due to a tangle of edges interconnecting thousands of genes. We constructed a general human GRN from extensive transcription factor and microRNA target data obtained from public databases. In a subnetwork of this GRN that is active during estrogen stimulation of MCF-7 breast cancer cells, we benchmarked automated algorithms for identifying core regulatory genes (transcription factors and microRNAs). Among these algorithms, we identified K-core decomposition, pagerank and betweenness centrality algorithms as the most effective for discovering core regulatory genes in the network evaluated based on previously known roles of these genes in MCF-7 biology as well as in their ability to explain the up or down expression status of up to 70% of the remaining genes. Finally, we validated the use of K-core algorithm for organizing the GRN in an easier to interpret layered hierarchy where more influential regulatory genes percolate towards the inner layers. The integrated human gene and miRNA network and software used in this study are provided as supplementary materials (S1 Data) accompanying this manuscript. A gene regulatory network (GRN) represents how some genes encoding regulatory molecules such as transcription factors or microRNAs regulate the expression of other genes. Researchers commonly study GRNs involved in a specific biological process with the aim of identifying a few important regulatory genes. In higher organisms such as humans, a regulatory gene regulates multiple target genes and correspondingly any gene is regulated by multiple regulatory genes. Due to such multiplicity of interactions, a GRN usually resembles a tangled hairball wherein it is difficult to identify few most influential regulatory genes. In this study, we show that network analysis algorithms such as K-core, pagerank and betweenness centrality are useful for identifying a few important or core regulatory genes in a GRN, and the K-core algorithm is also useful for organizing regulatory genes in a hierarchical layered structure where the most influential genes in a GRN are found within the innermost layer or core. These few core regulatory genes determine to a large extent the expression status of the remaining genes in the network. We illustrate a pragmatic application of this technique to GRNs reconstructed from genome-wide gene expression measurements in the MCF-7 human breast cancer cell line.
Collapse
|
34
|
Thompson D, Regev A, Roy S. Comparative analysis of gene regulatory networks: from network reconstruction to evolution. Annu Rev Cell Dev Biol 2015; 31:399-428. [PMID: 26355593 DOI: 10.1146/annurev-cellbio-100913-012908] [Citation(s) in RCA: 95] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Collapse
Affiliation(s)
- Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | | | | |
Collapse
|
35
|
Yao S, Yoo S, Yu D. Prior knowledge driven Granger causality analysis on gene regulatory network discovery. BMC Bioinformatics 2015; 16:273. [PMID: 26316173 PMCID: PMC4551367 DOI: 10.1186/s12859-015-0710-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Accepted: 08/17/2015] [Indexed: 12/20/2022] Open
Abstract
Background Our study focuses on discovering gene regulatory networks from time series gene expression data using the Granger causality (GC) model. However, the number of available time points (T) usually is much smaller than the number of target genes (n) in biological datasets. The widely applied pairwise GC model (PGC) and other regularization strategies can lead to a significant number of false identifications when n>>T. Results In this study, we proposed a new method, viz., CGC-2SPR (CGC using two-step prior Ridge regularization) to resolve the problem by incorporating prior biological knowledge about a target gene data set. In our simulation experiments, the propose new methodology CGC-2SPR showed significant performance improvement in terms of accuracy over other widely used GC modeling (PGC, Ridge and Lasso) and MI-based (MRNET and ARACNE) methods. In addition, we applied CGC-2SPR to a real biological dataset, i.e., the yeast metabolic cycle, and discovered more true positive edges with CGC-2SPR than with the other existing methods. Conclusions In our research, we noticed a “ 1+1>2” effect when we combined prior knowledge and gene expression data to discover regulatory networks. Based on causality networks, we made a functional prediction that the Abm1 gene (its functions previously were unknown) might be related to the yeast’s responses to different levels of glucose. Our research improves causality modeling by combining heterogeneous knowledge, which is well aligned with the future direction in system biology. Furthermore, we proposed a method of Monte Carlo significance estimation (MCSE) to calculate the edge significances which provide statistical meanings to the discovered causality networks. All of our data and source codes will be available under the link https://bitbucket.org/dtyu/granger-causality/wiki/Home.
Collapse
Affiliation(s)
- Shun Yao
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, 11790, NY, USA. .,Computational Science Center, Brookhaven National Laboratory, Upton, 11793, NY, USA.
| | - Shinjae Yoo
- Computational Science Center, Brookhaven National Laboratory, Upton, 11793, NY, USA.
| | - Dantong Yu
- Computational Science Center, Brookhaven National Laboratory, Upton, 11793, NY, USA.
| |
Collapse
|
36
|
Liu ZP. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data. Curr Genomics 2015; 16:3-22. [PMID: 25937810 PMCID: PMC4412962 DOI: 10.2174/1389202915666141110210634] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Revised: 09/05/2014] [Accepted: 09/05/2014] [Indexed: 12/17/2022] Open
Abstract
Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented.
Collapse
Affiliation(s)
- Zhi-Ping Liu
- Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| |
Collapse
|
37
|
Wang D, Yan KK, Sisu C, Cheng C, Rozowsky J, Meyerson W, Gerstein MB. Loregic: a method to characterize the cooperative logic of regulatory factors. PLoS Comput Biol 2015; 11:e1004132. [PMID: 25884877 PMCID: PMC4401777 DOI: 10.1371/journal.pcbi.1004132] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 01/12/2015] [Indexed: 12/24/2022] Open
Abstract
The topology of the gene-regulatory network has been extensively analyzed. Now, given the large amount of available functional genomic data, it is possible to go beyond this and systematically study regulatory circuits in terms of logic elements. To this end, we present Loregic, a computational method integrating gene expression and regulatory network data, to characterize the cooperativity of regulatory factors. Loregic uses all 16 possible two-input-one-output logic gates (e.g. AND or XOR) to describe triplets of two factors regulating a common target. We attempt to find the gate that best matches each triplet’s observed gene expression pattern across many conditions. We make Loregic available as a general-purpose tool (github.com/gersteinlab/loregic). We validate it with known yeast transcription-factor knockout experiments. Next, using human ENCODE ChIP-Seq and TCGA RNA-Seq data, we are able to demonstrate how Loregic characterizes complex circuits involving both proximally and distally regulating transcription factors (TFs) and also miRNAs. Furthermore, we show that MYC, a well-known oncogenic driving TF, can be modeled as acting independently from other TFs (e.g., using OR gates) but antagonistically with repressing miRNAs. Finally, we inter-relate Loregic’s gate logic with other aspects of regulation, such as indirect binding via protein-protein interactions, feed-forward loop motifs and global regulatory hierarchy. Gene expression is controlled by various gene regulatory factors. Those factors work cooperatively forming a complex regulatory circuit genome wide. Corruptions of regulatory cooperativity may lead to abnormal gene expression activity such as cancer. Traditional experimental methods, however, can only identify small-scale regulatory activity. Thus, to systematically understand the cooperativity between and among different types of regulatory factors, we need the efficient and systematic computational methods. Regulatory circuits have been suggested to behave analogously to the electronic circuits in which a wide variety of electronic elements work coordinately to function correctly. Recently, an increasing amount of next generation sequencing data provides a great resource to study regulatory activity. Thus, we developed a general-purpose computational method using logic-circuit models from electronics and applied it to a human leukemia dataset, identifying the genome-wide cooperativity of transcription factors and microRNAs.
Collapse
Affiliation(s)
- Daifeng Wang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - Koon-Kiu Yan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - Cristina Sisu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - Chao Cheng
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, United States of America
| | - Joel Rozowsky
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - William Meyerson
- School of Medicine, Yale University, New Haven, Connecticut, United States of America
| | - Mark B. Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Department of Computer Science, Yale University, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
38
|
Cheng C, Andrews E, Yan KK, Ung M, Wang D, Gerstein M. An approach for determining and measuring network hierarchy applied to comparing the phosphorylome and the regulome. Genome Biol 2015; 16:63. [PMID: 25880651 PMCID: PMC4404648 DOI: 10.1186/s13059-015-0624-2] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 03/10/2015] [Indexed: 12/16/2022] Open
Abstract
Many biological networks naturally form a hierarchy with a preponderance of downward information flow. In this study, we define a score to quantify the degree of hierarchy in a network and develop a simulated-annealing algorithm to maximize the hierarchical score globally over a network. We apply our algorithm to determine the hierarchical structure of the phosphorylome in detail and investigate the correlation between its hierarchy and kinase properties. We also compare it to the regulatory network, finding that the phosphorylome is more hierarchical than the regulome.
Collapse
Affiliation(s)
- Chao Cheng
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA. .,Institute for Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, USA. .,Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, USA.
| | - Erik Andrews
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA.
| | - Koon-Kiu Yan
- Program in Computational Biology and Bioinformatics, Yale University, 260 Whitney Avenue, New Haven, CT, 06520, USA.
| | - Matthew Ung
- Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA.
| | - Daifeng Wang
- Program in Computational Biology and Bioinformatics, Yale University, 260 Whitney Avenue, New Haven, CT, 06520, USA.
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, 260 Whitney Avenue, New Haven, CT, 06520, USA. .,Department of Molecular Biophysics and Biochemistry, Yale University, 260 Whitney Avenue, New Haven, CT, 06520, USA. .,Department of Computer Science, Yale University, 260 Whitney Avenue, New Haven, CT, 06520, USA.
| |
Collapse
|
39
|
Liang C, Li Y, Luo J, Zhang Z. A novel motif-discovery algorithm to identify co-regulatory motifs in large transcription factor and microRNA co-regulatory networks in human. Bioinformatics 2015; 31:2348-55. [DOI: 10.1093/bioinformatics/btv159] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 03/13/2015] [Indexed: 01/23/2023] Open
|
40
|
Krishnan R, Dhar PK. Systems Biology of MicroRNA. SYSTEMS AND SYNTHETIC BIOLOGY 2015. [DOI: 10.1007/978-94-017-9514-2_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
41
|
Wu D, Huang Y, Kang J, Li K, Bi X, Zhang T, Jin N, Hu Y, Tan P, Zhang L, Yi Y, Shen W, Huang J, Li X, Li X, Xu J, Wang D. ncRDeathDB: A comprehensive bioinformatics resource for deciphering network organization of the ncRNA-mediated cell death system. Autophagy 2015; 11:1917-26. [PMID: 26431463 PMCID: PMC4824571 DOI: 10.1080/15548627.2015.1089375] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Revised: 08/18/2015] [Accepted: 08/27/2015] [Indexed: 02/05/2023] Open
Abstract
Programmed cell death (PCD) is a critical biological process involved in many important processes, and defects in PCD have been linked with numerous human diseases. In recent years, the protein architecture in different PCD subroutines has been explored, but our understanding of the global network organization of the noncoding RNA (ncRNA)-mediated cell death system is limited and ambiguous. Hence, we developed the comprehensive bioinformatics resource (ncRDeathDB, www.rna-society.org/ncrdeathdb ) to archive ncRNA-associated cell death interactions. The current version of ncRDeathDB documents a total of more than 4600 ncRNA-mediated PCD entries in 12 species. ncRDeathDB provides a user-friendly interface to query, browse and manipulate these ncRNA-associated cell death interactions. Furthermore, this resource will help to visualize and navigate current knowledge of the noncoding RNA component of cell death and autophagy, to uncover the generic organizing principles of ncRNA-associated cell death systems, and to generate valuable biological hypotheses.
Collapse
Affiliation(s)
- Deng Wu
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Yan Huang
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Juanjuan Kang
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
- Bioinformatics Center; School of Life Science; University of Electronic Science and Technology of China; Chengdu, China
| | - Kongning Li
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Xiaoman Bi
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Ting Zhang
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Nana Jin
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Yongfei Hu
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Puwen Tan
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Lu Zhang
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Ying Yi
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
| | - Wenjun Shen
- Department of Biochemistry and Molecular Biology, Shantou University Medical College; Shantou, China
| | - Jian Huang
- Bioinformatics Center; School of Life Science; University of Electronic Science and Technology of China; Chengdu, China
| | - Xiaobo Li
- Department of Pathology, Harbin Medical University; Harbin, China
| | - Xia Li
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
- Correspondence to: Dong Wang; ; Jianzhen Xu; ; Xia Li;
| | - Jianzhen Xu
- Department of Biochemistry and Molecular Biology, Shantou University Medical College; Shantou, China
- Correspondence to: Dong Wang; ; Jianzhen Xu; ; Xia Li;
| | - Dong Wang
- College of Bioinformatics Science and Technology; Harbin Medical University; Harbin, China
- Department of Biochemistry and Molecular Biology, Shantou University Medical College; Shantou, China
- Correspondence to: Dong Wang; ; Jianzhen Xu; ; Xia Li;
| |
Collapse
|
42
|
Regression analysis of combined gene expression regulation in acute myeloid leukemia. PLoS Comput Biol 2014; 10:e1003908. [PMID: 25340776 PMCID: PMC4207489 DOI: 10.1371/journal.pcbi.1003908] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2014] [Accepted: 09/13/2014] [Indexed: 12/04/2022] Open
Abstract
Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML prognostic marker. Together, we provided a novel framework that successfully integrated the TCGA and ENCODE data in revealing AML-specific regulatory program at global level. Recent studies from The Cancer Genome Atlas (TCGA) showed that most Acute Myeloid Leukemia (AML) patients lack DNA mutations, which can potentially explain the tumorigenesis, and motivated a systematic approach to elucidate aberrant molecular signatures at the transcriptional and epigenetic levels. Using recently available data from two large consortia namely Encyclopedia of DNA Elements and TCGA, we developed a novel computational model to infer the regulatory activities of the expression regulators and their target genes in AML samples. Our analysis revealed 18 regulators whose dysregulation contributed significantly to explaining the global mRNA expression changes. Encouragingly, the inferred activities of these regulatory features followed a consistent pattern with cytogenetic phenotypes of the AML patients. Among these regulators, we identified microRNA hsa-miR-548p, whose regulatory relationships with leukemia-related genes including YY1 suggest its novel role in AML pathogenesis. Additionally, we discovered that the inferred activities of transcription factor C-Fos can be used as a prognostic marker to characterize survival rate of the AML patients. Together, we demonstrated an effective model that can integrate useful information from a large amount of heterogeneous data to dissect regulatory effects. Furthermore, the novel biological findings from this study may be constructive to future experimental research in AML.
Collapse
|
43
|
Liu ZP, Wu H, Zhu J, Miao H. Systematic identification of transcriptional and post-transcriptional regulations in human respiratory epithelial cells during influenza A virus infection. BMC Bioinformatics 2014; 15:336. [PMID: 25281301 PMCID: PMC4287445 DOI: 10.1186/1471-2105-15-336] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Accepted: 09/23/2014] [Indexed: 12/13/2022] Open
Abstract
Background Respiratory epithelial cells are the primary target of influenza virus infection in human. However, the molecular mechanisms of airway epithelial cell responses to viral infection are not fully understood. Revealing genome-wide transcriptional and post-transcriptional regulatory relationships can further advance our understanding of this problem, which motivates the development of novel and more efficient computational methods to simultaneously infer the transcriptional and post-transcriptional regulatory networks. Results Here we propose a novel framework named SITPR to investigate the interactions among transcription factors (TFs), microRNAs (miRNAs) and target genes. Briefly, a background regulatory network on a genome-wide scale (~23,000 nodes and ~370,000 potential interactions) is constructed from curated knowledge and algorithm predictions, to which the identification of transcriptional and post-transcriptional regulatory relationships is anchored. To reduce the dimension of the associated computing problem down to an affordable size, several topological and data-based approaches are used. Furthermore, we propose the constrained LASSO formulation and combine it with the dynamic Bayesian network (DBN) model to identify the activated regulatory relationships from time-course expression data. Our simulation studies on networks of different sizes suggest that the proposed framework can effectively determine the genuine regulations among TFs, miRNAs and target genes; also, we compare SITPR with several selected state-of-the-art algorithms to further evaluate its performance. By applying the SITPR framework to mRNA and miRNA expression data generated from human lung epithelial A549 cells in response to A/Mexico/InDRE4487/2009 (H1N1) virus infection, we are able to detect the activated transcriptional and post-transcriptional regulatory relationships as well as the significant regulatory motifs. Conclusion Compared with other representative state-of-the-art algorithms, the proposed SITPR framework can more effectively identify the activated transcriptional and post-transcriptional regulations simultaneously from a given background network. The idea of SITPR is generally applicable to the analysis of gene regulatory networks in human cells. The results obtained for human respiratory epithelial cells suggest the importance of the transcriptional, post-transcriptional regulations as well as their synergies in the innate immune responses against IAV infection. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-336) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Jian Zhu
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY 14642, USA.
| | | |
Collapse
|
44
|
Heyndrickx KS, Van de Velde J, Wang C, Weigel D, Vandepoele K. A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana. THE PLANT CELL 2014; 26:3894-910. [PMID: 25361952 PMCID: PMC4247581 DOI: 10.1105/tpc.114.130591] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Revised: 10/07/2014] [Accepted: 10/12/2014] [Indexed: 05/19/2023]
Abstract
Understanding the mechanisms underlying gene regulation is paramount to comprehend the translation from genotype to phenotype. The two are connected by gene expression, and it is generally thought that variation in transcription factor (TF) function is an important determinant of phenotypic evolution. We analyzed publicly available genome-wide chromatin immunoprecipitation experiments for 27 TFs in Arabidopsis thaliana and constructed an experimental network containing 46,619 regulatory interactions and 15,188 target genes. We identified hub targets and highly occupied target (HOT) regions, which are enriched for genes involved in development, stimulus responses, signaling, and gene regulatory processes in the currently profiled network. We provide several lines of evidence that TF binding at plant HOT regions is functional, in contrast to that in animals, and not merely the result of accessible chromatin. HOT regions harbor specific DNA motifs, are enriched for differentially expressed genes, and are often conserved across crucifers and dicots, even though they are not under higher levels of purifying selection than non-HOT regions. Distal bound regions are under purifying selection as well and are enriched for a chromatin state showing regulation by the Polycomb repressive complex. Gene expression complexity is positively correlated with the total number of bound TFs, revealing insights in the regulatory code for genes with different expression breadths. The integration of noncanonical and canonical DNA motif information yields new hypotheses on cobinding and tethering between specific TFs involved in flowering and light regulation.
Collapse
Affiliation(s)
- Ken S Heyndrickx
- Department of Plant Systems Biology, VIB, 9052 Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
| | - Jan Van de Velde
- Department of Plant Systems Biology, VIB, 9052 Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
| | - Congmao Wang
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Klaas Vandepoele
- Department of Plant Systems Biology, VIB, 9052 Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
| |
Collapse
|
45
|
Ni Y, Stingo FC, Baladandayuthapani V. Integrative bayesian network analysis of genomic data. Cancer Inform 2014; 13:39-48. [PMID: 25288878 PMCID: PMC4179606 DOI: 10.4137/cin.s13786] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/04/2014] [Accepted: 05/05/2014] [Indexed: 01/09/2023] Open
Abstract
Rapid development of genome-wide profiling technologies has made it possible to conduct integrative analysis on genomic data from multiple platforms. In this study, we develop a novel integrative Bayesian network approach to investigate the relationships between genetic and epigenetic alterations as well as how these mutations affect a patient's clinical outcome. We take a Bayesian network approach that admits a convenient decomposition of the joint distribution into local distributions. Exploiting the prior biological knowledge about regulatory mechanisms, we model each local distribution as linear regressions. This allows us to analyze multi-platform genome-wide data in a computationally efficient manner. We illustrate the performance of our approach through simulation studies. Our methods are motivated by and applied to a multi-platform glioblastoma dataset, from which we reveal several biologically relevant relationships that have been validated in the literature as well as new genes that could potentially be novel biomarkers for cancer progression.
Collapse
Affiliation(s)
- Yang Ni
- Department of Statistics, Rice University, Houston, Texas, USA
| | - Francesco C Stingo
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | | |
Collapse
|
46
|
Smith-Vikos T, de Lencastre A, Inukai S, Shlomchik M, Holtrup B, Slack FJ. MicroRNAs mediate dietary-restriction-induced longevity through PHA-4/FOXA and SKN-1/Nrf transcription factors. Curr Biol 2014; 24:2238-46. [PMID: 25242029 DOI: 10.1016/j.cub.2014.08.013] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2013] [Revised: 06/30/2014] [Accepted: 08/07/2014] [Indexed: 01/23/2023]
Abstract
BACKGROUND Dietary restriction (DR) has been shown to prolong longevity across diverse taxa, yet the mechanistic relationship between DR and longevity remains unclear. MicroRNAs (miRNAs) control aging-related functions such as metabolism and lifespan through regulation of genes in insulin signaling, mitochondrial respiration, and protein homeostasis. RESULTS We have conducted a network analysis of aging-associated miRNAs connected to transcription factors PHA-4/FOXA and SKN-1/Nrf, which are both necessary for DR-induced lifespan extension in Caenorhabditis elegans. Our network analysis has revealed extensive regulatory interactions between PHA-4, SKN-1, and miRNAs and points to two aging-associated miRNAs, miR-71 and miR-228, as key nodes of this network. We show that miR-71 and miR-228 are critical for the response to DR in C. elegans. DR induces the expression of miR-71 and miR-228, and the regulation of these miRNAs depends on PHA-4 and SKN-1. In turn, we show that PHA-4 and SKN-1 are negatively regulated by miR-228, whereas miR-71 represses PHA-4. CONCLUSIONS Based on our findings, we have discovered new links in an important pathway connecting DR to aging. By interacting with PHA-4 and SKN-1, miRNAs transduce the effect of dietary-restriction-mediated lifespan extension in C. elegans. Given the conservation of miRNAs, PHA-4, and SKN-1 across phylogeny, these interactions are likely to be conserved in more-complex species.
Collapse
Affiliation(s)
- Thalyana Smith-Vikos
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA
| | - Alexandre de Lencastre
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA
| | - Sachi Inukai
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA
| | - Mariel Shlomchik
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA
| | - Brandon Holtrup
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA
| | - Frank J Slack
- Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208103, New Haven, CT 06520, USA.
| |
Collapse
|
47
|
Angelini C, Costa V. Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems. Front Cell Dev Biol 2014; 2:51. [PMID: 25364758 PMCID: PMC4207007 DOI: 10.3389/fcell.2014.00051] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 09/01/2014] [Indexed: 11/15/2022] Open
Abstract
The availability of omic data produced from international consortia, as well as from worldwide laboratories, is offering the possibility both to answer long-standing questions in biomedicine/molecular biology and to formulate novel hypotheses to test. However, the impact of such data is not fully exploited due to a limited availability of multi-omic data integration tools and methods. In this paper, we discuss the interplay between gene expression and epigenetic markers/transcription factors. We show how integrating ChIP-seq and RNA-seq data can help to elucidate gene regulatory mechanisms. In particular, we discuss the two following questions: (i) Can transcription factor occupancies or histone modification data predict gene expression? (ii) Can ChIP-seq and RNA-seq data be used to infer gene regulatory networks? We propose potential directions for statistical data integration. We discuss the importance of incorporating underestimated aspects (such as alternative splicing and long-range chromatin interactions). We also highlight the lack of data benchmarks and the need to develop tools for data integration from a statistical viewpoint, designed in the spirit of reproducible research.
Collapse
Affiliation(s)
- Claudia Angelini
- Istituto per le Applicazioni del Calcolo "M. Picone" - CNR Napoli, Italy ; Computational and Biology Open Laboratory (ComBOlab) Napoli, Italy
| | - Valerio Costa
- Computational and Biology Open Laboratory (ComBOlab) Napoli, Italy ; Institute of Genetics and Biophysics "A. Buzzati-Traverso" - CNR Napoli, Italy
| |
Collapse
|
48
|
Albergante L, Blow JJ, Newman TJ. Buffered Qualitative Stability explains the robustness and evolvability of transcriptional networks. eLife 2014; 3:e02863. [PMID: 25182846 PMCID: PMC4151086 DOI: 10.7554/elife.02863] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 08/08/2014] [Indexed: 01/30/2023] Open
Abstract
The gene regulatory network (GRN) is the central decision-making module of the cell. We have developed a theory called Buffered Qualitative Stability (BQS) based on the hypothesis that GRNs are organised so that they remain robust in the face of unpredictable environmental and evolutionary changes. BQS makes strong and diverse predictions about the network features that allow stable responses under arbitrary perturbations, including the random addition of new connections. We show that the GRNs of E. coli, M. tuberculosis, P. aeruginosa, yeast, mouse, and human all verify the predictions of BQS. BQS explains many of the small- and large-scale properties of GRNs, provides conditions for evolvable robustness, and highlights general features of transcriptional response. BQS is severely compromised in a human cancer cell line, suggesting that loss of BQS might underlie the phenotypic plasticity of cancer cells, and highlighting a possible sequence of GRN alterations concomitant with cancer initiation.
Collapse
Affiliation(s)
- Luca Albergante
- College of Life Sciences, University of Dundee, Dundee, United Kingdom
| | - J Julian Blow
- College of Life Sciences, University of Dundee, Dundee, United Kingdom
| | - Timothy J Newman
- College of Life Sciences, University of Dundee, Dundee, United Kingdom School of Engineering, Physics and Mathematics, University of Dundee, Dundee, United Kingdom
| |
Collapse
|
49
|
The comprehensive transcriptional analysis in Caenorhabditis elegans by integrating ChIP-seq and gene expression data. Genet Res (Camb) 2014; 96:e005. [PMID: 25023089 DOI: 10.1017/s0016672314000081] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The fundamental step of learning transcriptional regulation mechanism is to identify the target genes regulated by transcription factors (TFs). Despite numerous target genes identified by chromatin immunoprecipitation followed by high-throughput sequencing technology (ChIP-seq) assays, it is not possible to infer function from binding alone in vivo. This is equally true in one of the best model systems, the nematode Caenorhabditis elegans (C. elegans), where regulation often occurs through diverse TF binding features of transcriptional networks identified in modENCODE. Here, we integrated ten ChIP-seq datasets with genome-wide expression data derived from tiling arrays, involved in six TFs (HLH-1, ELT-3, PQM-1, SKN-1, CEH-14 and LIN-11) with tissue-specific and four TFs (CEH-30, LIN-13, LIN-15B and MEP-1) with broad expression patterns. In common, TF bindings within 3 kb upstream of or within its target gene for these ten studies showed significantly elevated level of expression as opposed to that of non-target controls, indicated that these sites may be more likely to be functional through up-regulating its target genes. Intriguingly, expression of the target genes out of 5 kb upstream of their transcription start site also showed high levels, which was consistent with the results of following network component analysis. Our study has identified similar transcriptional regulation mechanisms of tissue-specific or broad expression TFs in C. elegans using ChIP-seq and gene expression data. It may also provide a novel insight into the mechanism of transcriptional regulation not only for simple organisms but also for more complex species.
Collapse
|
50
|
Afshar AS, Xu J, Goutsias J. Integrative identification of deregulated miRNA/TF-mediated gene regulatory loops and networks in prostate cancer. PLoS One 2014; 9:e100806. [PMID: 24968068 PMCID: PMC4072696 DOI: 10.1371/journal.pone.0100806] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Accepted: 05/28/2014] [Indexed: 01/07/2023] Open
Abstract
MicroRNAs (miRNAs) have attracted a great deal of attention in biology and medicine. It has been hypothesized that miRNAs interact with transcription factors (TFs) in a coordinated fashion to play key roles in regulating signaling and transcriptional pathways and in achieving robust gene regulation. Here, we propose a novel integrative computational method to infer certain types of deregulated miRNA-mediated regulatory circuits at the transcriptional, post-transcriptional and signaling levels. To reliably predict miRNA-target interactions from mRNA/miRNA expression data, our method collectively utilizes sequence-based miRNA-target predictions obtained from several algorithms, known information about mRNA and miRNA targets of TFs available in existing databases, certain molecular structures identified to be statistically over-represented in gene regulatory networks, available molecular subtyping information, and state-of-the-art statistical techniques to appropriately constrain the underlying analysis. In this way, the method exploits almost every aspect of extractable information in the expression data. We apply our procedure on mRNA/miRNA expression data from prostate tumor and normal samples and detect numerous known and novel miRNA-mediated deregulated loops and networks in prostate cancer. We also demonstrate instances of the results in a number of distinct biological settings, which are known to play crucial roles in prostate and other types of cancer. Our findings show that the proposed computational method can be used to effectively achieve notable insights into the poorly understood molecular mechanisms of miRNA-mediated interactions and dissect their functional roles in cancer in an effort to pave the way for miRNA-based therapeutics in clinical settings.
Collapse
Affiliation(s)
- Ali Sobhi Afshar
- Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Joseph Xu
- Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - John Goutsias
- Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|