1
|
Wanniarachchi DV, Viswakula S, Wickramasuriya AM. The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes. BMC Bioinformatics 2024; 25:371. [PMID: 39623329 PMCID: PMC11613939 DOI: 10.1186/s12859-024-05995-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Accepted: 11/21/2024] [Indexed: 12/06/2024] Open
Abstract
BACKGROUND The precise prediction of transcription factor binding sites (TFBSs) is pivotal for unraveling the gene regulatory networks underlying biological processes. While numerous tools have emerged for in silico TFBS prediction in recent years, the evolving landscape of computational biology necessitates thorough assessments of tool performance to ensure accuracy and reliability. Only a limited number of studies have been conducted to evaluate the performance of TFBS prediction tools comprehensively. Thus, the present study focused on assessing twelve widely used TFBS prediction tools and four de novo motif discovery tools using a benchmark dataset comprising real, generic, Markov, and negative sequences. TFBSs of Arabidopsis thaliana and Homo sapiens genomes downloaded from the JASPAR database were implanted in these sequences and the performance of tools was evaluated using several statistical parameters at different overlap percentages between the lengths of known and predicted binding sites. RESULTS Overall, the Multiple Cluster Alignment and Search Tool (MCAST) emerged as the best TFBS prediction tool, followed by Find Individual Motif Occurrences (FIMO) and MOtif Occurrence Detection Suite (MOODS). In addition, MotEvo and Dinucleotide Weight Tensor Toolbox (DWT-toolbox) demonstrated the highest sensitivity in identifying TFBSs at 90% and 80% overlap. Further, MCAST and DWT-toolbox managed to demonstrate the highest sensitivity across all three data types real, generic, and Markov. Among the de novo motif discovery tools, the Multiple Em for Motif Elicitation (MEME) emerged as the best performer. An analysis of the promoter regions of genes involved in the anthocyanin biosynthesis pathway in plants and the pentose phosphate pathway in humans, using the three best-performing tools, revealed considerable variation among the top 20 motifs identified by these tools. CONCLUSION The findings of this study lay a robust groundwork for selecting optimal TFBS prediction tools for future research. Given the variability observed in tool performance, employing multiple tools for identifying TFBSs in a set of sequences is highly recommended. In addition, further studies are recommended to develop an integrated toolbox that incorporates TFBS prediction or motif discovery tools, aiming to streamline result precision and accuracy.
Collapse
Affiliation(s)
- Dinithi V Wanniarachchi
- Department of Plant Sciences, Faculty of Science, University of Colombo, Colombo 03, Sri Lanka
| | - Sameera Viswakula
- Department of Statistics, Faculty of Science, University of Colombo, Colombo 03, Sri Lanka
| | | |
Collapse
|
2
|
Roestel JA, Wiersema JH, Jansen RK, Borsch T, Gruenstaeudl M. On the importance of sequence alignment inspections in plastid phylogenomics - an example from revisiting the relationships of the water-lilies. Cladistics 2024; 40:469-495. [PMID: 38761095 DOI: 10.1111/cla.12584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 04/27/2024] [Accepted: 04/29/2024] [Indexed: 05/20/2024] Open
Abstract
The water-lily clade represents the second earliest-diverging branch of angiosperms. Most of its species belong to Nymphaeaceae, of which the "core Nymphaeaceae"-comprising the genera Euryale, Nymphaea and Victoria-is the most diverse clade. Despite previous molecular phylogenetic studies on the core Nymphaeaceae, various aspects of their evolutionary relationships have remained unresolved. The length-variable introns and intergenic spacers are known to contain most of the sequence variability within the water-lily plastomes. Despite the challenges with multiple sequence alignment, any new molecular phylogenetic investigation on the core Nymphaeaceae should focus on these noncoding plastome regions. For example, a new plastid phylogenomic study on the core Nymphaeaceae should generate DNA sequence alignments of all plastid introns and intergenic spacers based on the principle of conserved sequence motifs. In this investigation, we revisit the phylogenetic history of the core Nymphaeaceae by employing such an approach. Specifically, we use a plastid phylogenomic analysis strategy in which all coding and noncoding partitions are separated and then undergo software-driven DNA sequence alignment, followed by a motif-based alignment inspection and adjustment. This approach allows us to increase the reliability of the character base compared to the default practice of aligning complete plastomes through software algorithms alone. Our approach produces significantly different phylogenetic tree reconstructions for several of the plastome regions under study. The results of these reconstructions underscore that Nymphaea is paraphyletic in its current circumscription, that each of the five subgenera of Nymphaea is monophyletic, and that the subgenus Nymphaea is sister to all other subgenera of Nymphaea. Our results also clarify many evolutionary relationships within the Nymphaea subgenera Brachyceras, Hydrocallis and Nymphaea. In closing, we discuss whether the phylogenetic reconstructions obtained through our motif-based alignment adjustments are in line with morphological evidence on water-lily evolution.
Collapse
Affiliation(s)
- Jessica A Roestel
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, Berlin, 14195, Germany
| | - John H Wiersema
- Department of Botany, National Museum of Natural History - Smithsonian Institution, Washington, DC, 37012, USA
| | - Robert K Jansen
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, 78712, USA
| | - Thomas Borsch
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, Berlin, 14195, Germany
- Botanischer Garten und Botanisches Museum Berlin, Freie Universität Berlin, 14195, Berlin, Germany
| | - Michael Gruenstaeudl
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, Berlin, 14195, Germany
- Department of Biological Sciences, Fort Hays State University, Hays, KS, 67601, USA
| |
Collapse
|
3
|
Hesham D, Mosaab A, Amer N, Al-Shehaby N, Magdeldin S, Hassan A, Georgiev H, Elshoky H, Rady M, Aisha KA, Sabet O, El-Naggar S. Epigenetic silencing of ZIC4 unveils a potential tumor suppressor role in pediatric choroid plexus carcinoma. Sci Rep 2024; 14:21293. [PMID: 39266576 PMCID: PMC11393135 DOI: 10.1038/s41598-024-71188-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 08/26/2024] [Indexed: 09/14/2024] Open
Abstract
Zic family member ZIC4 is a transcription factor that has been shown to be silenced in several cancers. However, understanding the regulation and function of ZIC4 in pediatric choroid plexus tumors (CPTs) remained limited. This study employed data mining and bioinformatics analysis to investigate the DNA methylation status of ZIC4 in CPTs and its correlation with patient survival. Our results unveiled ZIC4 methylation as a segregating factor, dividing CPT cohorts into two clusters, with hyper-methylation linked to adverse prognosis. Hyper-methylation of ZIC4 was confirmed in a choroid plexus carcinoma-derived cell line (CCHE-45) by bisulfite sequencing. Furthermore, our study demonstrated that demethylating agent and a histone methyltransferase inhibitor could reverse ZIC4 silencing. RNA sequencing and proteomic analysis showed that ZIC4 over-expression influenced genes and proteins involved in immune response, antigen processing and presentation, endoplasmic reticulum stress, and metabolism. Functionally, re-expressing ZIC4 negatively impacted cell proliferation and migration. Ultimately, these findings underscore ZIC4 hyper-methylation as a prognostic marker in CPTs and shed light on potential mechanisms underlying its tumor suppressor role in CPC. This insight paves the way for novel therapeutic targets in treating aggressive CPTs.
Collapse
Affiliation(s)
- Dina Hesham
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
- Microbiology, Immunology and Biotechnology Department, Faculty of Pharmacy and Biotechnology, German University in Cairo (GUC), Cairo, Egypt
| | - Amal Mosaab
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
| | - Nada Amer
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
- Microbiology, Immunology and Biotechnology Department, Faculty of Pharmacy and Biotechnology, German University in Cairo (GUC), Cairo, Egypt
| | - Nouran Al-Shehaby
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
| | - Sameh Magdeldin
- Proteomics and Metabolomics Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital Egypt 57357, Cairo, Egypt
- Department of Physiology, Faculty of Veterinary Medicine, Suez Canal University, Ismailia, Egypt
| | - Ahmed Hassan
- Institute of Immunology, Hannover Medical School, Hannover, Germany
| | - Hristo Georgiev
- Institute of Immunology, Hannover Medical School, Hannover, Germany
| | - Hisham Elshoky
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
| | - Mona Rady
- Microbiology, Immunology and Biotechnology Department, Faculty of Pharmacy and Biotechnology, German University in Cairo (GUC), Cairo, Egypt
- Faculty of Biotechnology, German International University, New Administrative Capital, Cairo, Egypt
| | - Khaled Abou Aisha
- Microbiology, Immunology and Biotechnology Department, Faculty of Pharmacy and Biotechnology, German University in Cairo (GUC), Cairo, Egypt
| | - Ola Sabet
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt
- Division of Immunology, University Children's Hospital Zurich, Zurich, Switzerland
| | - Shahenda El-Naggar
- Tumor Biology Research Program, Basic Research Unit, Research Department, Children's Cancer Hospital in Egypt 57357, 1 Sekket El Emam, El Madbah El Kadeem Yard, Sayeda Zeinab, Cairo, Egypt.
| |
Collapse
|
4
|
Taracha-Wisniewska A, Parks EGC, Miller M, Lipinska-Zubrycka L, Dworkin S, Wilanowski T. Vitamin D Receptor Regulates the Expression of the Grainyhead-Like 1 Gene. Int J Mol Sci 2024; 25:7913. [PMID: 39063155 PMCID: PMC11276664 DOI: 10.3390/ijms25147913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 06/28/2024] [Accepted: 07/11/2024] [Indexed: 07/28/2024] Open
Abstract
Vitamin D plays an important pleiotropic role in maintaining global homeostasis of the human body. Its functions go far beyond skeletal health, playing a crucial role in a plethora of cellular functions, as well as in extraskeletal health, ensuring the proper functioning of multiple human organs, including the skin. Genes from the Grainyhead-like (GRHL) family code for transcription factors necessary for the development and maintenance of various epithelia. Even though they are involved in many processes regulated by vitamin D, a direct link between vitamin D-mediated cellular pathways and GRHL genes has never been described. We employed various bioinformatic methods, quantitative real-time PCR, chromatin immunoprecipitation, reporter gene assays, and calcitriol treatments to investigate this issue. We report that the vitamin D receptor (VDR) binds to a regulatory region of the Grainyhead-like 1 (GRHL1) gene and regulates its expression. Ectopic expression of VDR and treatment with calcitriol alters the expression of the GRHL1 gene. The evidence presented here indicates a role of VDR in the regulation of expression of GRHL1 and correspondingly a role of GRHL1 in mediating the actions of vitamin D.
Collapse
Affiliation(s)
- Agnieszka Taracha-Wisniewska
- Faculty of Biology, Institute of Genetics and Biotechnology, University of Warsaw, 02-096 Warsaw, Poland; (A.T.-W.); (L.L.-Z.)
| | - Emma G. C. Parks
- Department of Physiology, Anatomy and Microbiology, La Trobe University, Melbourne, VIC 3086, Australia; (E.G.C.P.); (S.D.)
| | - Michal Miller
- Nencki Institute of Experimental Biology of Polish Academy of Sciences, 02-093 Warsaw, Poland;
| | - Lidia Lipinska-Zubrycka
- Faculty of Biology, Institute of Genetics and Biotechnology, University of Warsaw, 02-096 Warsaw, Poland; (A.T.-W.); (L.L.-Z.)
| | - Sebastian Dworkin
- Department of Physiology, Anatomy and Microbiology, La Trobe University, Melbourne, VIC 3086, Australia; (E.G.C.P.); (S.D.)
| | - Tomasz Wilanowski
- Faculty of Biology, Institute of Genetics and Biotechnology, University of Warsaw, 02-096 Warsaw, Poland; (A.T.-W.); (L.L.-Z.)
| |
Collapse
|
5
|
Jiménez S, Schreiber V, Mercier R, Gradwohl G, Molina N. Characterization of cell-fate decision landscapes by estimating transcription factor dynamics. CELL REPORTS METHODS 2023; 3:100512. [PMID: 37533652 PMCID: PMC10391345 DOI: 10.1016/j.crmeth.2023.100512] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 03/23/2023] [Accepted: 06/01/2023] [Indexed: 08/04/2023]
Abstract
Time-specific modulation of gene expression during differentiation by transcription factors promotes cell diversity. However, estimating their dynamic regulatory activity at the single-cell level and in a high-throughput manner remains challenging. We present FateCompass, an integrative approach that utilizes single-cell transcriptomics data to identify lineage-specific transcription factors throughout differentiation. By combining a probabilistic framework with RNA velocities or differentiation potential, we estimate transition probabilities, while a linear model of gene regulation is employed to compute transcription factor activities. Considering dynamic changes and correlations of expression and activities, FateCompass identifies lineage-specific regulators. Our validation using in silico data and application to pancreatic endocrine cell differentiation datasets highlight both known and potentially novel lineage-specific regulators. Notably, we uncovered undescribed transcription factors of an enterochromaffin-like population during in vitro differentiation toward ß-like cells. FateCompass provides a valuable framework for hypothesis generation, advancing our understanding of the gene regulatory networks driving cell-fate decisions.
Collapse
Affiliation(s)
- Sara Jiménez
- Université de Strasbourg, Strasbourg, France
- CNRS, UMR 7104, 67400 Illkirch, France
- INSERM, UMR-S 1258, 67400 Illkirch, France
- IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400 Illkirch, France
| | - Valérie Schreiber
- Université de Strasbourg, Strasbourg, France
- CNRS, UMR 7104, 67400 Illkirch, France
- INSERM, UMR-S 1258, 67400 Illkirch, France
- IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400 Illkirch, France
| | - Reuben Mercier
- Université de Strasbourg, Strasbourg, France
- CNRS, UMR 7104, 67400 Illkirch, France
- INSERM, UMR-S 1258, 67400 Illkirch, France
- IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400 Illkirch, France
| | - Gérard Gradwohl
- Université de Strasbourg, Strasbourg, France
- CNRS, UMR 7104, 67400 Illkirch, France
- INSERM, UMR-S 1258, 67400 Illkirch, France
- IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400 Illkirch, France
| | - Nacho Molina
- Université de Strasbourg, Strasbourg, France
- CNRS, UMR 7104, 67400 Illkirch, France
- INSERM, UMR-S 1258, 67400 Illkirch, France
- IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400 Illkirch, France
| |
Collapse
|
6
|
Katsantoni M, van Nimwegen E, Zavolan M. Improved analysis of (e)CLIP data with RCRUNCH yields a compendium of RNA-binding protein binding sites and motifs. Genome Biol 2023; 24:77. [PMID: 37069586 PMCID: PMC10108518 DOI: 10.1186/s13059-023-02913-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 03/29/2023] [Indexed: 04/19/2023] Open
Abstract
We present RCRUNCH, an end-to-end solution to CLIP data analysis for identification of binding sites and sequence specificity of RNA-binding proteins. RCRUNCH can analyze not only reads that map uniquely to the genome but also those that map to multiple genome locations or across splice boundaries and can consider various types of background in the estimation of read enrichment. By applying RCRUNCH to the eCLIP data from the ENCODE project, we have constructed a comprehensive and homogeneous resource of in-vivo-bound RBP sequence motifs. RCRUNCH automates the reproducible analysis of CLIP data, enabling studies of post-transcriptional control of gene expression.
Collapse
Affiliation(s)
- Maria Katsantoni
- Biozentrum, University of Basel, 4056, Basel, Switzerland.
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.
| | - Erik van Nimwegen
- Biozentrum, University of Basel, 4056, Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
| | - Mihaela Zavolan
- Biozentrum, University of Basel, 4056, Basel, Switzerland.
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.
| |
Collapse
|
7
|
Single-cell sortChIC identifies hierarchical chromatin dynamics during hematopoiesis. Nat Genet 2023; 55:333-345. [PMID: 36539617 PMCID: PMC9925381 DOI: 10.1038/s41588-022-01260-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Accepted: 11/01/2022] [Indexed: 12/24/2022]
Abstract
Post-translational histone modifications modulate chromatin activity to affect gene expression. How chromatin states underlie lineage choice in single cells is relatively unexplored. We develop sort-assisted single-cell chromatin immunocleavage (sortChIC) and map active (H3K4me1 and H3K4me3) and repressive (H3K27me3 and H3K9me3) histone modifications in the mouse bone marrow. During differentiation, hematopoietic stem and progenitor cells (HSPCs) acquire active chromatin states mediated by cell-type-specifying transcription factors, which are unique for each lineage. By contrast, most alterations in repressive marks during differentiation occur independent of the final cell type. Chromatin trajectory analysis shows that lineage choice at the chromatin level occurs at the progenitor stage. Joint profiling of H3K4me1 and H3K9me3 demonstrates that cell types within the myeloid lineage have distinct active chromatin but share similar myeloid-specific heterochromatin states. This implies a hierarchical regulation of chromatin during hematopoiesis: heterochromatin dynamics distinguish differentiation trajectories and lineages, while euchromatin dynamics reflect cell types within lineages.
Collapse
|
8
|
Yip CW, Hon CC, Yasuzawa K, Sivaraman DM, Ramilowski JA, Shibayama Y, Agrawal S, Prabhu AV, Parr C, Severin J, Lan YJ, Dostie J, Petri A, Nishiyori-Sueki H, Tagami M, Itoh M, López-Redondo F, Kouno T, Chang JC, Luginbühl J, Kato M, Murata M, Yip WH, Shu X, Abugessaisa I, Hasegawa A, Suzuki H, Kauppinen S, Yagi K, Okazaki Y, Kasukawa T, de Hoon M, Carninci P, Shin JW. Antisense-oligonucleotide-mediated perturbation of long non-coding RNA reveals functional features in stem cells and across cell types. Cell Rep 2022; 41:111893. [PMID: 36577377 DOI: 10.1016/j.celrep.2022.111893] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 08/30/2022] [Accepted: 12/07/2022] [Indexed: 12/28/2022] Open
Abstract
Within the scope of the FANTOM6 consortium, we perform a large-scale knockdown of 200 long non-coding RNAs (lncRNAs) in human induced pluripotent stem cells (iPSCs) and systematically characterize their roles in self-renewal and pluripotency. We find 36 lncRNAs (18%) exhibiting cell growth inhibition. From the knockdown of 123 lncRNAs with transcriptome profiling, 36 lncRNAs (29.3%) show molecular phenotypes. Integrating the molecular phenotypes with chromatin-interaction assays further reveals cis- and trans-interacting partners as potential primary targets. Additionally, cell-type enrichment analysis identifies lncRNAs associated with pluripotency, while the knockdown of LINC02595, CATG00000090305.1, and RP11-148B6.2 modulates colony formation of iPSCs. We compare our results with previously published fibroblasts phenotyping data and find that 2.9% of the lncRNAs exhibit a consistent cell growth phenotype, whereas we observe 58.3% agreement in molecular phenotypes. This highlights that molecular phenotyping is more comprehensive in revealing affected pathways.
Collapse
Affiliation(s)
- Chi Wai Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Chung-Chau Hon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Kayoko Yasuzawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Divya M Sivaraman
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Sree Chitra Tirunal Institute for Medical Sciences and Technology, Thiruvananthapuram, Kerala 695 011, India
| | - Jordan A Ramilowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Advanced Medical Research Center, Yokohama City University, Yokohama, Kanagawa 236-0004, Japan
| | - Youtaro Shibayama
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Anika V Prabhu
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Callum Parr
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Jessica Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Yan Jun Lan
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Josée Dostie
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, QC, Canada
| | - Andreas Petri
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, Copenhagen 2450, Denmark
| | | | - Michihira Tagami
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Masayoshi Itoh
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | | | - Tsukasa Kouno
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Jen-Chien Chang
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Joachim Luginbühl
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Masaki Kato
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Mitsuyoshi Murata
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Wing Hin Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Xufeng Shu
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Department of Computational Biology and Medical Sciences, The University of Tokyo, Kashiwa, Chiba, 277-8562, Japan
| | - Imad Abugessaisa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Akira Hasegawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Harukazu Suzuki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Sakari Kauppinen
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, Copenhagen 2450, Denmark
| | - Ken Yagi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Yasushi Okazaki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Takeya Kasukawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Michiel de Hoon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Human Technopole, via Rita Levi Montalcini 1, Milan, Italy
| | - Jay W Shin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore 138672, Singapore.
| |
Collapse
|
9
|
Iwasaki Y, Ikemura T, Wada K, Wada Y, Abe T. Comparative genomic analysis of the human genome and six bat genomes using unsupervised machine learning: Mb-level CpG and TFBS islands. BMC Genomics 2022; 23:497. [PMID: 35804296 PMCID: PMC9264310 DOI: 10.1186/s12864-022-08664-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 05/31/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Emerging infectious disease-causing RNA viruses, such as the SARS-CoV-2 and Ebola viruses, are thought to rely on bats as natural reservoir hosts. Since these zoonotic viruses pose a great threat to humans, it is important to characterize the bat genome from multiple perspectives. Unsupervised machine learning methods for extracting novel information from big sequence data without prior knowledge or particular models are highly desirable for obtaining unexpected insights. We previously established a batch-learning self-organizing map (BLSOM) of the oligonucleotide composition that reveals novel genome characteristics from big sequence data. RESULTS In this study, using the oligonucleotide BLSOM, we conducted a comparative genomic study of humans and six bat species. BLSOM is an explainable-type machine learning algorithm that reveals the diagnostic oligonucleotides contributing to sequence clustering (self-organization). When unsupervised machine learning reveals unexpected and/or characteristic features, these features can be studied in more detail via the much simpler and more direct standard distribution map method. Based on this combined strategy, we identified the Mb-level enrichment of CG dinucleotide (Mb-level CpG islands) around the termini of bat long-scaffold sequences. In addition, a class of CG-containing oligonucleotides were enriched in the centromeric and pericentromeric regions of human chromosomes. Oligonucleotides longer than tetranucleotides often represent binding motifs for a wide variety of proteins (e.g., transcription factor binding sequences (TFBSs)). By analyzing the penta- and hexanucleotide composition, we observed the evident enrichment of a wide range of hexanucleotide TFBSs in centromeric and pericentromeric heterochromatin regions on all human chromosomes. CONCLUSION Function of transcription factors (TFs) beyond their known regulation of gene expression (e.g., TF-mediated looping interactions between two different genomic regions) has received wide attention. The Mb-level TFBS and CpG islands are thought to be involved in the large-scale nuclear organization, such as centromere and telomere clustering. TFBSs, which are enriched in centromeric and pericentromeric heterochromatin regions, are thought to play an important role in the formation of nuclear 3D structures. Our machine learning-based analysis will help us to understand the differential features of nuclear 3D structures in the human and bat genomes.
Collapse
Affiliation(s)
- Yuki Iwasaki
- Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Tamura-cho 1266, Nagahama-shi, Shiga-ken, 526-0829, Japan
| | - Toshimichi Ikemura
- Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Tamura-cho 1266, Nagahama-shi, Shiga-ken, 526-0829, Japan.
| | - Kennosuke Wada
- Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Tamura-cho 1266, Nagahama-shi, Shiga-ken, 526-0829, Japan
| | - Yoshiko Wada
- Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Tamura-cho 1266, Nagahama-shi, Shiga-ken, 526-0829, Japan
| | - Takashi Abe
- Smart Information Systems, Faculty of Engineering, Niigata University, Niigata-ken, 950-2181, Japan.
| |
Collapse
|
10
|
Baranasic D, Hörtenhuber M, Balwierz PJ, Zehnder T, Mukarram AK, Nepal C, Várnai C, Hadzhiev Y, Jimenez-Gonzalez A, Li N, Wragg J, D'Orazio FM, Relic D, Pachkov M, Díaz N, Hernández-Rodríguez B, Chen Z, Stoiber M, Dong M, Stevens I, Ross SE, Eagle A, Martin R, Obasaju O, Rastegar S, McGarvey AC, Kopp W, Chambers E, Wang D, Kim HR, Acemel RD, Naranjo S, Łapiński M, Chong V, Mathavan S, Peers B, Sauka-Spengler T, Vingron M, Carninci P, Ohler U, Lacadie SA, Burgess SM, Winata C, van Eeden F, Vaquerizas JM, Gómez-Skarmeta JL, Onichtchouk D, Brown BJ, Bogdanovic O, van Nimwegen E, Westerfield M, Wardle FC, Daub CO, Lenhard B, Müller F. Multiomic atlas with functional stratification and developmental dynamics of zebrafish cis-regulatory elements. Nat Genet 2022; 54:1037-1050. [PMID: 35789323 PMCID: PMC9279159 DOI: 10.1038/s41588-022-01089-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 05/03/2022] [Indexed: 12/12/2022]
Abstract
Zebrafish, a popular organism for studying embryonic development and for modeling human diseases, has so far lacked a systematic functional annotation program akin to those in other animal models. To address this, we formed the international DANIO-CODE consortium and created a central repository to store and process zebrafish developmental functional genomic data. Our data coordination center ( https://danio-code.zfin.org ) combines a total of 1,802 sets of unpublished and re-analyzed published genomic data, which we used to improve existing annotations and show its utility in experimental design. We identified over 140,000 cis-regulatory elements throughout development, including classes with distinct features dependent on their activity in time and space. We delineated the distinct distance topology and chromatin features between regulatory elements active during zygotic genome activation and those active during organogenesis. Finally, we matched regulatory elements and epigenomic landscapes between zebrafish and mouse and predicted functional relationships between them beyond sequence similarity, thus extending the utility of zebrafish developmental genomics to mammals.
Collapse
Affiliation(s)
- Damir Baranasic
- MRC London Institute of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Matthias Hörtenhuber
- Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge, Sweden
| | - Piotr J Balwierz
- MRC London Institute of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Tobias Zehnder
- MRC London Institute of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
- Max Planck Institute for Molecular Genetics, Department of Computational Molecular Biology, Berlin, Germany
| | - Abdul Kadir Mukarram
- Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge, Sweden
| | - Chirag Nepal
- Biotech Research and Innovation Centre (BRIC), Department of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Csilla Várnai
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- Centre for Computational Biology, University of Birmingham, Birmingham, UK
| | - Yavor Hadzhiev
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Ada Jimenez-Gonzalez
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Nan Li
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Joseph Wragg
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Fabio M D'Orazio
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Dorde Relic
- Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Mikhail Pachkov
- Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Noelia Díaz
- Max Planck Institute for Molecular Biomedicine, Muenster, Germany
- Institute of Marine Sciences, Barcelona, Spain
| | | | - Zelin Chen
- Translational and Functional Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
- Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou, China
- CAS Key Laboratory of Tropical Marine Bio-Resources and Ecology, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, China
| | - Marcus Stoiber
- Environmental Genomics & Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michaël Dong
- Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge, Sweden
| | - Irene Stevens
- Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge, Sweden
| | - Samuel E Ross
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | - Anne Eagle
- Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | - Ryan Martin
- Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | - Oluwapelumi Obasaju
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Sepand Rastegar
- Institute of Biological and Chemical Systems - Biological Information Processing (IBCS-BIP), Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Alison C McGarvey
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Wolfgang Kopp
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Emily Chambers
- Sheffield Bioinformatics Core, Sheffield Institute of Translational Neuroscience, University of Sheffield, Sheffield, UK
| | - Dennis Wang
- Sheffield Bioinformatics Core, Sheffield Institute of Translational Neuroscience, University of Sheffield, Sheffield, UK
- Singapore Institute for Clinical Sciences, Singapore, Singapore
| | - Hyejeong R Kim
- Bateson Centre/Biomedical Science, University of Sheffield, Sheffield, UK
| | - Rafael D Acemel
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain
- Epigenetics and Sex Development Group, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Silvia Naranjo
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain
| | - Maciej Łapiński
- International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Vanessa Chong
- MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | | | - Bernard Peers
- Laboratory of Zebrafish Development and Disease Models (ZDDM), GIGA-R, SART TILMAN, University of Liège, Liège, Belgium
| | - Tatjana Sauka-Spengler
- MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Martin Vingron
- Max Planck Institute for Molecular Genetics, Department of Computational Molecular Biology, Berlin, Germany
| | - Piero Carninci
- Laboratory for Transcriptome Technology, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Fondazione Human Technopole, Milano, Italy
| | - Uwe Ohler
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
- Institute of Biology, Humboldt University, Berlin, Germany
| | - Scott Allen Lacadie
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Shawn M Burgess
- Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou, China
| | - Cecilia Winata
- International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Freek van Eeden
- Bateson Centre/Biomedical Science, University of Sheffield, Sheffield, UK
| | - Juan M Vaquerizas
- MRC London Institute of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
- Max Planck Institute for Molecular Biomedicine, Muenster, Germany
| | - José Luis Gómez-Skarmeta
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide-Junta de Andalucía, Seville, Spain
| | - Daria Onichtchouk
- Department of Developmental Biology, Signalling Research Centers BIOSS and CIBSS, University of Freiburg, Freiburg, Germany
| | - Ben James Brown
- Environmental Genomics & Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ozren Bogdanovic
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Erik van Nimwegen
- Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland
| | | | - Fiona C Wardle
- Randall Centre for Cell & Molecular Biophysics, Guy's Campus, King's College London, London, UK
| | - Carsten O Daub
- Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge, Sweden.
- Science for Life Laboratory, Solna, Sweden.
| | - Boris Lenhard
- MRC London Institute of Medical Sciences, London, UK.
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
| | - Ferenc Müller
- Institute of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK.
| |
Collapse
|
11
|
Riba A, Oravecz A, Durik M, Jiménez S, Alunni V, Cerciat M, Jung M, Keime C, Keyes WM, Molina N. Cell cycle gene regulation dynamics revealed by RNA velocity and deep-learning. Nat Commun 2022; 13:2865. [PMID: 35606383 PMCID: PMC9126911 DOI: 10.1038/s41467-022-30545-8] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Accepted: 05/06/2022] [Indexed: 11/15/2022] Open
Abstract
Despite the fact that the cell cycle is a fundamental process of life, a detailed quantitative understanding of gene regulation dynamics throughout the cell cycle is far from complete. Single-cell RNA-sequencing (scRNA-seq) technology gives access to these dynamics without externally perturbing the cell. Here, by generating scRNA-seq libraries in different cell systems, we observe cycling patterns in the unspliced-spliced RNA space of cell cycle-related genes. Since existing methods to analyze scRNA-seq are not efficient to measure cycling gene dynamics, we propose a deep learning approach (DeepCycle) to fit these patterns and build a high-resolution map of the entire cell cycle transcriptome. Characterizing the cell cycle in embryonic and somatic cells, we identify major waves of transcription during the G1 phase and systematically study the stages of the cell cycle. Our work will facilitate the study of the cell cycle in multiple cellular models and different biological contexts. Single-cell RNA-sequencing technology gives access to cell cycle dynamics without externally perturbing the cell. Here the authors present DeepCycle,a robust deep learning method to infer the cell cycle state in single cells from scRNA-seq data.
Collapse
|
12
|
Liu D, Liu W, Chen X, Yin J, Ma L, Liu M, Zhou X, Xian L, Li P, Tan X, Zhao J, Liao Y, Cao G. circKCNN2 suppresses the recurrence of hepatocellular carcinoma at least partially via regulating miR-520c-3p/methyl-DNA-binding domain protein 2 axis. Clin Transl Med 2022; 12:e662. [PMID: 35051313 PMCID: PMC8775140 DOI: 10.1002/ctm2.662] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 11/10/2021] [Accepted: 11/15/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Recurrence is the major cause of hepatocellular carcinoma (HCC) death. We aimed to identify circular RNA (circRNA) with predictive and therapeutic value for recurrent HCC. METHODS Tissue samples from recurrent and non-recurrent HCC patients were subjected to circRNA sequencing and transcriptome sequencing. circKCNN2 was identified through multi-omics analyses. The effects of circKCNN2 on HCC were evaluated in cells, animals, database of The Cancer Genome Atlas, and a cohort with 130 HCC patients. circRNA precipitation, chromatin immunoprecipitation assay, RNA pull-down, luciferase assay, and cell experiments were applied to evaluate the interaction of circKCNN2 with miRNAs and proteins. The association between circKCNN2 and the therapeutic effect of lenvatinib was investigated in HCC cell lines and HCC tissue-derived organoids. RESULTS The expression of circKCNN2 was downregulated in HCC tissues and predicted a favorable overall survival and recurrence-free survival. The expression of circKCNN2 was positively correlated with the parental gene, potassium calcium-activated channel subfamily N member (KCNN2). Nuclear transcription factor Y subunit alpha (NFYA) was proven to inhibit the promoter activity of KCNN2, downregulate the expression of KCNN2 and circKCNN2, and predict an unfavorable recurrence-free survival. Ectopic expression of circKCNN2 inhibited HCC cell proliferation, colony formation, migration, and tumor formation in a mouse model. miR-520c-3p sponged by circKCNN2 could reverse the inhibitory effect of circKCNN2 on HCC cells and down-regulate the expression of methyl-DNA-binding domain protein 2 (MBD2). The intratumoral expression of MBD2 predicted a favorable recurrence-free survival. circKCNN2 down-regulated the expression of fibroblast growth factor receptor 4 (FGFR4), which can be reversed by miR-520c-3p and knockdown of MBD2. Lenvatinib inhibited the expression of FGFR4 and upregulated the expression of circKCNN2 and MBD2. Ectopic expression of circKCNN2 in HCC cells enhanced the therapeutic effect of lenvatinib. However, the high inherent level of circKCNN2 in HCC cells was associated with lenvatinib resistance. CONCLUSIONS circKCNN2, transcriptionally repressed by NFYA, suppresses HCC recurrence via the miR-520c-3p/MBD2 axis. Inherent level of circKCNN2 in HCC cells predisposes anti-tumor effect of lenvatinib possibly because both circKCNN2 and lenvatinib repress the expression of FGFR4. circKCNN2 may be a promising predictive biomarker and therapeutic agent for HCC recurrence.
Collapse
Affiliation(s)
- Donghong Liu
- Key Laboratory of Molecular Biology for Infectious DiseasesMinistry of EducationChongqing Medical UniversityChongqingChina
- Institute for Viral HepatitisChongqing Medical UniversityChongqingChina
- Department of Infectious Diseasesthe Second Affiliated HospitalChongqing Medical UniversityChongqingChina
| | - Wenbin Liu
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Xi Chen
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Jianhua Yin
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Longteng Ma
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Mei Liu
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Xinyu Zhou
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Linfeng Xian
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Peng Li
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Xiaojie Tan
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| | - Jun Zhao
- Department of Hepatic SurgeryEastern Hepatobiliary Surgery HospitalSecond Military Medical UniversityShanghaiChina
| | - Yong Liao
- Key Laboratory of Molecular Biology for Infectious DiseasesMinistry of EducationChongqing Medical UniversityChongqingChina
- Institute for Viral HepatitisChongqing Medical UniversityChongqingChina
- Department of Infectious Diseasesthe Second Affiliated HospitalChongqing Medical UniversityChongqingChina
| | - Guangwen Cao
- Department of EpidemiologySecond Military Medical UniversityShanghaiChina
| |
Collapse
|
13
|
Matthews BJ, Melia T, Waxman DJ. Harnessing natural variation to identify cis regulators of sex-biased gene expression in a multi-strain mouse liver model. PLoS Genet 2021; 17:e1009588. [PMID: 34752452 PMCID: PMC8664386 DOI: 10.1371/journal.pgen.1009588] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 12/10/2021] [Accepted: 10/27/2021] [Indexed: 12/13/2022] Open
Abstract
Sex differences in gene expression are widespread in the liver, where many autosomal factors act in tandem with growth hormone signaling to regulate individual variability of sex differences in liver metabolism and disease. Here, we compare hepatic transcriptomic and epigenetic profiles of mouse strains C57BL/6J and CAST/EiJ, representing two subspecies separated by 0.5-1 million years of evolution, to elucidate the actions of genetic factors regulating liver sex differences. We identify 144 protein coding genes and 78 lncRNAs showing strain-conserved sex bias; many have gene ontologies relevant to liver function, are more highly liver-specific and show greater sex bias, and are more proximally regulated than genes whose sex bias is strain-dependent. The strain-conserved genes include key growth hormone-dependent transcriptional regulators of liver sex bias; however, three other transcription factors, Trim24, Tox, and Zfp809, lose their sex-biased expression in CAST/EiJ mouse liver. To elucidate the observed strain specificities in expression, we characterized the strain-dependence of sex-biased chromatin opening and enhancer marks at cis regulatory elements (CREs) within expression quantitative trait loci (eQTL) regulating liver sex-biased genes. Strikingly, 208 of 286 eQTLs with strain-specific, sex-differential effects on expression were associated with a complete gain, loss, or reversal of the sex differences in expression between strains. Moreover, 166 of the 286 eQTLs were linked to the strain-dependent gain or loss of localized sex-biased CREs. Remarkably, a subset of these CREs apparently lacked strain-specific genetic variants yet showed coordinated, strain-dependent sex-biased epigenetic regulation. Thus, we directly link hundreds of strain-specific genetic variants to the high variability in CRE activity and expression of sex-biased genes and uncover underlying genetically-determined epigenetic states controlling liver sex bias in genetically diverse mouse populations.
Collapse
Affiliation(s)
- Bryan J. Matthews
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
| | - Tisha Melia
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| | - David J. Waxman
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| |
Collapse
|
14
|
Mason MRJ, Erp S, Wolzak K, Behrens A, Raivich G, Verhaagen J. The Jun-dependent axon regeneration gene program: Jun promotes regeneration over plasticity. Hum Mol Genet 2021; 31:1242-1262. [PMID: 34718572 PMCID: PMC9029231 DOI: 10.1093/hmg/ddab315] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 10/13/2021] [Accepted: 10/25/2021] [Indexed: 11/25/2022] Open
Abstract
The regeneration-associated gene (RAG) expression program is activated in injured peripheral neurons after axotomy and enables long-distance axon re-growth. Over 1000 genes are regulated, and many transcription factors are upregulated or activated as part of this response. However, a detailed picture of how RAG expression is regulated is lacking. In particular, the transcriptional targets and specific functions of the various transcription factors are unclear. Jun was the first-regeneration-associated transcription factor identified and the first shown to be functionally important. Here we fully define the role of Jun in the RAG expression program in regenerating facial motor neurons. At 1, 4 and 14 days after axotomy, Jun upregulates 11, 23 and 44% of the RAG program, respectively. Jun functions relevant to regeneration include cytoskeleton production, metabolic functions and cell activation, and the downregulation of neurotransmission machinery. In silico analysis of promoter regions of Jun targets identifies stronger over-representation of AP1-like sites than CRE-like sites, although CRE sites were also over-represented in regions flanking AP1 sites. Strikingly, in motor neurons lacking Jun, an alternative SRF-dependent gene expression program is initiated after axotomy. The promoters of these newly expressed genes exhibit over-representation of CRE sites in regions near to SRF target sites. This alternative gene expression program includes plasticity-associated transcription factors and leads to an aberrant early increase in synapse density on motor neurons. Jun thus has the important function in the early phase after axotomy of pushing the injured neuron away from a plasticity response and towards a regenerative phenotype.
Collapse
Affiliation(s)
- Matthew R J Mason
- Laboratory for Regeneration of Sensorimotor Systems, Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences (KNAW), Meibergdreef 47, 1105, BA, Amsterdam, The Netherlands
| | - Susan Erp
- Laboratory for Regeneration of Sensorimotor Systems, Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences (KNAW), Meibergdreef 47, 1105, BA, Amsterdam, The Netherlands
| | - Kim Wolzak
- Laboratory for Regeneration of Sensorimotor Systems, Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences (KNAW), Meibergdreef 47, 1105, BA, Amsterdam, The Netherlands
| | - Axel Behrens
- Adult Stem Cell Laboratory, The Francis Crick Institute, London, NW1 1AT, United Kingdom
| | - Gennadij Raivich
- UCL Institute for Women's Health, Maternal and Fetal Medicine, Perinatal Brain Repair Group, London, WC1E 6HX, United Kingdom
| | - Joost Verhaagen
- Laboratory for Regeneration of Sensorimotor Systems, Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences (KNAW), Meibergdreef 47, 1105, BA, Amsterdam, The Netherlands.,Center for Neurogenomics and Cognition Research, Neuroscience Campus Amsterdam, Vrije Universiteit Amsterdam, 1081HV, Amsterdam, The Netherlands
| |
Collapse
|
15
|
Sarnataro S, Riba A, Molina N. Regulation of transcription reactivation dynamics exiting mitosis. PLoS Comput Biol 2021; 17:e1009354. [PMID: 34606497 PMCID: PMC8516288 DOI: 10.1371/journal.pcbi.1009354] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 10/14/2021] [Accepted: 08/17/2021] [Indexed: 11/23/2022] Open
Abstract
Proliferating cells experience a global reduction of transcription during mitosis, yet their cell identity is maintained and regulatory information is propagated from mother to daughter cells. Mitotic bookmarking by transcription factors has been proposed as a potential mechanism to ensure the reactivation of transcription at the proper set of genes exiting mitosis. Recently, mitotic transcription and waves of transcription reactivation have been observed in synchronized populations of human hepatoma cells. However, the study did not consider that mitotic-arrested cell populations progressively desynchronize leading to measurements of gene expression on a mixture of cells at different internal cell-cycle times. Moreover, it is not well understood yet what is the precise role of mitotic bookmarking on mitotic transcription as well as on the transcription reactivation waves. Ultimately, the core gene regulatory network driving the precise transcription reactivation dynamics remains to be identified. To address these questions, we developed a mathematical model to correct for the progressive desynchronization of cells and estimate gene expression dynamics with respect to a cell-cycle pseudotime. Furthermore, we used a multiple linear regression model to infer transcription factor activity dynamics. Our analysis allows us to characterize waves of transcription factor activities exiting mitosis and predict a core gene regulatory network responsible of the transcription reactivation dynamics. Moreover, we identified more than 60 transcription factors that are highly active during mitosis and represent new candidates of mitotic bookmarking factors which could be relevant therapeutic targets to control cell proliferation. Specific gene expression patterns confer particular identities to cells. During proliferation, cells undergo mitosis when chromosomes are formed and segregated into two new cells leading to a global downregulation of gene expression. Yet, cell identity is propagated from mother to daughter cells by the reactivation of gene expression at the appropriate set of genes once mitosis is completed. Mitotic bookmarking has been proposed as a mechanism to regulate this process. Indeed certain regulatory factors tag genes during mitosis to promote gene reactivation in the next cycle. Here we analyze gene expression over time measured on synchronized cell populations by using a new generation sequencing technique. To do so, we proposed a mathematical model to obtain the exact gene expression dynamics with respect to the cell-cycle progression and identified waves of genes reactivation during mitosis and the transition to the next cycle. Also, we developed a computational method that allowed us to predict key regulatory factors that drive this process and predict new candidates that could be involved in mitotic bookmarking. These regulatory factors could be relevant therapeutic targets to control cell proliferation.
Collapse
Affiliation(s)
- Sergio Sarnataro
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) Université de Strasbourg – CNRS – INSERM, Illkirch, France
| | - Andrea Riba
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) Université de Strasbourg – CNRS – INSERM, Illkirch, France
| | - Nacho Molina
- Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) Université de Strasbourg – CNRS – INSERM, Illkirch, France
- * E-mail:
| |
Collapse
|
16
|
Kotarba G, Taracha-Wisniewska A, Miller M, Dabrowski M, Wilanowski T. Transcription factors Krüppel-like factor 4 and paired box 5 regulate the expression of the Grainyhead-like genes. PLoS One 2021; 16:e0257977. [PMID: 34570823 PMCID: PMC8476022 DOI: 10.1371/journal.pone.0257977] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 09/14/2021] [Indexed: 12/15/2022] Open
Abstract
Genes from the Grainyhead-like (GRHL) family code for transcription factors necessary for the development and maintenance of various epithelia. These genes are also very important in the development of many types of cancer. However, little is known about the regulation of expression of GRHL genes. Previously, there were no systematic analyses of the promoters of GRHL genes or transcription factors that bind to these promoters. Here we report that the Krüppel-like factor 4 (KLF4) and the paired box 5 factor (PAX5) bind to the regulatory regions of the GRHL genes and regulate their expression. Ectopic expression of KLF4 or PAX5 alters the expression of GRHL genes. In KLF4-overexpressing HEK293 cells, the expression of GRHL1 and GRHL3 genes was upregulated by 32% and 60%, respectively, whereas the mRNA level of GRHL2 gene was lowered by 28% when compared to the respective controls. The levels of GRHL1 and GRHL3 expression were decreased by 30% or 33% in PAX5-overexpressing HEK293 cells. The presence of minor frequency allele of single nucleotide polymorphism rs115898376 in the promoter of the GRHL1 gene affected the binding of KLF4 to this site. The evidence presented here suggests an important role of KLF4 and PAX5 in the regulation of expression of GRHL1-3 genes.
Collapse
Affiliation(s)
- Grzegorz Kotarba
- Faculty of Biology, Institute of Genetics and Biotechnology, University of Warsaw, Warsaw, Poland
| | | | - Michal Miller
- Laboratory of Bioinformatics, Nencki Institute of Experimental Biology of Polish Academy of Sciences, Warsaw, Poland
| | - Michal Dabrowski
- Laboratory of Bioinformatics, Nencki Institute of Experimental Biology of Polish Academy of Sciences, Warsaw, Poland
| | - Tomasz Wilanowski
- Faculty of Biology, Institute of Genetics and Biotechnology, University of Warsaw, Warsaw, Poland
- * E-mail:
| |
Collapse
|
17
|
Singh G, Mullany S, Moorthy SD, Zhang R, Mehdi T, Tian R, Duncan AG, Moses AM, Mitchell JA. A flexible repertoire of transcription factor binding sites and a diversity threshold determines enhancer activity in embryonic stem cells. Genome Res 2021; 31:564-575. [PMID: 33712417 PMCID: PMC8015845 DOI: 10.1101/gr.272468.120] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/19/2021] [Indexed: 12/28/2022]
Abstract
Transcriptional enhancers are critical for development and phenotype evolution and are often mutated in disease contexts; however, even in well-studied cell types, the sequence code conferring enhancer activity remains unknown. To examine the enhancer regulatory code for pluripotent stem cells, we identified genomic regions with conserved binding of multiple transcription factors in mouse and human embryonic stem cells (ESCs). Examination of these regions revealed that they contain on average 12.6 conserved transcription factor binding site (TFBS) sequences. Enriched TFBSs are a diverse repertoire of 70 different sequences representing the binding sequences of both known and novel ESC regulators. Using a diverse set of TFBSs from this repertoire was sufficient to construct short synthetic enhancers with activity comparable to native enhancers. Site-directed mutagenesis of conserved TFBSs in endogenous enhancers or TFBS deletion from synthetic sequences revealed a requirement for 10 or more different TFBSs. Furthermore, specific TFBSs, including the POU5F1:SOX2 comotif, are dispensable, despite cobinding the POU5F1 (also known as OCT4), SOX2, and NANOG master regulators of pluripotency. These findings reveal that a TFBS sequence diversity threshold overrides the need for optimized regulatory grammar and individual TFBSs that recruit specific master regulators.
Collapse
Affiliation(s)
- Gurdeep Singh
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Shanelle Mullany
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Sakthi D Moorthy
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Richard Zhang
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Tahmid Mehdi
- Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada
| | - Ruxiao Tian
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Andrew G Duncan
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| | - Alan M Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada.,Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B3, Canada
| | - Jennifer A Mitchell
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada
| |
Collapse
|
18
|
Abugessaisa I, Ramilowski JA, Lizio M, Severin J, Hasegawa A, Harshbarger J, Kondo A, Noguchi S, Yip CW, Ooi J, Tagami M, Hori F, Agrawal S, Hon C, Cardon M, Ikeda S, Ono H, Bono H, Kato M, Hashimoto K, Bonetti A, Kato M, Kobayashi N, Shin J, de Hoon M, Hayashizaki Y, Carninci P, Kawaji H, Kasukawa T. FANTOM enters 20th year: expansion of transcriptomic atlases and functional annotation of non-coding RNAs. Nucleic Acids Res 2021; 49:D892-D898. [PMID: 33211864 PMCID: PMC7779024 DOI: 10.1093/nar/gkaa1054] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/16/2020] [Accepted: 10/21/2020] [Indexed: 11/15/2022] Open
Abstract
The Functional ANnoTation Of the Mammalian genome (FANTOM) Consortium has continued to provide extensive resources in the pursuit of understanding the transcriptome, and transcriptional regulation, of mammalian genomes for the last 20 years. To share these resources with the research community, the FANTOM web-interfaces and databases are being regularly updated, enhanced and expanded with new data types. In recent years, the FANTOM Consortium's efforts have been mainly focused on creating new non-coding RNA datasets and resources. The existing FANTOM5 human and mouse miRNA atlas was supplemented with rat, dog, and chicken datasets. The sixth (latest) edition of the FANTOM project was launched to assess the function of human long non-coding RNAs (lncRNAs). From its creation until 2020, FANTOM6 has contributed to the research community a large dataset generated from the knock-down of 285 lncRNAs in human dermal fibroblasts; this is followed with extensive expression profiling and cellular phenotyping. Other updates to the FANTOM resource includes the reprocessing of the miRNA and promoter atlases of human, mouse and chicken with the latest reference genome assemblies. To facilitate the use and accessibility of all above resources we further enhanced FANTOM data viewers and web interfaces. The updated FANTOM web resource is publicly available at https://fantom.gsc.riken.jp/.
Collapse
Affiliation(s)
- Imad Abugessaisa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Jordan A Ramilowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
- Advanced Medical Research Center, Yokohama City University, Kanagawa, Japan
| | - Marina Lizio
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Jesicca Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Akira Hasegawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Jayson Harshbarger
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Atsushi Kondo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Shuhei Noguchi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Chi Wai Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | | | - Michihira Tagami
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Fumi Hori
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Chung Chau Hon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Melissa Cardon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Shuya Ikeda
- Database Center for Life Science, Research Organization of Information and Systems, Mishima, Shizuoka, Japan
| | - Hiromasa Ono
- Database Center for Life Science, Research Organization of Information and Systems, Mishima, Shizuoka, Japan
| | - Hidemasa Bono
- Database Center for Life Science, Research Organization of Information and Systems, Mishima, Shizuoka, Japan
- Program of Biomedical Science, Graduate School of Integrated Sciences for Life, Hiroshima University
| | - Masaki Kato
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Kosuke Hashimoto
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Alessandro Bonetti
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
- Karolinska Institutet, Stockholm, Sweden
| | - Masaki Kato
- RIKEN Head Office for Information Systems and Cybersecurity, Wako, Saitama, Japan
| | - Norio Kobayashi
- RIKEN Head Office for Information Systems and Cybersecurity, Wako, Saitama, Japan
| | - Jay Shin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Michiel de Hoon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | | | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Hideya Kawaji
- Correspondence may also be addressed to Hideya Kawaji.
| | - Takeya Kasukawa
- To whom correspondence should be addressed. Tel: +81 45 503 9222; Fax: +81 45 503 9219;
| |
Collapse
|
19
|
Weger M, Alpern D, Cherix A, Ghosal S, Grosse J, Russeil J, Gruetter R, de Kloet ER, Deplancke B, Sandi C. Mitochondrial gene signature in the prefrontal cortex for differential susceptibility to chronic stress. Sci Rep 2020; 10:18308. [PMID: 33110158 PMCID: PMC7591539 DOI: 10.1038/s41598-020-75326-9] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Accepted: 10/14/2020] [Indexed: 02/07/2023] Open
Abstract
Mitochondrial dysfunction was highlighted as a crucial vulnerability factor for the development of depression. However, systemic studies assessing stress-induced changes in mitochondria-associated genes in brain regions relevant to depression symptomatology remain scarce. Here, we performed a genome-wide transcriptomic study to examine mitochondrial gene expression in the prefrontal cortex (PFC) and nucleus accumbens (NAc) of mice exposed to multimodal chronic restraint stress. We identified mitochondria-associated gene pathways as most prominently affected in the PFC and with lesser significance in the NAc. A more detailed mitochondrial gene expression analysis revealed that in particular mitochondrial DNA-encoded subunits of the oxidative phosphorylation complexes were altered in the PFC. The comparison of our data with a reanalyzed transcriptome data set of chronic variable stress mice and major depression disorder subjects showed that the changes in mitochondrial DNA-encoded genes are a feature generalizing to other chronic stress-protocols as well and might have translational relevance. Finally, we provide evidence for changes in mitochondrial outputs in the PFC following chronic stress that are indicative of mitochondrial dysfunction. Collectively, our work reinforces the idea that changes in mitochondrial gene expression are key players in the prefrontal adaptations observed in individuals with high behavioral susceptibility and resilience to chronic stress.
Collapse
Affiliation(s)
- Meltem Weger
- Laboratory of Behavioral Genetics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland.,Institute for Molecular Bioscience, The University of Queensland, St. Lucia, QLD, 4072, Australia
| | - Daniel Alpern
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
| | - Antoine Cherix
- Laboratory for Functional and Metabolic Imaging, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland.,Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, England, UK
| | - Sriparna Ghosal
- Laboratory of Behavioral Genetics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
| | - Jocelyn Grosse
- Laboratory of Behavioral Genetics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
| | - Julie Russeil
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
| | - Rolf Gruetter
- Laboratory for Functional and Metabolic Imaging, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
| | - E Ronald de Kloet
- Departement of Endocrinology and Metabolic Disease, Leiden University Medical Center, Leiden, The Netherlands
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
| | - Carmen Sandi
- Laboratory of Behavioral Genetics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland.
| |
Collapse
|
20
|
Alam T, Agrawal S, Severin J, Young RS, Andersson R, Arner E, Hasegawa A, Lizio M, Ramilowski JA, Abugessaisa I, Ishizu Y, Noma S, Tarui H, Taylor MS, Lassmann T, Itoh M, Kasukawa T, Kawaji H, Marchionni L, Sheng G, R R Forrest A, Khachigian LM, Hayashizaki Y, Carninci P, de Hoon MJL. Comparative transcriptomics of primary cells in vertebrates. Genome Res 2020; 30:951-961. [PMID: 32718981 PMCID: PMC7397866 DOI: 10.1101/gr.255679.119] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 06/09/2020] [Indexed: 12/18/2022]
Abstract
Gene expression profiles in homologous tissues have been observed to be different between species, which may be due to differences between species in the gene expression program in each cell type, but may also reflect differences in cell type composition of each tissue in different species. Here, we compare expression profiles in matching primary cells in human, mouse, rat, dog, and chicken using Cap Analysis Gene Expression (CAGE) and short RNA (sRNA) sequencing data from FANTOM5. While we find that expression profiles of orthologous genes in different species are highly correlated across cell types, in each cell type many genes were differentially expressed between species. Expression of genes with products involved in transcription, RNA processing, and transcriptional regulation was more likely to be conserved, while expression of genes encoding proteins involved in intercellular communication was more likely to have diverged during evolution. Conservation of expression correlated positively with the evolutionary age of genes, suggesting that divergence in expression levels of genes critical for cell function was restricted during evolution. Motif activity analysis showed that both promoters and enhancers are activated by the same transcription factors in different species. An analysis of expression levels of mature miRNAs and of primary miRNAs identified by CAGE revealed that evolutionary old miRNAs are more likely to have conserved expression patterns than young miRNAs. We conclude that key aspects of the regulatory network are conserved, while differential expression of genes involved in cell-to-cell communication may contribute greatly to phenotypic differences between species.
Collapse
Affiliation(s)
- Tanvir Alam
- College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar
| | - Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Jessica Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Robert S Young
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh EH8 9AG, United Kingdom.,MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Robin Andersson
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Erik Arner
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Akira Hasegawa
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Marina Lizio
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | | | - Imad Abugessaisa
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Yuri Ishizu
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama 230-0045, Japan
| | - Shohei Noma
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Hiroshi Tarui
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama 230-0045, Japan
| | - Martin S Taylor
- MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Timo Lassmann
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan.,Telethon Kids Institute, University of Western Australia, Perth, WA 6009, Australia
| | - Masayoshi Itoh
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako 351-0198, Japan
| | - Takeya Kasukawa
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Hideya Kawaji
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan.,RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako 351-0198, Japan
| | - Luigi Marchionni
- Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21287, USA
| | - Guojun Sheng
- International Research Center for Medical Sciences (IRCMS), Kumamoto University, Kumamoto 860-0811, Japan
| | - Alistair R R Forrest
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan.,Harry Perkins Institute of Medical Research, and the Centre for Medical Research, University of Western Australia, QEII Medical Centre, Perth, WA 6009, Australia
| | - Levon M Khachigian
- Vascular Biology and Translational Research, School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW 2052 Australia
| | | | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | | |
Collapse
|
21
|
Discovery of Transcription Factors Novel to Mouse Cerebellar Granule Cell Development Through Laser-Capture Microdissection. THE CEREBELLUM 2019; 17:308-325. [PMID: 29307116 DOI: 10.1007/s12311-017-0912-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Laser-capture microdissection was used to isolate external germinal layer tissue from three developmental periods of mouse cerebellar development: embryonic days 13, 15, and 18. The cerebellar granule cell-enriched mRNA library was generated with next-generation sequencing using the Helicos technology. Our objective was to discover transcriptional regulators that could be important for the development of cerebellar granule cells-the most numerous neuron in the central nervous system. Through differential expression analysis, we have identified 82 differentially expressed transcription factors (TFs) from a total of 1311 differentially expressed genes. In addition, with TF-binding sequence analysis, we have identified 46 TF candidates that could be key regulators responsible for the variation in the granule cell transcriptome between developmental stages. Altogether, we identified 125 potential TFs (82 from differential expression analysis, 46 from motif analysis with 3 overlaps in the two sets). From this gene set, 37 TFs are considered novel due to the lack of previous knowledge about their roles in cerebellar development. The results from transcriptome-wide analyses were validated with existing online databases, qRT-PCR, and in situ hybridization. This study provides an initial insight into the TFs of cerebellar granule cells that might be important for development and provide valuable information for further functional studies on these transcriptional regulators.
Collapse
|
22
|
Berger S, Pachkov M, Arnold P, Omidi S, Kelley N, Salatino S, van Nimwegen E. Crunch: integrated processing and modeling of ChIP-seq data in terms of regulatory motifs. Genome Res 2019; 29:1164-1177. [PMID: 31138617 PMCID: PMC6633267 DOI: 10.1101/gr.239319.118] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 05/14/2019] [Indexed: 01/10/2023]
Abstract
Although ChIP-seq has become a routine experimental approach for quantitatively characterizing the genome-wide binding of transcription factors (TFs), computational analysis procedures remain far from standardized, making it difficult to compare ChIP-seq results across experiments. In addition, although genome-wide binding patterns must ultimately be determined by local constellations of DNA-binding sites, current analysis is typically limited to identifying enriched motifs in ChIP-seq peaks. Here we present Crunch, a completely automated computational method that performs all ChIP-seq analysis from quality control through read mapping and peak detecting and that integrates comprehensive modeling of the ChIP signal in terms of known and novel binding motifs, quantifying the contribution of each motif and annotating which combinations of motifs explain each binding peak. By applying Crunch to 128 data sets from the ENCODE Project, we show that Crunch outperforms current peak finders and find that TFs naturally separate into "solitary TFs," for which a single motif explains the ChIP-peaks, and "cobinding TFs," for which multiple motifs co-occur within peaks. Moreover, for most data sets, the motifs that Crunch identified de novo outperform known motifs, and both the set of cobinding motifs and the top motif of solitary TFs are consistent across experiments and cell lines. Crunch is implemented as a web server, enabling standardized analysis of any collection of ChIP-seq data sets by simply uploading raw sequencing data. Results are provided both in a graphical web interface and as downloadable files.
Collapse
Affiliation(s)
- Severin Berger
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Mikhail Pachkov
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Phil Arnold
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Saeed Omidi
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Nicholas Kelley
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Silvia Salatino
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | - Erik van Nimwegen
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| |
Collapse
|
23
|
Weger BD, Gobet C, Yeung J, Martin E, Jimenez S, Betrisey B, Foata F, Berger B, Balvay A, Foussier A, Charpagne A, Boizet-Bonhoure B, Chou CJ, Naef F, Gachon F. The Mouse Microbiome Is Required for Sex-Specific Diurnal Rhythms of Gene Expression and Metabolism. Cell Metab 2019; 29:362-382.e8. [PMID: 30344015 PMCID: PMC6370974 DOI: 10.1016/j.cmet.2018.09.023] [Citation(s) in RCA: 199] [Impact Index Per Article: 33.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Revised: 06/27/2018] [Accepted: 09/25/2018] [Indexed: 02/08/2023]
Abstract
The circadian clock and associated feeding rhythms have a profound impact on metabolism and the gut microbiome. To what extent microbiota reciprocally affect daily rhythms of physiology in the host remains elusive. Here, we analyzed transcriptome and metabolome profiles of male and female germ-free mice. While mRNA expression of circadian clock genes revealed subtle changes in liver, intestine, and white adipose tissue, germ-free mice showed considerably altered expression of genes associated with rhythmic physiology. Strikingly, the absence of the microbiome attenuated liver sexual dimorphism and sex-specific rhythmicity. The resulting feminization of male and masculinization of female germ-free animals is likely caused by altered sexual development and growth hormone secretion, associated with differential activation of xenobiotic receptors. This defines a novel mechanism by which the microbiome regulates host metabolism.
Collapse
Affiliation(s)
- Benjamin D Weger
- Department of Diabetes and Circadian Rhythms, Nestlé Institute of Health Sciences, 1015 Lausanne, Switzerland
| | - Cédric Gobet
- Department of Diabetes and Circadian Rhythms, Nestlé Institute of Health Sciences, 1015 Lausanne, Switzerland; Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Jake Yeung
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Eva Martin
- Department of Diabetes and Circadian Rhythms, Nestlé Institute of Health Sciences, 1015 Lausanne, Switzerland
| | - Sonia Jimenez
- Department of Diabetes and Circadian Rhythms, Nestlé Institute of Health Sciences, 1015 Lausanne, Switzerland
| | - Bertrand Betrisey
- Cellular Metabolism, Department of Cell Biology, Nestlé Institute of Health Sciences, Nestlé Research, 1015 Lausanne, Switzerland
| | - Francis Foata
- Host-Microbe Interaction, Department of Gastro-Intestinal Health, Nestlé Institute of Health Sciences, Nestlé Research, 1000 Lausanne, Switzerland
| | - Bernard Berger
- Host-Microbe Interaction, Department of Gastro-Intestinal Health, Nestlé Institute of Health Sciences, Nestlé Research, 1000 Lausanne, Switzerland
| | - Aurélie Balvay
- Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, 78350 Jouy-en-Josas, France
| | - Anne Foussier
- Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, 78350 Jouy-en-Josas, France
| | - Aline Charpagne
- Genomics, Department of Multi-Omics, Nestlé Institute of Health Sciences, Nestlé Research, 1015 Lausanne, Switzerland
| | - Brigitte Boizet-Bonhoure
- Institut de Génétique Humaine, CNRS-Université de Montpellier UMR9002, 34396 Montpellier, France
| | - Chieh Jason Chou
- Host-Microbe Interaction, Department of Gastro-Intestinal Health, Nestlé Institute of Health Sciences, Nestlé Research, 1000 Lausanne, Switzerland
| | - Felix Naef
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Frédéric Gachon
- Department of Diabetes and Circadian Rhythms, Nestlé Institute of Health Sciences, 1015 Lausanne, Switzerland; School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| |
Collapse
|
24
|
Nettling M, Treutler H, Cerquides J, Grosse I. Unrealistic phylogenetic trees may improve phylogenetic footprinting. Bioinformatics 2018; 33:1639-1646. [PMID: 28130227 PMCID: PMC5447242 DOI: 10.1093/bioinformatics/btx033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 01/19/2017] [Indexed: 01/10/2023] Open
Abstract
Motivation The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Results Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. Availability and Implementation The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Martin Nettling
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle, Germany
| | - Hendrik Treutler
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle, Germany
| | - Jesus Cerquides
- Institut d'Investigació en Intel ligència Artificial, IIIA-CSIC, Campus UAB, Cerdanyola, Spain
| | - Ivo Grosse
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| |
Collapse
|
25
|
A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model. BIOMED RESEARCH INTERNATIONAL 2018; 2017:6274513. [PMID: 28497059 PMCID: PMC5405574 DOI: 10.1155/2017/6274513] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Revised: 03/06/2017] [Accepted: 03/23/2017] [Indexed: 11/24/2022]
Abstract
The discovery of cis-regulatory modules (CRMs) is the key to understanding mechanisms of transcription regulation. Since CRMs have specific regulatory structures that are the basis for the regulation of gene expression, how to model the regulatory structure of CRMs has a considerable impact on the performance of CRM identification. The paper proposes a CRM discovery algorithm called ComSPS. ComSPS builds a regulatory structure model of CRMs based on HMM by exploring the rules of CRM transcriptional grammar that governs the internal motif site arrangement of CRMs. We test ComSPS on three benchmark datasets and compare it with five existing methods. Experimental results show that ComSPS performs better than them.
Collapse
|
26
|
An integrated expression atlas of miRNAs and their promoters in human and mouse. Nat Biotechnol 2017; 35:872-878. [PMID: 28829439 DOI: 10.1038/nbt.3947] [Citation(s) in RCA: 383] [Impact Index Per Article: 47.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Accepted: 07/25/2017] [Indexed: 12/26/2022]
Abstract
MicroRNAs (miRNAs) are short non-coding RNAs with key roles in cellular regulation. As part of the fifth edition of the Functional Annotation of Mammalian Genome (FANTOM5) project, we created an integrated expression atlas of miRNAs and their promoters by deep-sequencing 492 short RNA (sRNA) libraries, with matching Cap Analysis Gene Expression (CAGE) data, from 396 human and 47 mouse RNA samples. Promoters were identified for 1,357 human and 804 mouse miRNAs and showed strong sequence conservation between species. We also found that primary and mature miRNA expression levels were correlated, allowing us to use the primary miRNA measurements as a proxy for mature miRNA levels in a total of 1,829 human and 1,029 mouse CAGE libraries. We thus provide a broad atlas of miRNA expression and promoters in primary mammalian cells, establishing a foundation for detailed analysis of miRNA expression patterns and transcriptional control regions.
Collapse
|
27
|
Sex combs reduced (Scr) regulatory region of Drosophila revisited. Mol Genet Genomics 2017; 292:773-787. [DOI: 10.1007/s00438-017-1309-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 03/08/2017] [Indexed: 10/19/2022]
|
28
|
Omidi S, Zavolan M, Pachkov M, Breda J, Berger S, van Nimwegen E. Automated incorporation of pairwise dependency in transcription factor binding site prediction using dinucleotide weight tensors. PLoS Comput Biol 2017; 13:e1005176. [PMID: 28753602 PMCID: PMC5550003 DOI: 10.1371/journal.pcbi.1005176] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Revised: 08/09/2017] [Accepted: 06/02/2017] [Indexed: 11/17/2022] Open
Abstract
Gene regulatory networks are ultimately encoded by the sequence-specific binding of (TFs) to short DNA segments. Although it is customary to represent the binding specificity of a TF by a position-specific weight matrix (PSWM), which assumes each position within a site contributes independently to the overall binding affinity, evidence has been accumulating that there can be significant dependencies between positions. Unfortunately, methodological challenges have so far hindered the development of a practical and generally-accepted extension of the PSWM model. On the one hand, simple models that only consider dependencies between nearest-neighbor positions are easy to use in practice, but fail to account for the distal dependencies that are observed in the data. On the other hand, models that allow for arbitrary dependencies are prone to overfitting, requiring regularization schemes that are difficult to use in practice for non-experts. Here we present a new regulatory motif model, called dinucleotide weight tensor (DWT), that incorporates arbitrary pairwise dependencies between positions in binding sites, rigorously from first principles, and free from tunable parameters. We demonstrate the power of the method on a large set of ChIP-seq data-sets, showing that DWTs outperform both PSWMs and motif models that only incorporate nearest-neighbor dependencies. We also demonstrate that DWTs outperform two previously proposed methods. Finally, we show that DWTs inferred from ChIP-seq data also outperform PSWMs on HT-SELEX data for the same TF, suggesting that DWTs capture inherent biophysical properties of the interactions between the DNA binding domains of TFs and their binding sites. We make a suite of DWT tools available at dwt.unibas.ch, that allow users to automatically perform ‘motif finding’, i.e. the inference of DWT motifs from a set of sequences, binding site prediction with DWTs, and visualization of DWT ‘dilogo’ motifs. Gene regulatory networks are ultimately encoded in constellations of short binding sites in the DNA and RNA that are recognized by regulatory factors such as transcription factors (TFs). For several decades, computational analysis of regulatory networks has relied on a model of TF sequence-specificity, the position-specific weight-matrix (PSWM), that assumes different positions in a binding site contribute independently to the total binding energy of the TF. However, in recent years evidence has been accumulating that, at least for some TFs, this assumption does not hold. Here we present a new model for the sequence-specificity of TFs, the dinucleotide weight tensor (DWT), that takes arbitrary dependencies between positions in binding sites into account and show that it consistently outperforms PSWMs on high-throughput datasets on TF binding. Moreover, in contrast to previous approaches, DWTs are directly derived from first principles within a Bayesian framework, and contain no tunable parameters. This allows them to be easily applied in practice and we make a suite of tools available for computational analysis with DWTs.
Collapse
Affiliation(s)
- Saeed Omidi
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Mihaela Zavolan
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Mikhail Pachkov
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Jeremie Breda
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Severin Berger
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Erik van Nimwegen
- Biozentrum, University of Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
29
|
Chorev M, Joseph Bekker A, Goldberger J, Carmel L. Identification of introns harboring functional sequence elements through positional conservation. Sci Rep 2017. [PMID: 28646210 PMCID: PMC5482813 DOI: 10.1038/s41598-017-04476-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Many human introns carry out a function, in the sense that they are critical to maintain normal cellular activity. Their identification is fundamental to understanding cellular processes and disease. However, being noncoding elements, such functional introns are poorly predicted based on traditional approaches of sequence and structure conservation. Here, we generated a dataset of human functional introns that carry out different types of functions. We showed that functional introns share common characteristics, such as higher positional conservation along the coding sequence and reduced loss rates, regardless of their specific function. A unique property of the data is that if an intron is unknown to be functional, it still does not mean that it is indeed non-functional. We developed a probabilistic framework that explicitly accounts for this unique property, and predicts which specific human introns are functional. We show that we successfully predict function even when the algorithm is trained on introns with a different type of function. This ability has many implications in studying regulatory networks, gene regulation, the effect of mutations outside exons on human disease, and on our general understanding of intron evolution and their functional exaptation in mammals.
Collapse
Affiliation(s)
- Michal Chorev
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem, 91904, Israel.,The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Jerusalem, 91904, Israel
| | | | - Jacob Goldberger
- Faculty of Engineering, Bar-Ilan University, Ramat Gan, 52900, Israel
| | - Liran Carmel
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem, 91904, Israel.
| |
Collapse
|
30
|
Nettling M, Treutler H, Cerquides J, Grosse I. Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies. BMC Bioinformatics 2017; 18:141. [PMID: 28249564 PMCID: PMC5333389 DOI: 10.1186/s12859-017-1495-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 01/24/2017] [Indexed: 11/23/2022] Open
Abstract
Background Transcriptional gene regulation is a fundamental process in nature, and the experimental and computational investigation of DNA binding motifs and their binding sites is a prerequisite for elucidating this process. Approaches for de-novo motif discovery can be subdivided in phylogenetic footprinting that takes into account phylogenetic dependencies in aligned sequences of more than one species and non-phylogenetic approaches based on sequences from only one species that typically take into account intra-motif dependencies. It has been shown that modeling (i) phylogenetic dependencies as well as (ii) intra-motif dependencies separately improves de-novo motif discovery, but there is no approach capable of modeling both (i) and (ii) simultaneously. Results Here, we present an approach for de-novo motif discovery that combines phylogenetic footprinting with motif models capable of taking into account intra-motif dependencies. We study the degree of intra-motif dependencies inferred by this approach from ChIP-seq data of 35 transcription factors. We find that significant intra-motif dependencies of orders 1 and 2 are present in all 35 datasets and that intra-motif dependencies of order 2 are typically stronger than those of order 1. We also find that the presented approach improves the classification performance of phylogenetic footprinting in all 35 datasets and that incorporating intra-motif dependencies of order 2 yields a higher classification performance than incorporating such dependencies of only order 1. Conclusion Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies leads to an improved performance in the classification of transcription factor binding sites. This may advance our understanding of transcriptional gene regulation and its evolution. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1495-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Martin Nettling
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle, Germany.
| | | | - Jesus Cerquides
- Institut d'Investigació en Intel ·ligència Artificial, IIIA-CSIC, Campus UAB, Cerdanyola, Spain
| | - Ivo Grosse
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| |
Collapse
|
31
|
Paracrine cross-talk between skeletal muscle and macrophages in exercise by PGC-1α-controlled BNP. Sci Rep 2017; 7:40789. [PMID: 28091624 PMCID: PMC5238507 DOI: 10.1038/srep40789] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 12/09/2016] [Indexed: 12/11/2022] Open
Abstract
Activation of resident and infiltrating immune cells is a central event in training adaptation and other contexts of skeletal muscle repair and regeneration. A precise orchestration of inflammatory events in muscle fibers and immune cells is required after recurrent contraction-relaxation cycles. However, the mechanistic aspects of this important regulation remain largely unknown. We now demonstrate that besides a dominant role in controlling cellular metabolism, the peroxisome proliferator-activated receptor γ co-activator 1α (PGC-1α) also has a profound effect on cytokine expression in muscle tissue. Muscle PGC-1α expression results in activation of tissue-resident macrophages, at least in part mediated by PGC-1α-dependent B-type natriuretic peptide (BNP) production and secretion. Positive effects of exercise in metabolic diseases and other pathologies associated with chronic inflammation could accordingly involve the PGC-1α-BNP axis and thereby provide novel targets for therapeutic approaches.
Collapse
|
32
|
Guo H, Huo H, Yu Q. SMCis: An Effective Algorithm for Discovery of Cis-Regulatory Modules. PLoS One 2016; 11:e0162968. [PMID: 27637070 PMCID: PMC5026350 DOI: 10.1371/journal.pone.0162968] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 08/31/2016] [Indexed: 12/02/2022] Open
Abstract
The discovery of cis-regulatory modules (CRMs) is a challenging problem in computational biology. Limited by the difficulty of using an HMM to model dependent features in transcriptional regulatory sequences (TRSs), the probabilistic modeling methods based on HMMs cannot accurately represent the distance between regulatory elements in TRSs and are cumbersome to model the prevailing dependencies between motifs within CRMs. We propose a probabilistic modeling algorithm called SMCis, which builds a more powerful CRM discovery model based on a hidden semi-Markov model. Our model characterizes the regulatory structure of CRMs and effectively models dependencies between motifs at a higher level of abstraction based on segments rather than nucleotides. Experimental results on three benchmark datasets indicate that our method performs better than the compared algorithms.
Collapse
Affiliation(s)
- Haitao Guo
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi, China
| | - Hongwei Huo
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi, China
- * E-mail:
| | - Qiang Yu
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi, China
| |
Collapse
|
33
|
Salatino S, Kupr B, Baresic M, Omidi S, van Nimwegen E, Handschin C. The Genomic Context and Corecruitment of SP1 Affect ERRα Coactivation by PGC-1α in Muscle Cells. Mol Endocrinol 2016; 30:809-25. [PMID: 27182621 PMCID: PMC4970653 DOI: 10.1210/me.2016-1036] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Accepted: 05/05/2016] [Indexed: 01/22/2023] Open
Abstract
The peroxisome proliferator-activated receptor-γ coactivator 1α (PGC-1α) coordinates the transcriptional network response to promote an improved endurance capacity in skeletal muscle, eg, by coactivating the estrogen-related receptor-α (ERRα) in the regulation of oxidative substrate metabolism. Despite a close functional relationship, the interaction between these 2 proteins has not been studied on a genomic level. We now mapped the genome-wide binding of ERRα to DNA in a skeletal muscle cell line with elevated PGC-1α and linked the DNA recruitment to global PGC-1α target gene regulation. We found that, surprisingly, ERRα coactivation by PGC-1α is only observed in the minority of all PGC-1α recruitment sites. Nevertheless, a majority of PGC-1α target gene expression is dependent on ERRα. Intriguingly, the interaction between these 2 proteins is controlled by the genomic context of response elements, in particular the relative GC and CpG content, monomeric and dimeric repeat-binding site configuration for ERRα, and adjacent recruitment of the transcription factor specificity protein 1. These findings thus not only reveal a novel insight into the regulatory network underlying muscle cell plasticity but also strongly link the genomic context of DNA-response elements to control transcription factor-coregulator interactions.
Collapse
Affiliation(s)
- Silvia Salatino
- Focal Area Growth and Development (S.S., B.K., M.B., C.H.) and Focal Area Computational and Systems Biology (S.S., E.N.), Biozentrum, University of Basel, and Swiss Institute of Bioinformatics (S.S., E.N.), CH-4056 Basel, Switzerland
| | - Barbara Kupr
- Focal Area Growth and Development (S.S., B.K., M.B., C.H.) and Focal Area Computational and Systems Biology (S.S., E.N.), Biozentrum, University of Basel, and Swiss Institute of Bioinformatics (S.S., E.N.), CH-4056 Basel, Switzerland
| | - Mario Baresic
- Focal Area Growth and Development (S.S., B.K., M.B., C.H.) and Focal Area Computational and Systems Biology (S.S., E.N.), Biozentrum, University of Basel, and Swiss Institute of Bioinformatics (S.S., E.N.), CH-4056 Basel, Switzerland
| | | | - Erik van Nimwegen
- Focal Area Growth and Development (S.S., B.K., M.B., C.H.) and Focal Area Computational and Systems Biology (S.S., E.N.), Biozentrum, University of Basel, and Swiss Institute of Bioinformatics (S.S., E.N.), CH-4056 Basel, Switzerland
| | - Christoph Handschin
- Focal Area Growth and Development (S.S., B.K., M.B., C.H.) and Focal Area Computational and Systems Biology (S.S., E.N.), Biozentrum, University of Basel, and Swiss Institute of Bioinformatics (S.S., E.N.), CH-4056 Basel, Switzerland
| |
Collapse
|
34
|
Alcalá-Corona SA, Velázquez-Caldelas TE, Espinal-Enríquez J, Hernández-Lemus E. Community Structure Reveals Biologically Functional Modules in MEF2C Transcriptional Regulatory Network. Front Physiol 2016; 7:184. [PMID: 27252657 PMCID: PMC4878384 DOI: 10.3389/fphys.2016.00184] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2016] [Accepted: 05/06/2016] [Indexed: 01/04/2023] Open
Abstract
Gene regulatory networks are useful to understand the activity behind the complex mechanisms in transcriptional regulation. A main goal in contemporary biology is using such networks to understand the systemic regulation of gene expression. In this work, we carried out a systematic study of a transcriptional regulatory network derived from a comprehensive selection of all potential transcription factor interactions downstream from MEF2C, a human transcription factor master regulator. By analyzing the connectivity structure of such network, we were able to find different biologically functional processes and specific biochemical pathways statistically enriched in communities of genes into the network, such processes are related to cell signaling, cell cycle and metabolism. In this way we further support the hypothesis that structural properties of biological networks encode an important part of their functional behavior in eukaryotic cells.
Collapse
Affiliation(s)
- Sergio A Alcalá-Corona
- Computational Genomics Department, National Institute of Genomic MedicineMexico City, Mexico; Complexity in Systems Biology, Center for Complexity Sciences, Universidad Nacional Autónoma de MéxicoMexico City, Mexico
| | | | - Jesús Espinal-Enríquez
- Computational Genomics Department, National Institute of Genomic MedicineMexico City, Mexico; Complexity in Systems Biology, Center for Complexity Sciences, Universidad Nacional Autónoma de MéxicoMexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Department, National Institute of Genomic MedicineMexico City, Mexico; Complexity in Systems Biology, Center for Complexity Sciences, Universidad Nacional Autónoma de MéxicoMexico City, Mexico
| |
Collapse
|
35
|
Nettling M, Treutler H, Cerquides J, Grosse I. Detecting and correcting the binding-affinity bias in ChIP-seq data using inter-species information. BMC Genomics 2016; 17:347. [PMID: 27165633 PMCID: PMC4862171 DOI: 10.1186/s12864-016-2682-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 04/28/2016] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Transcriptional gene regulation is a fundamental process in nature, and the experimental and computational investigation of DNA binding motifs and their binding sites is a prerequisite for elucidating this process. ChIP-seq has become the major technology to uncover genomic regions containing those binding sites, but motifs predicted by traditional computational approaches using these data are distorted by a ubiquitous binding-affinity bias. Here, we present an approach for detecting and correcting this bias using inter-species information. RESULTS We find that the binding-affinity bias caused by the ChIP-seq experiment in the reference species is stronger than the indirect binding-affinity bias in orthologous regions from phylogenetically related species. We use this difference to develop a phylogenetic footprinting model that is capable of detecting and correcting the binding-affinity bias. We find that this model improves motif prediction and that the corrected motifs are typically softer than those predicted by traditional approaches. CONCLUSIONS These findings indicate that motifs published in databases and in the literature are artificially sharpened compared to the native motifs. These findings also indicate that our current understanding of transcriptional gene regulation might be blurred, but that it is possible to advance this understanding by taking into account inter-species information available today and even more in the future.
Collapse
Affiliation(s)
- Martin Nettling
- Institute of Computer Science, Martin Luther University, Halle (Saale), Germany.
| | | | | | - Ivo Grosse
- Institute of Computer Science, Martin Luther University, Halle (Saale), Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| |
Collapse
|
36
|
Svensson K, Albert V, Cardel B, Salatino S, Handschin C. Skeletal muscle PGC-1α modulates systemic ketone body homeostasis and ameliorates diabetic hyperketonemia in mice. FASEB J 2016; 30:1976-86. [PMID: 26849960 PMCID: PMC4970654 DOI: 10.1096/fj.201500128] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 01/26/2016] [Indexed: 11/11/2022]
Abstract
Ketone bodies (KBs) are crucial energy substrates during states of low carbohydrate availability. However, an aberrant regulation of KB homeostasis can lead to complications such as diabetic ketoacidosis. Exercise and diabetes affect systemic KB homeostasis, but the regulation of KB metabolism is still enigmatic. In our study in mice with either knockout or overexpression of the peroxisome proliferator-activated receptor-γ coactivator (PGC)-1α in skeletal muscle, PGC-1α regulated ketolytic gene transcription in muscle. Furthermore, KB homeostasis of these mice was investigated during withholding of food, exercise, and ketogenic diet feeding, and after streptozotocin injection. In response to these ketogenic stimuli, modulation of PGC-1α levels in muscle affected systemic KB homeostasis. Moreover, the data demonstrate that skeletal muscle PGC-1α is necessary for the enhanced ketolytic capacity in response to exercise training and overexpression of PGC-1α in muscle enhances systemic ketolytic capacity and is sufficient to ameliorate diabetic hyperketonemia in mice. In cultured myotubes, the transcription factor estrogen-related receptor-α was a partner of PGC-1α in the regulation of ketolytic gene transcription. These results demonstrate a central role of skeletal muscle PGC-1α in the transcriptional regulation of systemic ketolytic capacity.-Svensson, K., Albert, V., Cardel, B., Salatino, S., Handschin, C. Skeletal muscle PGC-1α modulates systemic ketone body homeostasis and ameliorates diabetic hyperketonemia in mice.
Collapse
|
37
|
SIB Swiss Institute of Bioinformatics Members. The SIB Swiss Institute of Bioinformatics' resources: focus on curated databases. Nucleic Acids Res 2015; 44:D27-37. [PMID: 26615188 PMCID: PMC4702916 DOI: 10.1093/nar/gkv1310] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 11/09/2015] [Indexed: 12/15/2022] Open
Abstract
The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) provides world-class bioinformatics databases, software tools, services and training to the international life science community in academia and industry. These solutions allow life scientists to turn the exponentially growing amount of data into knowledge. Here, we provide an overview of SIB's resources and competence areas, with a strong focus on curated databases and SIB's most popular and widely used resources. In particular, SIB's Bioinformatics resource portal ExPASy features over 150 resources, including UniProtKB/Swiss-Prot, ENZYME, PROSITE, neXtProt, STRING, UniCarbKB, SugarBindDB, SwissRegulon, EPD, arrayMap, Bgee, SWISS-MODEL Repository, OMA, OrthoDB and other databases, which are briefly described in this article.
Collapse
|
38
|
de Hoon M, Shin JW, Carninci P. Paradigm shifts in genomics through the FANTOM projects. Mamm Genome 2015; 26:391-402. [PMID: 26253466 PMCID: PMC4602071 DOI: 10.1007/s00335-015-9593-8] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 07/08/2015] [Indexed: 12/18/2022]
Abstract
Big leaps in science happen when scientists from different backgrounds interact. In the past 15 years, the FANTOM Consortium has brought together scientists from different fields to analyze and interpret genomic data produced with novel technologies, including mouse full-length cDNAs and, more recently, expression profiling at single-nucleotide resolution by cap-analysis gene expression. The FANTOM Consortium has provided the most comprehensive mouse cDNA collection for functional studies and extensive maps of the human and mouse transcriptome comprising promoters, enhancers, as well as the network of their regulatory interactions. More importantly, serendipitous observations of the FANTOM dataset led us to realize that the mammalian genome is pervasively transcribed, even from retrotransposon elements, which were previously considered junk DNA. The majority of products from the mammalian genome are long non-coding RNAs (lncRNAs), including sense-antisense, intergenic, and enhancer RNAs. While the biological function has been elucidated for some lncRNAs, more than 98 % of them remain without a known function. We argue that large-scale studies are urgently needed to address the functional role of lncRNAs.
Collapse
Affiliation(s)
- Michiel de Hoon
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| | - Jay W Shin
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| | - Piero Carninci
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| |
Collapse
|
39
|
Pemberton-Ross PJ, Pachkov M, van Nimwegen E. ARMADA: Using motif activity dynamics to infer gene regulatory networks from gene expression data. Methods 2015; 85:62-74. [PMID: 26164700 DOI: 10.1016/j.ymeth.2015.06.024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Revised: 06/22/2015] [Accepted: 06/23/2015] [Indexed: 11/19/2022] Open
Abstract
Analysis of gene expression data remains one of the most promising avenues toward reconstructing genome-wide gene regulatory networks. However, the large dimensionality of the problem prohibits the fitting of explicit dynamical models of gene regulatory networks, whereas machine learning methods for dimensionality reduction such as clustering or principal component analysis typically fail to provide mechanistic interpretations of the reduced descriptions. To address this, we recently developed a general methodology called motif activity response analysis (MARA) that, by modeling gene expression patterns in terms of the activities of concrete regulators, accomplishes dramatic dimensionality reduction while retaining mechanistic biological interpretations of its predictions (Balwierz, 2014). Here we extend MARA by presenting ARMADA, which models the activity dynamics of regulators across a time course, and infers the causal interactions between the regulators that drive the dynamics of their activities across time. We have implemented ARMADA as part of our ISMARA webserver, ismara.unibas.ch, allowing any researcher to automatically apply it to any gene expression time course. To illustrate the method, we apply ARMADA to a time course of human umbilical vein endothelial cells treated with TNF. Remarkably, ARMADA is able to reproduce the complex observed motif activity dynamics using a relatively small set of interactions between the key regulators in this system. In addition, we show that ARMADA successfully infers many of the key regulatory interactions known to drive this inflammatory response and discuss several novel interactions that ARMADA predicts. In combination with ISMARA, ARMADA provides a powerful approach to generating plausible hypotheses for the key interactions between regulators that control gene expression in any system for which time course measurements are available.
Collapse
Affiliation(s)
- Peter J Pemberton-Ross
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Mikhail Pachkov
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Erik van Nimwegen
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Basel, Switzerland.
| |
Collapse
|
40
|
Oldenburg J, Watzka M, Bevans CG. VKORC1 and VKORC1L1: Why do Vertebrates Have Two Vitamin K 2,3-Epoxide Reductases? Nutrients 2015; 7:6250-80. [PMID: 26264021 PMCID: PMC4555119 DOI: 10.3390/nu7085280] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Revised: 07/08/2015] [Accepted: 07/15/2015] [Indexed: 01/01/2023] Open
Abstract
Among all cellular life on earth, with the exception of yeasts, fungi, and some prokaryotes, VKOR family homologs are ubiquitously encoded in nuclear genomes, suggesting ancient and important biological roles for these enzymes. Despite single gene and whole genome duplications on the largest evolutionary timescales, and the fact that most gene duplications eventually result in loss of one copy, it is surprising that all jawed vertebrates (gnathostomes) have retained two paralogous VKOR genes. Both VKOR paralogs function as entry points for nutritionally acquired and recycled K vitamers in the vitamin K cycle. Here we present phylogenetic evidence that the human paralogs likely arose earlier than gnathostomes, possibly in the ancestor of crown chordates. We ask why gnathostomes have maintained these paralogs throughout evolution and present a current summary of what we know. In particular, we look to published studies about tissue- and developmental stage-specific expression, enzymatic function, phylogeny, biological roles and associated pathways that together suggest subfunctionalization as a major influence in evolutionary fixation of both paralogs. Additionally, we investigate on what evolutionary timescale the paralogs arose and under what circumstances in order to gain insight into the biological raison d’être for both VKOR paralogs in gnathostomes.
Collapse
Affiliation(s)
- Johannes Oldenburg
- Institute of Experimental Haematology and Transfusion Medicine, University Clinic Bonn, Bonn 53105, Germany.
| | - Matthias Watzka
- Institute of Experimental Haematology and Transfusion Medicine, University Clinic Bonn, Bonn 53105, Germany.
| | | |
Collapse
|
41
|
Martinez-Ledesma E, Verhaak RGW, Treviño V. Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm. Sci Rep 2015. [PMID: 26202601 PMCID: PMC5378879 DOI: 10.1038/srep11966] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Cancer types are commonly classified by histopathology and more recently through molecular characteristics such as gene expression, mutations, copy number variations, and epigenetic alterations. These molecular characterizations have led to the proposal of prognostic biomarkers for many cancer types. Nevertheless, most of these biomarkers have been proposed for a specific cancer type or even specific subtypes. Although more challenging, it is useful to identify biomarkers that can be applied for multiple types of cancer. Here, we have used a network-based exploration approach to identify a multi-cancer gene expression biomarker highly connected by ESR1, PRKACA, LRP1, JUN and SMAD2 that can be predictive of clinical outcome in 12 types of cancer from The Cancer Genome Atlas (TCGA) repository. The gene signature of this biomarker is highly supported by cancer literature, biological terms, and prognostic power in other cancer types. Additionally, the signature does not seem to be highly associated with specific mutations or copy number alterations. Comparisons with cancer-type specific and other multi-cancer biomarkers in TCGA and other datasets showed that the performance of the proposed multi-cancer biomarker is superior, making the proposed approach and multi-cancer biomarker potentially useful in research and clinical settings.
Collapse
Affiliation(s)
- Emmanuel Martinez-Ledesma
- 1] Grupo de Enfoque e Investigación en Bioinformática, Departamento de Investigación e Innovación, Escuela Nacional de Medicina, Tecnológico de Monterrey, Monterrey, Nuevo León 64849, México [2] Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Roeland G W Verhaak
- 1] Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA [2] Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Victor Treviño
- Grupo de Enfoque e Investigación en Bioinformática, Departamento de Investigación e Innovación, Escuela Nacional de Medicina, Tecnológico de Monterrey, Monterrey, Nuevo León 64849, México
| |
Collapse
|
42
|
Hernández Lemus E, Baca López K, Lemus R, García Herrera R. The role of master regulators in gene regulatory networks. PAPERS IN PHYSICS 2015. [DOI: 10.4279/pip.070011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Gene regulatory networks present a wide variety of dynamical responses to intrinsic and extrinsic perturbations. Arguably, one of the most important of such coordinated responses is the one of amplification cascades, in which activation of a few key-responsive transcription factors (termed master regulators, MRs) lead to a large series of transcriptional activation events. This is so since master regulators are transcription factors controlling the expression of other transcription factor molecules and so on. MRs hold a central position related to transcriptional dynamics and control of gene regulatory networks and are often involved in complex feedback and feedforward loops inducing non-trivial dynamics. Recent studies have pointed out to the myocyte enhancing factor 2C (MEF2C, also known as MADS box transcription enhancer factor 2, polypeptide C) as being one of such master regulators involved in the pathogenesis of primary breast cancer. In this work, we perform an integrative genomic analysis of the transcriptional regulation activity of MEF2C and its target genes to evaluate to what extent are these molecules inducing collective responses leading to gene expression deregulation and carcinogenesis. We also analyzed a number of induced dynamic responses, in particular those associated with transcriptional bursts, and nonlinear cascading to evaluate the influence they may have in malignant phenotypes and cancer.Received: 20 Novembre 2014, Accepted: 24 June 2015; Edited by: C. A. Condat, G. J. Sibona; DOI: http://dx.doi.org/10.4279/PIP.070011Cite as: E Hernández-Lemus, K Baca-López, R Lemus, R García-Herrera, Papers in Physics 7, 070011 (2015)This paper, by E Hernández-Lemus, K Baca-López, R Lemus, R García-Herrera, is licensed under the Creative Commons Attribution License 3.0.
Collapse
|
43
|
Arner E, Daub CO, Vitting-Seerup K, Andersson R, Lilje B, Drabløs F, Lennartsson A, Rönnerblad M, Hrydziuszko O, Vitezic M, Freeman TC, Alhendi AMN, Arner P, Axton R, Baillie JK, Beckhouse A, Bodega B, Briggs J, Brombacher F, Davis M, Detmar M, Ehrlund A, Endoh M, Eslami A, Fagiolini M, Fairbairn L, Faulkner GJ, Ferrai C, Fisher ME, Forrester L, Goldowitz D, Guler R, Ha T, Hara M, Herlyn M, Ikawa T, Kai C, Kawamoto H, Khachigian LM, Klinken SP, Kojima S, Koseki H, Klein S, Mejhert N, Miyaguchi K, Mizuno Y, Morimoto M, Morris KJ, Mummery C, Nakachi Y, Ogishima S, Okada-Hatakeyama M, Okazaki Y, Orlando V, Ovchinnikov D, Passier R, Patrikakis M, Pombo A, Qin XY, Roy S, Sato H, Savvi S, Saxena A, Schwegmann A, Sugiyama D, Swoboda R, Tanaka H, Tomoiu A, Winteringham LN, Wolvetang E, Yanagi-Mizuochi C, Yoneda M, Zabierowski S, Zhang P, Abugessaisa I, Bertin N, Diehl AD, Fukuda S, Furuno M, Harshbarger J, Hasegawa A, Hori F, Ishikawa-Kato S, Ishizu Y, Itoh M, Kawashima T, Kojima M, Kondo N, Lizio M, Meehan TF, Mungall CJ, Murata M, Nishiyori-Sueki H, Sahin S, Nagao-Sato S, Severin J, de Hoon MJL, Kawai J, Kasukawa T, Lassmann T, et alArner E, Daub CO, Vitting-Seerup K, Andersson R, Lilje B, Drabløs F, Lennartsson A, Rönnerblad M, Hrydziuszko O, Vitezic M, Freeman TC, Alhendi AMN, Arner P, Axton R, Baillie JK, Beckhouse A, Bodega B, Briggs J, Brombacher F, Davis M, Detmar M, Ehrlund A, Endoh M, Eslami A, Fagiolini M, Fairbairn L, Faulkner GJ, Ferrai C, Fisher ME, Forrester L, Goldowitz D, Guler R, Ha T, Hara M, Herlyn M, Ikawa T, Kai C, Kawamoto H, Khachigian LM, Klinken SP, Kojima S, Koseki H, Klein S, Mejhert N, Miyaguchi K, Mizuno Y, Morimoto M, Morris KJ, Mummery C, Nakachi Y, Ogishima S, Okada-Hatakeyama M, Okazaki Y, Orlando V, Ovchinnikov D, Passier R, Patrikakis M, Pombo A, Qin XY, Roy S, Sato H, Savvi S, Saxena A, Schwegmann A, Sugiyama D, Swoboda R, Tanaka H, Tomoiu A, Winteringham LN, Wolvetang E, Yanagi-Mizuochi C, Yoneda M, Zabierowski S, Zhang P, Abugessaisa I, Bertin N, Diehl AD, Fukuda S, Furuno M, Harshbarger J, Hasegawa A, Hori F, Ishikawa-Kato S, Ishizu Y, Itoh M, Kawashima T, Kojima M, Kondo N, Lizio M, Meehan TF, Mungall CJ, Murata M, Nishiyori-Sueki H, Sahin S, Nagao-Sato S, Severin J, de Hoon MJL, Kawai J, Kasukawa T, Lassmann T, Suzuki H, Kawaji H, Summers KM, Wells C, FANTOM Consortium, Hume DA, Forrest ARR, Sandelin A, Carninci P, Hayashizaki Y. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science 2015; 347:1010-4. [PMID: 25678556 PMCID: PMC4681433 DOI: 10.1126/science.1259418] [Show More Authors] [Citation(s) in RCA: 438] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Although it is generally accepted that cellular differentiation requires changes to transcriptional networks, dynamic regulation of promoters and enhancers at specific sets of genes has not been previously studied en masse. Exploiting the fact that active promoters and enhancers are transcribed, we simultaneously measured their activity in 19 human and 14 mouse time courses covering a wide range of cell types and biological stimuli. Enhancer RNAs, then messenger RNAs encoding transcription factors, dominated the earliest responses. Binding sites for key lineage transcription factors were simultaneously overrepresented in enhancers and promoters active in each cellular system. Our data support a highly generalizable model in which enhancer transcription is the earliest event in successive waves of transcriptional change during cellular differentiation or activation.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - David A. Hume
- Corresponding author. (D.A.H.); (A.R.R.F.); (A.S.); (P.C.); (Y.H.)
| | | | - Albin Sandelin
- Corresponding author. (D.A.H.); (A.R.R.F.); (A.S.); (P.C.); (Y.H.)
| | - Piero Carninci
- Corresponding author. (D.A.H.); (A.R.R.F.); (A.S.); (P.C.); (Y.H.)
| | | |
Collapse
|
44
|
Vitezic M, Bertin N, Andersson R, Lipovich L, Kawaji H, Lassmann T, Sandelin A, Heutink P, Goldowitz D, Ha T, Zhang P, Patrizi A, Fagiolini M, Forrest ARR, Carninci P, Saxena A. CAGE-defined promoter regions of the genes implicated in Rett Syndrome. BMC Genomics 2014; 15:1177. [PMID: 25539566 PMCID: PMC4522966 DOI: 10.1186/1471-2164-15-1177] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2013] [Accepted: 12/04/2014] [Indexed: 11/10/2022] Open
Abstract
Background Mutations in three functionally diverse genes cause Rett Syndrome. Although the functions of Forkhead box G1 (FOXG1), Methyl CpG binding protein 2 (MECP2) and Cyclin-dependent kinase-like 5 (CDKL5) have been studied individually, not much is known about their relation to each other with respect to expression levels and regulatory regions. Here we analyzed data from hundreds of mouse and human samples included in the FANTOM5 project, to identify transcript initiation sites, expression levels, expression correlations and regulatory regions of the three genes. Results Our investigations reveal the predominantly used transcription start sites (TSSs) for each gene including novel transcription start sites for FOXG1. We show that FOXG1 expression is poorly correlated with the expression of MECP2 and CDKL5. We identify promoter shapes for each TSS, the predicted location of enhancers for each gene and the common transcription factors likely to regulate the three genes. Our data imply Polycomb Repressive Complex 2 (PRC2) mediated silencing of Foxg1 in cerebellum. Conclusions Our analyses provide a comprehensive picture of the regulatory regions of the three genes involved in Rett Syndrome. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1177) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Morana Vitezic
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Department of Cell and Molecular Biology (CMB), Karolinska Institutet, Stockholm, Sweden. .,The Bioinformatics Center, Department of Biology and Biotech Research and Innovation Center, University of Copenhagen, Copenhagen, Denmark.
| | - Nicolas Bertin
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan. .,Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore.
| | - Robin Andersson
- The Bioinformatics Center, Department of Biology and Biotech Research and Innovation Center, University of Copenhagen, Copenhagen, Denmark.
| | - Leonard Lipovich
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, USA. .,Department of Neurology, School of Medicine, Wayne State University, Detroit, MI, USA.
| | - Hideya Kawaji
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan. .,RIKEN Preventive Medicine and Diagnosis Innovation Program (PMI), Wako, Japan.
| | - Timo Lassmann
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan. .,Telethon Kids Institute, The University of Western Australia, Perth, Australia.
| | - Albin Sandelin
- The Bioinformatics Center, Department of Biology and Biotech Research and Innovation Center, University of Copenhagen, Copenhagen, Denmark.
| | - Peter Heutink
- Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan. .,German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany. .,Eberhard Karls University, Tübingen, Germany.
| | - Dan Goldowitz
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, Dept of Medical Genetics, University of British Columbia, Vancouver, Canada.
| | - Thomas Ha
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, Dept of Medical Genetics, University of British Columbia, Vancouver, Canada.
| | - Peter Zhang
- Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, Dept of Medical Genetics, University of British Columbia, Vancouver, Canada.
| | - Annarita Patrizi
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
| | - Michela Fagiolini
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
| | - Alistair R R Forrest
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan.
| | - Piero Carninci
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Division of Genomic Technologies (DGT), RIKEN Center for Life Science Technologies, Yokohama, Japan.
| | - Alka Saxena
- Omics Science Center, RIKEN Yokohama Institute, Omics Science Center (OSC), 1-17-22 Suehiro cho, Tsurumi ku, Yokohama, Japan. .,Biomedical Research Centre at Guy's and St Thomas' Trust, Genomics Core Facility, Guy's Hospital, London, UK.
| | | |
Collapse
|
45
|
Gruber AR, Martin G, Müller P, Schmidt A, Gruber AJ, Gumienny R, Mittal N, Jayachandran R, Pieters J, Keller W, van Nimwegen E, Zavolan M. Global 3′ UTR shortening has a limited effect on protein abundance in proliferating T cells. Nat Commun 2014; 5:5465. [DOI: 10.1038/ncomms6465] [Citation(s) in RCA: 143] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2014] [Accepted: 10/03/2014] [Indexed: 12/12/2022] Open
|
46
|
Baresic M, Salatino S, Kupr B, van Nimwegen E, Handschin C. Transcriptional network analysis in muscle reveals AP-1 as a partner of PGC-1α in the regulation of the hypoxic gene program. Mol Cell Biol 2014; 34:2996-3012. [PMID: 24912679 PMCID: PMC4135604 DOI: 10.1128/mcb.01710-13] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2013] [Revised: 01/26/2014] [Accepted: 06/03/2014] [Indexed: 12/16/2022] Open
Abstract
Skeletal muscle tissue shows an extraordinary cellular plasticity, but the underlying molecular mechanisms are still poorly understood. Here, we use a combination of experimental and computational approaches to unravel the complex transcriptional network of muscle cell plasticity centered on the peroxisome proliferator-activated receptor γ coactivator 1α (PGC-1α), a regulatory nexus in endurance training adaptation. By integrating data on genome-wide binding of PGC-1α and gene expression upon PGC-1α overexpression with comprehensive computational prediction of transcription factor binding sites (TFBSs), we uncover a hitherto-underestimated number of transcription factor partners involved in mediating PGC-1α action. In particular, principal component analysis of TFBSs at PGC-1α binding regions predicts that, besides the well-known role of the estrogen-related receptor α (ERRα), the activator protein 1 complex (AP-1) plays a major role in regulating the PGC-1α-controlled gene program of the hypoxia response. Our findings thus reveal the complex transcriptional network of muscle cell plasticity controlled by PGC-1α.
Collapse
Affiliation(s)
- Mario Baresic
- Focal Area Growth and Development, Biozentrum, University of Basel, Basel, Switzerland
| | - Silvia Salatino
- Focal Area Growth and Development, Biozentrum, University of Basel, Basel, Switzerland Focal Area Computational and Systems Biology, Biozentrum, University of Basel, Basel, Switzerland Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Barbara Kupr
- Focal Area Growth and Development, Biozentrum, University of Basel, Basel, Switzerland
| | - Erik van Nimwegen
- Focal Area Computational and Systems Biology, Biozentrum, University of Basel, Basel, Switzerland Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Christoph Handschin
- Focal Area Growth and Development, Biozentrum, University of Basel, Basel, Switzerland
| |
Collapse
|
47
|
Davis MR, Andersson R, Severin J, de Hoon M, Bertin N, Baillie JK, Kawaji H, Sandelin A, Forrest ARR, Summers KM. Transcriptional profiling of the human fibrillin/LTBP gene family, key regulators of mesenchymal cell functions. Mol Genet Metab 2014; 112:73-83. [PMID: 24703491 PMCID: PMC4019825 DOI: 10.1016/j.ymgme.2013.12.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Revised: 12/06/2013] [Accepted: 12/06/2013] [Indexed: 01/23/2023]
Abstract
The fibrillins and latent transforming growth factor binding proteins (LTBPs) form a superfamily of extracellular matrix (ECM) proteins characterized by the presence of a unique domain, the 8-cysteine transforming growth factor beta (TGFβ) binding domain. These proteins are involved in the structure of the extracellular matrix and controlling the bioavailability of TGFβ family members. Genes encoding these proteins show differential expression in mesenchymal cell types which synthesize the extracellular matrix. We have investigated the promoter regions of the seven gene family members using the FANTOM5 CAGE database for human. While the protein and nucleotide sequences show considerable sequence similarity, the promoter regions were quite diverse. Most genes had a single predominant transcription start site region but LTBP1 and LTBP4 had two regions initiating different transcripts. Most of the family members were expressed in a range of mesenchymal and other cell types, often associated with use of alternative promoters or transcription start sites within a promoter in different cell types. FBN3 was the lowest expressed gene, and was found only in embryonic and fetal tissues. The different promoters for one gene were more similar to each other in expression than to promoters of the other family members. Notably expression of all 22 LTBP2 promoters was tightly correlated and quite distinct from all other family members. We located candidate enhancer regions likely to be involved in expression of the genes. Each gene was associated with a unique subset of transcription factors across multiple promoters although several motifs including MAZ, SP1, GTF2I and KLF4 showed overrepresentation across the gene family. FBN1 and FBN2, which had similar expression patterns, were regulated by different transcription factors. This study highlights the role of alternative transcription start sites in regulating the tissue specificity of closely related genes and suggests that this important class of extracellular matrix proteins is subject to subtle regulatory variations that explain the differential roles of members of this gene family.
Collapse
Affiliation(s)
- Margaret R Davis
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush EH25 9RG, UK.
| | - Robin Andersson
- The Bioinformatics Centre, Department of Biology and Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, 2200 Copenhagen N, Denmark.
| | - Jessica Severin
- RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan(1); RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa 230-0045, Japan.
| | - Michiel de Hoon
- RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan(1); RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa 230-0045, Japan.
| | - Nicolas Bertin
- RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan(1); RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa 230-0045, Japan.
| | - J Kenneth Baillie
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush EH25 9RG, UK.
| | - Hideya Kawaji
- RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan(1); RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa 230-0045, Japan; RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama 351-0198, Japan.
| | - Albin Sandelin
- The Bioinformatics Centre, Department of Biology and Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, 2200 Copenhagen N, Denmark.
| | - Alistair R R Forrest
- RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan(1); RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa 230-0045, Japan.
| | - Kim M Summers
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush EH25 9RG, UK; The University of Queensland Northside Clinical School, Prince Charles Hospital, Chermside 4032, Australia.
| |
Collapse
|
48
|
Levinson M, Zhou Q. A penalized Bayesian approach to predicting sparse protein-DNA binding landscapes. ACTA ACUST UNITED AC 2014; 30:636-43. [PMID: 24115169 DOI: 10.1093/bioinformatics/btt585] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Cellular processes are controlled, directly or indirectly, by the binding of hundreds of different DNA binding factors (DBFs) to the genome. One key to deeper understanding of the cell is discovering where, when and how strongly these DBFs bind to the DNA sequence. Direct measurement of DBF binding sites (BSs; e.g. through ChIP-Chip or ChIP-Seq experiments) is expensive, noisy and not available for every DBF in every cell type. Naive and most existing computational approaches to detecting which DBFs bind in a set of genomic regions of interest often perform poorly, due to the high false discovery rates and restrictive requirements for prior knowledge. RESULTS We develop SparScape, a penalized Bayesian method for identifying DBFs active in the considered regions and predicting a joint probabilistic binding landscape. Using a sparsity-inducing penalization, SparScape is able to select a small subset of DBFs with enriched BSs in a set of DNA sequences from a much larger candidate set. This substantially reduces the false positives in prediction of BSs. Analysis of ChIP-Seq data in mouse embryonic stem cells and simulated data show that SparScape dramatically outperforms the naive motif scanning method and the comparable computational approaches in terms of DBF identification and BS prediction. AVAILABILITY AND IMPLEMENTATION SparScape is implemented in C++ with OpenMP (optional at compilation) and is freely available at 'www.stat.ucla.edu/∼zhou/Software.html' for academic use.
Collapse
Affiliation(s)
- Matthew Levinson
- Department of Statistics, University of California, Los Angeles, CA 90095, USA
| | | |
Collapse
|
49
|
Balwierz PJ, Pachkov M, Arnold P, Gruber AJ, Zavolan M, van Nimwegen E. ISMARA: automated modeling of genomic signals as a democracy of regulatory motifs. Genome Res 2014; 24:869-84. [PMID: 24515121 PMCID: PMC4009616 DOI: 10.1101/gr.169508.113] [Citation(s) in RCA: 228] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Accurate reconstruction of the regulatory networks that control gene expression is one of the key current challenges in molecular biology. Although gene expression and chromatin state dynamics are ultimately encoded by constellations of binding sites recognized by regulators such as transcriptions factors (TFs) and microRNAs (miRNAs), our understanding of this regulatory code and its context-dependent read-out remains very limited. Given that there are thousands of potential regulators in mammals, it is not practical to use direct experimentation to identify which of these play a key role for a particular system of interest. We developed a methodology that models gene expression or chromatin modifications in terms of genome-wide predictions of regulatory sites and completely automated it into a web-based tool called ISMARA (Integrated System for Motif Activity Response Analysis). Given only gene expression or chromatin state data across a set of samples as input, ISMARA identifies the key TFs and miRNAs driving expression/chromatin changes and makes detailed predictions regarding their regulatory roles. These include predicted activities of the regulators across the samples, their genome-wide targets, enriched gene categories among the targets, and direct interactions between the regulators. Applying ISMARA to data sets from well-studied systems, we show that it consistently identifies known key regulators ab initio. We also present a number of novel predictions including regulatory interactions in innate immunity, a master regulator of mucociliary differentiation, TFs consistently disregulated in cancer, and TFs that mediate specific chromatin modifications.
Collapse
Affiliation(s)
- Piotr J Balwierz
- Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, CH-4056 Basel, Switzerland
| | | | | | | | | | | |
Collapse
|
50
|
Luisier R, Unterberger EB, Goodman JI, Schwarz M, Moggs J, Terranova R, van Nimwegen E. Computational modeling identifies key gene regulatory interactions underlying phenobarbital-mediated tumor promotion. Nucleic Acids Res 2014; 42:4180-95. [PMID: 24464994 PMCID: PMC3985636 DOI: 10.1093/nar/gkt1415] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis.
Collapse
Affiliation(s)
- Raphaëlle Luisier
- Discovery and Investigative Safety, Novartis Institutes for Biomedical Research, 4057 Basel, Switzerland, Department of Toxicology, Institute of Experimental and Clinical Pharmacology and Toxicology, University of Tübingen, 72074 Tübingen, Germany, Department of Pharmacology and Toxicology, Michigan State University, MI 48824, USA and Biozentrum, University of Basel and Swiss Institute of Bioinformatics, 4056 Basel, Switzerland
| | | | | | | | | | | | | |
Collapse
|