1
|
Foroozandeh Shahraki M, Farahbod M, Libbrecht MW. Robust chromatin state annotation. Genome Res 2024; 34:469-483. [PMID: 38514204 PMCID: PMC11067878 DOI: 10.1101/gr.278343.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 03/19/2024] [Indexed: 03/23/2024]
Abstract
With the goal of mapping genomic activity, international projects have recently measured epigenetic activity in hundreds of cell and tissue types. Chromatin state annotations produced by segmentation and genome annotation (SAGA) methods have emerged as the predominant way to summarize these epigenomic data sets in order to annotate the genome. These chromatin state annotations are essential for many genomic tasks, including identifying active regulatory elements and interpreting disease-associated genetic variation. However, despite the widespread applications of SAGA methods, no principled approach exists to evaluate the statistical significance of chromatin state assignments. Here, we propose the first method for assigning calibrated confidence scores to chromatin state annotations. Toward this goal, we performed a comprehensive evaluation of the reproducibility of the two most widely used existing SAGA methods, ChromHMM and Segway. We found that their predictions are frequently irreproducible. For example, when applying the same SAGA method on two sets of experimental replicates, 27%-69% of predicted enhancers fail to replicate. This suggests that a substantial fraction of predicted elements in existing chromatin state annotations cannot be relied upon. To remedy this problem, we introduce SAGAconf, a method for assigning a measure of confidence (r-value) to chromatin state annotations. SAGAconf works with any SAGA method and assigns an r-value to each genomic bin of a chromatin state annotation that represents the probability that the label of this bin will be reproduced in a replicated experiment. Thus, SAGAconf allows a researcher to select only the reliable predictions from a chromatin annotation for use in downstream analyses.
Collapse
Affiliation(s)
| | - Marjan Farahbod
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia V51 1S6, Canada
| | - Maxwell W Libbrecht
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia V51 1S6, Canada
| |
Collapse
|
2
|
Xiang G, He X, Giardine BM, Isaac KJ, Taylor DJ, McCoy RC, Jansen C, Keller CA, Wixom AQ, Cockburn A, Miller A, Qi Q, He Y, Li Y, Lichtenberg J, Heuston EF, Anderson SM, Luan J, Vermunt MW, Yue F, Sauria ME, Schatz MC, Taylor J, Göttgens B, Hughes JR, Higgs DR, Weiss MJ, Cheng Y, Blobel GA, Bodine DM, Zhang Y, Li Q, Mahony S, Hardison RC. Interspecies regulatory landscapes and elements revealed by novel joint systematic integration of human and mouse blood cell epigenomes. bioRxiv 2024:2023.04.02.535219. [PMID: 37066352 PMCID: PMC10103973 DOI: 10.1101/2023.04.02.535219] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Knowledge of locations and activities of cis-regulatory elements (CREs) is needed to decipher basic mechanisms of gene regulation and to understand the impact of genetic variants on complex traits. Previous studies identified candidate CREs (cCREs) using epigenetic features in one species, making comparisons difficult between species. In contrast, we conducted an interspecies study defining epigenetic states and identifying cCREs in blood cell types to generate regulatory maps that are comparable between species, using integrative modeling of eight epigenetic features jointly in human and mouse in our Validated Systematic Integration (VISION) Project. The resulting catalogs of cCREs are useful resources for further studies of gene regulation in blood cells, indicated by high overlap with known functional elements and strong enrichment for human genetic variants associated with blood cell phenotypes. The contribution of each epigenetic state in cCREs to gene regulation, inferred from a multivariate regression, was used to estimate epigenetic state Regulatory Potential (esRP) scores for each cCRE in each cell type, which were used to categorize dynamic changes in cCREs. Groups of cCREs displaying similar patterns of regulatory activity in human and mouse cell types, obtained by joint clustering on esRP scores, harbored distinctive transcription factor binding motifs that were similar between species. An interspecies comparison of cCREs revealed both conserved and species-specific patterns of epigenetic evolution. Finally, we showed that comparisons of the epigenetic landscape between species can reveal elements with similar roles in regulation, even in the absence of genomic sequence alignment.
Collapse
Affiliation(s)
- Guanjue Xiang
- Bioinformatics and Genomics Graduate Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA 02215
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02215
| | - Xi He
- Bioinformatics and Genomics Graduate Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Belinda M. Giardine
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - Kathryn J. Isaac
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
| | - Dylan J. Taylor
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
| | - Rajiv C. McCoy
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
| | - Camden Jansen
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - Cheryl A. Keller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - Alexander Q. Wixom
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - April Cockburn
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - Amber Miller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
| | - Qian Qi
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN 38105
| | - Yanghua He
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN 38105
- Department of Human Nutrition, Food and Animal Sciences, University of Hawai`i at Mānoa, Honolulu, HI 96822, USA
| | - Yichao Li
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN 38105
| | - Jens Lichtenberg
- Genetics and Molecular Biology Branch, National Human Genome Research Institute, Bethesda, MD 20892
| | - Elisabeth F. Heuston
- Genetics and Molecular Biology Branch, National Human Genome Research Institute, Bethesda, MD 20892
| | - Stacie M. Anderson
- Flow Cytometry Core, National Human Genome Research Institute, Bethesda, MD 20892
| | - Jing Luan
- Department of Pediatrics, Children’s Hospital of Philadelphia, and Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - Marit W. Vermunt
- Department of Pediatrics, Children’s Hospital of Philadelphia, and Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - Feng Yue
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Evanston, IL 60611
| | - Michael E.G. Sauria
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218
| | - James Taylor
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218
| | - Berthold Göttgens
- Welcome and MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
| | - Jim R. Hughes
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford, UK
| | - Douglas R. Higgs
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford, UK
| | - Mitchell J. Weiss
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN 38105
| | - Yong Cheng
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN 38105
| | - Gerd A. Blobel
- Department of Pediatrics, Children’s Hospital of Philadelphia, and Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - David M. Bodine
- Genetics and Molecular Biology Branch, National Human Genome Research Institute, Bethesda, MD 20892
| | - Yu Zhang
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802
| | - Qunhua Li
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802
- Center for Computational Biology and Bioinformatics, Genome Sciences Institute, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Shaun Mahony
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
- Center for Computational Biology and Bioinformatics, Genome Sciences Institute, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
- Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802
| | - Ross C. Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802
- Center for Computational Biology and Bioinformatics, Genome Sciences Institute, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
- Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802
| |
Collapse
|
3
|
Sabarís G, Ortíz DM, Laiker I, Mayansky I, Naik S, Cavalli G, Stern DL, Preger-Ben Noon E, Frankel N. The Density of Regulatory Information Is a Major Determinant of Evolutionary Constraint on Noncoding DNA in Drosophila. Mol Biol Evol 2024; 41:msae004. [PMID: 38364113 PMCID: PMC10871701 DOI: 10.1093/molbev/msae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 11/26/2023] [Accepted: 01/05/2024] [Indexed: 02/18/2024] Open
Abstract
Evolutionary analyses have estimated that ∼60% of nucleotides in intergenic regions of the Drosophila melanogaster genome are functionally relevant, suggesting that regulatory information may be encoded more densely in intergenic regions than has been revealed by most functional dissections of regulatory DNA. Here, we approached this issue through a functional dissection of the regulatory region of the gene shavenbaby (svb). Most of the ∼90 kb of this large regulatory region is highly conserved in the genus Drosophila, though characterized enhancers occupy a small fraction of this region. By analyzing the regulation of svb in different contexts of Drosophila development, we found that the regulatory information that drives svb expression in the abdominal pupal epidermis is organized in a different way than the elements that drive svb expression in the embryonic epidermis. While in the embryonic epidermis svb is activated by compact enhancers separated by large inactive DNA regions, svb expression in the pupal epidermis is driven by regulatory information distributed over broader regions of svb cis-regulatory DNA. In the same vein, we observed that other developmental genes also display a dense distribution of putative regulatory elements in their regulatory regions. Furthermore, we found that a large percentage of conserved noncoding DNA of the Drosophila genome is contained within regions of open chromatin. These results suggest that part of the evolutionary constraint on noncoding DNA of Drosophila is explained by the density of regulatory information, which may be greater than previously appreciated.
Collapse
Affiliation(s)
- Gonzalo Sabarís
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
- Institute of Human Genetics, UMR 9002 CNRS-Université de Montpellier, Montpellier, France
| | - Daniela M Ortíz
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Ian Laiker
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Ignacio Mayansky
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| | - Sujay Naik
- Department of Genetics and Developmental Biology, The Rappaport Faculty of Medicine and Research Institute, Technion—Israel Institute of Technology, Haifa 3109601, Israel
| | - Giacomo Cavalli
- Institute of Human Genetics, UMR 9002 CNRS-Université de Montpellier, Montpellier, France
| | - David L Stern
- Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20147, USA
| | - Ella Preger-Ben Noon
- Department of Genetics and Developmental Biology, The Rappaport Faculty of Medicine and Research Institute, Technion—Israel Institute of Technology, Haifa 3109601, Israel
| | - Nicolás Frankel
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
- Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales (FCEN), Universidad de Buenos Aires (UBA), Buenos Aires 1428, Argentina
| |
Collapse
|
4
|
Xiang G, Guo Y, Bumcrot D, Sigova A. JMnorm: a novel joint multi-feature normalization method for integrative and comparative epigenomics. Nucleic Acids Res 2024; 52:e11. [PMID: 38055833 PMCID: PMC10810286 DOI: 10.1093/nar/gkad1146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/25/2023] [Accepted: 11/14/2023] [Indexed: 12/08/2023] Open
Abstract
Combinatorial patterns of epigenetic features reflect transcriptional states and functions of genomic regions. While many epigenetic features have correlated relationships, most existing data normalization approaches analyze each feature independently. Such strategies may distort relationships between functionally correlated epigenetic features and hinder biological interpretation. We present a novel approach named JMnorm that simultaneously normalizes multiple epigenetic features across cell types, species, and experimental conditions by leveraging information from partially correlated epigenetic features. We demonstrate that JMnorm-normalized data can better preserve cross-epigenetic-feature correlations across different cell types and enhance consistency between biological replicates than data normalized by other methods. Additionally, we show that JMnorm-normalized data can consistently improve the performance of various downstream analyses, which include candidate cis-regulatory element clustering, cross-cell-type gene expression prediction, detection of transcription factor binding and changes upon perturbations. These findings suggest that JMnorm effectively minimizes technical noise while preserving true biologically significant relationships between epigenetic datasets. We anticipate that JMnorm will enhance integrative and comparative epigenomics.
Collapse
Affiliation(s)
- Guanjue Xiang
- CAMP4 Therapeutics Corp., One Kendall Square, Building 1400 West, Cambridge, MA 02139, USA
| | - Yuchun Guo
- CAMP4 Therapeutics Corp., One Kendall Square, Building 1400 West, Cambridge, MA 02139, USA
| | - David Bumcrot
- CAMP4 Therapeutics Corp., One Kendall Square, Building 1400 West, Cambridge, MA 02139, USA
| | - Alla Sigova
- CAMP4 Therapeutics Corp., One Kendall Square, Building 1400 West, Cambridge, MA 02139, USA
| |
Collapse
|
5
|
Tam PLF, Cheung MF, Chan LY, Leung D. Cell-type differential targeting of SETDB1 prevents aberrant CTCF binding, chromatin looping, and cis-regulatory interactions. Nat Commun 2024; 15:15. [PMID: 38167730 PMCID: PMC10762014 DOI: 10.1038/s41467-023-44578-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 12/19/2023] [Indexed: 01/05/2024] Open
Abstract
SETDB1 is an essential histone methyltransferase that deposits histone H3 lysine 9 trimethylation (H3K9me3) to transcriptionally repress genes and repetitive elements. The function of differential H3K9me3 enrichment between cell-types remains unclear. Here, we demonstrate mutual exclusivity of H3K9me3 and CTCF across mouse tissues from different developmental timepoints. We analyze SETDB1 depleted cells and discover that H3K9me3 prevents aberrant CTCF binding independently of DNA methylation and H3K9me2. Such sites are enriched with SINE B2 retrotransposons. Moreover, analysis of higher-order genome architecture reveals that large chromatin structures including topologically associated domains and subnuclear compartments, remain intact in SETDB1 depleted cells. However, chromatin loops and local 3D interactions are disrupted, leading to transcriptional changes by modifying pre-existing chromatin landscapes. Specific genes with altered expression show differential interactions with dysregulated cis-regulatory elements. Collectively, we find that cell-type specific targets of SETDB1 maintain cellular identities by modulating CTCF binding, which shape nuclear architecture and transcriptomic networks.
Collapse
Affiliation(s)
- Phoebe Lut Fei Tam
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China
| | - Ming Fung Cheung
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China
| | - Lu Yan Chan
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China
| | - Danny Leung
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China.
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, SAR, China.
| |
Collapse
|
6
|
Batra SS, Cabrera A, Spence JP, Hilton IB, Song YS. Predicting the effect of CRISPR-Cas9-based epigenome editing. bioRxiv 2023:2023.10.03.560674. [PMID: 37873127 PMCID: PMC10592942 DOI: 10.1101/2023.10.03.560674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Epigenetic regulation orchestrates mammalian transcription, but functional links between them remain elusive. To tackle this problem, we here use epigenomic and transcriptomic data from 13 ENCODE cell types to train machine learning models to predict gene expression from histone post-translational modifications (PTMs), achieving transcriptome-wide correlations of ~ 0.70 - 0.79 for most samples. In addition to recapitulating known associations between histone PTMs and expression patterns, our models predict that acetylation of histone subunit H3 lysine residue 27 (H3K27ac) near the transcription start site (TSS) significantly increases expression levels. To validate this prediction experimentally and investigate how engineered vs. natural deposition of H3K27ac might differentially affect expression, we apply the synthetic dCas9-p300 histone acetyltransferase system to 8 genes in the HEK293T cell line. Further, to facilitate model building, we perform MNase-seq to map genome-wide nucleosome occupancy levels in HEK293T. We observe that our models perform well in accurately ranking relative fold changes among genes in response to the dCas9-p300 system; however, their ability to rank fold changes within individual genes is noticeably diminished compared to predicting expression across cell types from their native epigenetic signatures. Our findings highlight the need for more comprehensive genome-scale epigenome editing datasets, better understanding of the actual modifications made by epigenome editing tools, and improved causal models that transfer better from endogenous cellular measurements to perturbation experiments. Together these improvements would facilitate the ability to understand and predictably control the dynamic human epigenome with consequences for human health.
Collapse
Affiliation(s)
| | | | | | - Isaac B. Hilton
- Department of Bioengineering, Rice University
- Department of BioSciences, Rice University
| | - Yun S. Song
- Computer Science Division, University of California, Berkeley
- Department of Statistics, University of California, Berkeley
| |
Collapse
|
7
|
Walker M, Li Y, Morales-Hernandez A, Qi Q, Parupalli C, Brown S, Christian C, Clements WK, Cheng Y, McKinney-Freeman S. An NFIX-mediated regulatory network governs the balance of hematopoietic stem and progenitor cells during hematopoiesis. Blood Adv 2023; 7:4677-4689. [PMID: 36478187 PMCID: PMC10468369 DOI: 10.1182/bloodadvances.2022007811] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Revised: 10/07/2022] [Accepted: 11/09/2022] [Indexed: 12/12/2022] Open
Abstract
The transcription factor (TF) nuclear factor I-X (NFIX) is a positive regulator of hematopoietic stem and progenitor cell (HSPC) transplantation. Nfix-deficient HSPCs exhibit a severe loss of repopulating activity, increased apoptosis, and a loss of colony-forming potential. However, the underlying mechanism remains elusive. Here, we performed cellular indexing of transcriptomes and epitopes by high-throughput sequencing (CITE-seq) on Nfix-deficient HSPCs and observed a loss of long-term hematopoietic stem cells and an accumulation of megakaryocyte and myelo-erythroid progenitors. The genome-wide binding profile of NFIX in primitive murine hematopoietic cells revealed its colocalization with other hematopoietic TFs, such as PU.1. We confirmed the physical interaction between NFIX and PU.1 and demonstrated that the 2 TFs co-occupy super-enhancers and regulate genes implicated in cellular respiration and hematopoietic differentiation. In addition, we provide evidence suggesting that the absence of NFIX negatively affects PU.1 binding at some genomic loci. Our data support a model in which NFIX collaborates with PU.1 at super-enhancers to promote the differentiation and homeostatic balance of hematopoietic progenitors.
Collapse
Affiliation(s)
- Megan Walker
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | - Yichao Li
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | | | - Qian Qi
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | | | - Scott Brown
- Department of Immunology, St. Jude Children’s Research Hospital, Memphis, TN
| | - Claiborne Christian
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | - Wilson K. Clements
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | - Yong Cheng
- Department of Hematology, St. Jude Children’s Research Hospital, Memphis, TN
| | | |
Collapse
|
8
|
Gombos M, Raynaud C, Nomoto Y, Molnár E, Brik-Chaouche R, Takatsuka H, Zaki A, Bernula D, Latrasse D, Mineta K, Nagy F, He X, Iwakawa H, Őszi E, An J, Suzuki T, Papdi C, Bergis C, Benhamed M, Bögre L, Ito M, Magyar Z. The canonical E2Fs together with RETINOBLASTOMA-RELATED are required to establish quiescence during plant development. Commun Biol 2023; 6:903. [PMID: 37666980 PMCID: PMC10477330 DOI: 10.1038/s42003-023-05259-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 08/18/2023] [Indexed: 09/06/2023] Open
Abstract
Maintaining stable and transient quiescence in differentiated and stem cells, respectively, requires repression of the cell cycle. The plant RETINOBLASTOMA-RELATED (RBR) has been implicated in stem cell maintenance, presumably by forming repressor complexes with E2F transcription factors. Surprisingly we find that mutations in all three canonical E2Fs do not hinder the cell cycle, but similarly to RBR silencing, result in hyperplasia. Contrary to the growth arrest that occurs when exit from proliferation to differentiation is inhibited upon RBR silencing, the e2fabc mutant develops enlarged organs with supernumerary stem and differentiated cells as quiescence is compromised. While E2F, RBR and the M-phase regulatory MYB3Rs are part of the DREAM repressor complexes, and recruited to overlapping groups of targets, they regulate distinct sets of genes. Only the loss of E2Fs but not the MYB3Rs interferes with quiescence, which might be due to the ability of E2Fs to control both G1-S and some key G2-M targets. We conclude that collectively the three canonical E2Fs in complex with RBR have central roles in establishing cellular quiescence during organ development, leading to enhanced plant growth.
Collapse
Affiliation(s)
- Magdolna Gombos
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary
| | - Cécile Raynaud
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Yuji Nomoto
- School of Biological Science and Technology, College of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Eszter Molnár
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary
| | - Rim Brik-Chaouche
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Hirotomo Takatsuka
- School of Biological Science and Technology, College of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Ahmad Zaki
- Royal Holloway, University of London, Department of Biological Sciences, Egham, Surrey, TW20 0EX, UK
| | - Dóra Bernula
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary
| | - David Latrasse
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Keito Mineta
- School of Biological Science and Technology, College of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Fruzsina Nagy
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary
- Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, H-6726, Szeged, Hungary
| | - Xiaoning He
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Hidekazu Iwakawa
- School of Biological Science and Technology, College of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Erika Őszi
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary
| | - Jing An
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Takamasa Suzuki
- College of Bioscience and Biotechnology, Chubu University, Kasugai, Aichi, 487-8501, Japan
| | - Csaba Papdi
- Royal Holloway, University of London, Department of Biological Sciences, Egham, Surrey, TW20 0EX, UK
| | - Clara Bergis
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - Moussa Benhamed
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
- Université de Paris Cité, CNRS, INRAE, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif sur Yvette, France
| | - László Bögre
- Royal Holloway, University of London, Department of Biological Sciences, Egham, Surrey, TW20 0EX, UK
| | - Masaki Ito
- School of Biological Science and Technology, College of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Zoltán Magyar
- Institute of Plant Biology, Biological Research Centre, H-6726, Szeged, Hungary.
| |
Collapse
|
9
|
Gao L, Mathur V, Tam SKM, Zhou X, Cheung MF, Chan LY, Estrada-Gutiérrez G, Leung BW, Moungmaithong S, Wang CC, Poon LC, Leung D. Single-cell analysis reveals transcriptomic and epigenomic impacts on the maternal-fetal interface following SARS-CoV-2 infection. Nat Cell Biol 2023:10.1038/s41556-023-01169-x. [PMID: 37400500 PMCID: PMC10344786 DOI: 10.1038/s41556-023-01169-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 05/22/2023] [Indexed: 07/05/2023]
Abstract
During pregnancy the maternal-fetal interface plays vital roles in fetal development. Its disruption is frequently found in pregnancy complications. Recent studies show increased incidences of adverse pregnancy outcomes in patients with COVID-19; however, the mechanism remains unclear. Here we analysed the molecular impacts of SARS-CoV-2 infection on the maternal-fetal interface. Generating bulk and single-nucleus transcriptomic and epigenomic profiles from patients with COVID-19 and control samples, we discovered aberrant immune activation and angiogenesis patterns in distinct cells from patients. Surprisingly, retrotransposons were also dysregulated in specific cell types. Notably, reduced enhancer activities of LTR8B elements were functionally linked to the downregulation of pregnancy-specific glycoprotein genes in syncytiotrophoblasts. Our findings revealed that SARS-CoV-2 infection induced substantial changes to the epigenome and transcriptome at the maternal-fetal interface, which may be associated with pregnancy complications.
Collapse
Grants
- GRF16103721 Research Grants Council, University Grants Committee (RGC, UGC)
- GRF16103721 Research Grants Council, University Grants Committee (RGC, UGC)
- GRF16103721 Research Grants Council, University Grants Committee (RGC, UGC)
- CRF C5045-20EF Research Grants Council, University Grants Committee (RGC, UGC)
- CRF C5045-20EF Research Grants Council, University Grants Committee (RGC, UGC)
- CRF C5045-20EF Research Grants Council, University Grants Committee (RGC, UGC)
- CRF C5045-20EF Research Grants Council, University Grants Committee (RGC, UGC)
- CUHK 2020.053 Chinese University of Hong Kong (CUHK)
- CUHK 2020.053 Chinese University of Hong Kong (CUHK)
- CUHK 2020.053 Chinese University of Hong Kong (CUHK)
- CUHK 2020.053 Chinese University of Hong Kong (CUHK)
Collapse
Affiliation(s)
- Lin Gao
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | - Vrinda Mathur
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | - Sabrina Ka Man Tam
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | - Xuemeng Zhou
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | - Ming Fung Cheung
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | - Lu Yan Chan
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China
| | | | - Bo Wah Leung
- Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Sakita Moungmaithong
- Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Chi Chiu Wang
- Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Shatin, Hong Kong, China
- Li Ka Shing Institute of Health Sciences; School of Biomedical Sciences and The Chinese University of Hong Kong-Sichuan University Joint Laboratory in Reproductive Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong, China
| | - Liona C Poon
- Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Shatin, Hong Kong, China.
| | - Danny Leung
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China.
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China.
| |
Collapse
|
10
|
Schreiber J, Boix C, Wook Lee J, Li H, Guan Y, Chang CC, Chang JC, Hawkins-Hooker A, Schölkopf B, Schweikert G, Carulla MR, Canakoglu A, Guzzo F, Nanni L, Masseroli M, Carman MJ, Pinoli P, Hong C, Yip KY, Spence JP, Batra SS, Song YS, Mahony S, Zhang Z, Tan W, Shen Y, Sun Y, Shi M, Adrian J, Sandstrom R, Farrell N, Halow J, Lee K, Jiang L, Yang X, Epstein C, Strattan JS, Bernstein B, Snyder M, Kellis M, Stafford W, Kundaje A. The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles. Genome Biol 2023; 24:79. [PMID: 37072822 PMCID: PMC10111747 DOI: 10.1186/s13059-023-02915-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 03/24/2023] [Indexed: 04/20/2023] Open
Abstract
A promising alternative to comprehensively performing genomics experiments is to, instead, perform a subset of experiments and use computational methods to impute the remainder. However, identifying the best imputation methods and what measures meaningfully evaluate performance are open questions. We address these questions by comprehensively analyzing 23 methods from the ENCODE Imputation Challenge. We find that imputation evaluations are challenging and confounded by distributional shifts from differences in data collection and processing over time, the amount of available data, and redundancy among performance measures. Our analyses suggest simple steps for overcoming these issues and promising directions for more robust research.
Collapse
Affiliation(s)
| | - Carles Boix
- Stanford University School of Medicine, Stanford, CA, USA
| | - Jin Wook Lee
- Stanford University School of Medicine, Stanford, CA, USA
| | - Hongyang Li
- Stanford University School of Medicine, Stanford, CA, USA
| | - Yuanfang Guan
- Stanford University School of Medicine, Stanford, CA, USA
| | | | | | | | | | | | | | - Arif Canakoglu
- Stanford University School of Medicine, Stanford, CA, USA
| | | | - Luca Nanni
- Stanford University School of Medicine, Stanford, CA, USA
| | | | | | - Pietro Pinoli
- Stanford University School of Medicine, Stanford, CA, USA
| | - Chenyang Hong
- Stanford University School of Medicine, Stanford, CA, USA
| | - Kevin Y Yip
- Stanford University School of Medicine, Stanford, CA, USA
| | | | | | - Yun S Song
- Stanford University School of Medicine, Stanford, CA, USA
| | - Shaun Mahony
- Stanford University School of Medicine, Stanford, CA, USA
| | - Zheng Zhang
- Stanford University School of Medicine, Stanford, CA, USA
| | - Wuwei Tan
- Stanford University School of Medicine, Stanford, CA, USA
| | - Yang Shen
- Stanford University School of Medicine, Stanford, CA, USA
| | - Yuanfei Sun
- Stanford University School of Medicine, Stanford, CA, USA
| | - Minyi Shi
- Stanford University School of Medicine, Stanford, CA, USA
| | - Jessika Adrian
- Stanford University School of Medicine, Stanford, CA, USA
| | | | - Nina Farrell
- Stanford University School of Medicine, Stanford, CA, USA
| | - Jessica Halow
- Stanford University School of Medicine, Stanford, CA, USA
| | - Kristen Lee
- Stanford University School of Medicine, Stanford, CA, USA
| | - Lixia Jiang
- Stanford University School of Medicine, Stanford, CA, USA
| | - Xinqiong Yang
- Stanford University School of Medicine, Stanford, CA, USA
| | | | | | | | - Michael Snyder
- Stanford University School of Medicine, Stanford, CA, USA
| | - Manolis Kellis
- Stanford University School of Medicine, Stanford, CA, USA
| | | | - Anshul Kundaje
- Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
11
|
Xiang G, Giardine B, An L, Sun C, Keller CA, Heuston EF, Anderson SM, Kirby M, Bodine D, Zhang Y, Hardison RC. Snapshot: a package for clustering and visualizing epigenetic history during cell differentiation. BMC Bioinformatics 2023; 24:102. [PMID: 36941541 PMCID: PMC10026520 DOI: 10.1186/s12859-023-05223-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 03/07/2023] [Indexed: 03/23/2023] Open
Abstract
BACKGROUND Epigenetic modification of chromatin plays a pivotal role in regulating gene expression during cell differentiation. The scale and complexity of epigenetic data pose significant challenges for biologists to identify the regulatory events controlling cell differentiation. RESULTS To reduce the complexity, we developed a package, called Snapshot, for clustering and visualizing candidate cis-regulatory elements (cCREs) based on their epigenetic signals during cell differentiation. This package first introduces a binarized indexing strategy for clustering the cCREs. It then provides a series of easily interpretable figures for visualizing the signal and epigenetic state patterns of the cCREs clusters during the cell differentiation. It can also use different hierarchies of cell types to highlight the epigenetic history specific to any particular cell lineage. We demonstrate the utility of Snapshot using data from a consortium project for ValIdated Systematic IntegratiON (VISION) of epigenomic data in hematopoiesis. CONCLUSION The package Snapshot can identify all distinct clusters of genomic locations with unique epigenetic signal patterns during cell differentiation. It outperforms other methods in terms of interpreting and reproducing the identified cCREs clusters. The package of Snapshot is available at GitHub: https://github.com/guanjue/Snapshot .
Collapse
Affiliation(s)
- Guanjue Xiang
- The Bioinformatics and Genomics Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA.
| | - Belinda Giardine
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Lin An
- The Bioinformatics and Genomics Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
| | - Chen Sun
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Cheryl A Keller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | | | | | | | - David Bodine
- NHGRI Hematopoiesis Section, GMBB, Bethesda, MD, USA
| | - Yu Zhang
- Department of Statistics, The Pennsylvania State University, University Park, PA, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
12
|
Lee H, Sanidas I, Dyson NJ, Lawrence MS. Chromatin-bound protein colocalization analysis using bedGraph2Cluster and PanChIP. STAR Protoc 2023; 4:101991. [PMID: 36607812 PMCID: PMC9826822 DOI: 10.1016/j.xpro.2022.101991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 11/16/2022] [Accepted: 12/13/2022] [Indexed: 01/06/2023] Open
Abstract
Computational pipelines for chromatin immunoprecipitation sequencing analysis can neglect colocalization events that occur in a mere subset of the genome. Here, we detail a streamlined approach for assessing colocalization of chromatin-bound proteins using the bedGraph2Cluster and PanChIP algorithms. Using histone modifications as an example, bedGraph2Cluster performs clustering analysis on chromatin binding patterns of target proteins. PanChIP then compares these clusters with a reference library of chromatin binding patterns and measures the overlap in peaks, capturing the heterogeneity in chromatin binding and colocalization patterns. For complete details on the use and execution of this protocol, please refer to Sanidas et al. (2022).1.
Collapse
Affiliation(s)
- Hanjun Lee
- Massachusetts General Hospital Cancer Center, Harvard Medical School, 149 13th Street, Charlestown, MA 02129, USA; Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.
| | - Ioannis Sanidas
- Massachusetts General Hospital Cancer Center, Harvard Medical School, 149 13th Street, Charlestown, MA 02129, USA; Department of Molecular and Cellular Biology, Roswell Park Comprehensive Cancer Center, Elm & Carlton Streets, Buffalo, NY 14203, USA
| | - Nicholas J Dyson
- Massachusetts General Hospital Cancer Center, Harvard Medical School, 149 13th Street, Charlestown, MA 02129, USA
| | - Michael S Lawrence
- Massachusetts General Hospital Cancer Center, Harvard Medical School, 149 13th Street, Charlestown, MA 02129, USA; Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA.
| |
Collapse
|
13
|
Yu B, Li P, Zhang QC, Hou L. Differential analysis of RNA structure probing experiments at nucleotide resolution: uncovering regulatory functions of RNA structure. Nat Commun 2022; 13:4227. [PMID: 35869080 PMCID: PMC9307511 DOI: 10.1038/s41467-022-31875-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Accepted: 07/05/2022] [Indexed: 11/09/2022] Open
Abstract
RNAs perform their function by forming specific structures, which can change across cellular conditions. Structure probing experiments combined with next generation sequencing technology have enabled transcriptome-wide analysis of RNA secondary structure in various cellular conditions. Differential analysis of structure probing data in different conditions can reveal the RNA structurally variable regions (SVRs), which is important for understanding RNA functions. Here, we propose DiffScan, a computational framework for normalization and differential analysis of structure probing data in high resolution. DiffScan preprocesses structure probing datasets to remove systematic bias, and then scans the transcripts to identify SVRs and adaptively determines their lengths and locations. The proposed approach is compatible with most structure probing platforms (e.g., icSHAPE, DMS-seq). When evaluated with simulated and benchmark datasets, DiffScan identifies structurally variable regions at nucleotide resolution, with substantial improvement in accuracy compared with existing SVR detection methods. Moreover, the improvement is robust when tested in multiple structure probing platforms. Application of DiffScan in a dataset of multi-subcellular RNA structurome and a subsequent motif enrichment analysis suggest potential links of RNA structural variation and mRNA abundance, possibly mediated by RNA binding proteins such as the serine/arginine rich splicing factors. This work provides an effective tool for differential analysis of RNA secondary structure, reinforcing the power of structure probing experiments in deciphering the dynamic RNA structurome. The authors present DiffScan, an advanced tool for normalization and differential analysis of RNA structure probing experiments, combining their power in deciphering the dynamic RNA structurome and facilitating the discovery of RNA regulatory functions.
Collapse
|
14
|
Feng R, Mayuranathan T, Huang P, Doerfler PA, Li Y, Yao Y, Zhang J, Palmer LE, Mayberry K, Christakopoulos GE, Xu P, Li C, Cheng Y, Blobel GA, Simon MC, Weiss MJ. Activation of γ-globin expression by hypoxia-inducible factor 1α. Nature 2022; 610:783-90. [PMID: 36224385 DOI: 10.1038/s41586-022-05312-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 09/02/2022] [Indexed: 12/24/2022]
Abstract
Around birth, globin expression in human red blood cells (RBCs) shifts from γ-globin to β-globin, which results in fetal haemoglobin (HbF, α2γ2) being gradually replaced by adult haemoglobin (HbA, α2β2)1. This process has motivated the development of innovative approaches to treat sickle cell disease and β-thalassaemia by increasing HbF levels in postnatal RBCs2. Here we provide therapeutically relevant insights into globin gene switching obtained through a CRISPR-Cas9 screen for ubiquitin-proteasome components that regulate HbF expression. In RBC precursors, depletion of the von Hippel-Lindau (VHL) E3 ubiquitin ligase stabilized its ubiquitination target, hypoxia-inducible factor 1α (HIF1α)3,4, to induce γ-globin gene transcription. Mechanistically, HIF1α-HIF1β heterodimers bound cognate DNA elements in BGLT3, a long noncoding RNA gene located 2.7 kb downstream of the tandem γ-globin genes HBG1 and HBG2. This was followed by the recruitment of transcriptional activators, chromatin opening and increased long-range interactions between the γ-globin genes and their upstream enhancer. Similar induction of HbF occurred with hypoxia or with inhibition of prolyl hydroxylase domain enzymes that target HIF1α for ubiquitination by the VHL E3 ubiquitin ligase. Our findings link globin gene regulation with canonical hypoxia adaptation, provide a mechanism for HbF induction during stress erythropoiesis and suggest a new therapeutic approach for β-haemoglobinopathies.
Collapse
|
15
|
Servant N. Bioinformatics Methods for ChIP-seq Histone Analysis. Methods Mol Biol 2022; 2529:267-293. [PMID: 35733020 DOI: 10.1007/978-1-0716-2481-4_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The field of genomics and genome-wide analysis has exploded since around 2008 with the development of high-throughput omics approaches, largely driven by the emergence of the next-generation sequencing technologies. Among the different biological applications supported by recent sequencing technologies, ChIP-seq (Chromatin ImmunoPrecipitation followed by Sequencing) is one of the most powerful techniques which has dramatically changed our view of the epigenetics landscape of cells.In this chapter, I will present and discuss the main steps of bioinformatic and biostatistical analysis of ChIP-seq data (Fig. 1). While this technique has been widely used to study transcription factor binding sites, I will focus here on the analysis of histone modifications.
Collapse
Affiliation(s)
- Nicolas Servant
- Institut Curie, Bioinformatics core facility, Paris, France.
- INSERM U900, Paris, France.
- PSL Research University, Paris, France.
- Mines Paris Tech, Fontainebleau, Paris, France.
| |
Collapse
|
16
|
Abstract
Segmentation and genome annotation (SAGA) algorithms are widely used to understand genome activity and gene regulation. These algorithms take as input epigenomic datasets, such as chromatin immunoprecipitation-sequencing (ChIP-seq) measurements of histone modifications or transcription factor binding. They partition the genome and assign a label to each segment such that positions with the same label exhibit similar patterns of input data. SAGA algorithms discover categories of activity such as promoters, enhancers, or parts of genes without prior knowledge of known genomic elements. In this sense, they generally act in an unsupervised fashion like clustering algorithms, but with the additional simultaneous function of segmenting the genome. Here, we review the common methodological framework that underlies these methods, review variants of and improvements upon this basic framework, and discuss the outlook for future work. This review is intended for those interested in applying SAGA methods and for computational researchers interested in improving upon them.
Collapse
Affiliation(s)
| | - Rachel C. W. Chan
- Department of Computer Science, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Michael M. Hoffman
- Department of Computer Science, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Vector Institute for Artificial Intelligence, Toronto, Canada
| |
Collapse
|
17
|
Doerfler PA, Feng R, Li Y, Palmer LE, Porter SN, Bell HW, Crossley M, Pruett-Miller SM, Cheng Y, Weiss MJ. Activation of γ-globin gene expression by GATA1 and NF-Y in hereditary persistence of fetal hemoglobin. Nat Genet 2021; 53:1177-86. [PMID: 34341563 DOI: 10.1038/s41588-021-00904-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Accepted: 06/25/2021] [Indexed: 11/30/2022]
Abstract
Hereditary persistence of fetal hemoglobin (HPFH) ameliorates β-hemoglobinopathies by inhibiting the developmental switch from γ-globin (HBG1/HBG2) to β-globin (HBB) gene expression. Some forms of HPFH are associated with γ-globin promoter variants that either disrupt binding motifs for transcriptional repressors or create new motifs for transcriptional activators. How these variants sustain γ-globin gene expression postnatally remains undefined. We mapped γ-globin promoter sequences functionally in erythroid cells harboring different HPFH variants. Those that disrupt a BCL11A repressor binding element induce γ-globin expression by facilitating the recruitment of transcription factors NF-Y to a nearby proximal CCAAT box and GATA1 to an upstream motif. The proximal CCAAT element becomes dispensable for HPFH variants that generate new binding motifs for activators NF-Y or KLF1, but GATA1 recruitment remains essential. Our findings define distinct mechanisms through which transcription factors and their cis-regulatory elements activate γ-globin expression in different forms of HPFH, some of which are being recreated by therapeutic genome editing.
Collapse
|
18
|
Bayat F, Libbrecht M. VSS: Variance-stabilized signals for sequencing-based genomic signals. Bioinformatics 2021; 37:4383-4391. [PMID: 34165492 PMCID: PMC8652025 DOI: 10.1093/bioinformatics/btab457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 04/28/2021] [Accepted: 06/17/2021] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION A sequencing-based genomic assay such as ChIP-seq outputs a real-valued signal for each position in the genome that measures the strength of activity at that position. Most genomic signals lack the property of variance stabilization. That is, a difference between 0 and 100 reads usually has a very different statistical importance from a difference between 1,000 and 1,100 reads. A statistical model such as a negative binomial distribution can account for this pattern, but learning these models is computationally challenging. Therefore, many applications - including imputation and segmentation and genome annotation (SAGA) - instead use Gaussian models and use a transformation such as log or inverse hyperbolic sine (asinh) to stabilize variance. RESULTS We show here that existing transformations do not fully stabilize variance in genomic data sets. To solve this issue, we propose VSS, a method that produces variance-stabilized signals for sequencing-based genomic signals. VSS learns the empirical relationship between the mean and variance of a given signal data set and produces transformed signals that normalize for this dependence. We show that VSS successfully stabilizes variance and that doing so improves downstream applications such as SAGA. VSS will eliminate the need for downstream methods to implement complex mean-variance relationship models, and will enable genomic signals to be easily understood by eye. AVAILABILITY https://github.com/faezeh-bayat/VSS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Faezeh Bayat
- Department of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| | - Maxwell Libbrecht
- Department of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| |
Collapse
|
19
|
Xiang G, Giardine BM, Mahony S, Zhang Y, Hardison RC. S3V2-IDEAS: a package for normalizing, denoising and integrating epigenomic datasets across different cell types. Bioinformatics 2021; 37:3011-3013. [PMID: 33681991 PMCID: PMC8479670 DOI: 10.1093/bioinformatics/btab148] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 01/26/2021] [Accepted: 03/01/2021] [Indexed: 02/02/2023] Open
Abstract
SUMMARY Epigenetic modifications reflect key aspects of transcriptional regulation, and many epigenomic datasets have been generated under different biological contexts to provide insights into regulatory processes. However, the technical noise in epigenomic datasets and the many dimensions (features) examined make it challenging to effectively extract biologically meaningful inferences from these datasets. We developed a package that reduces noise while normalizing the epigenomic data by a novel normalization method, followed by integrative dimensional reduction by learning and assigning epigenetic states. This package, called S3V2-IDEAS, can be used to identify epigenetic states for multiple features, or identify discretized signal intensity levels and a master peak list across different cell types for a single feature. We illustrate the outputs and performance of S3V2-IDEAS using 137 epigenomics datasets from the VISION project that provides ValIdated Systematic IntegratiON of epigenomic data in hematopoiesis. AVAILABILITY AND IMPLEMENTATION S3V2-IDEAS pipeline is freely available as open source software released under an MIT license at: https://github.com/guanjue/S3V2_IDEAS_ESMP. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Guanjue Xiang
- The Bioinformatics and Genomics Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA
- To whom correspondence should be addressed. or
| | - Belinda M Giardine
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Shaun Mahony
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Yu Zhang
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
- To whom correspondence should be addressed. or
| |
Collapse
|
20
|
Luan J, Xiang G, Gómez-García PA, Tome JM, Zhang Z, Vermunt MW, Zhang H, Huang A, Keller CA, Giardine BM, Zhang Y, Lan Y, Lis JT, Lakadamyali M, Hardison RC, Blobel GA. Distinct properties and functions of CTCF revealed by a rapidly inducible degron system. Cell Rep 2021; 34:108783. [PMID: 33626344 PMCID: PMC7999233 DOI: 10.1016/j.celrep.2021.108783] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Revised: 11/25/2020] [Accepted: 02/02/2021] [Indexed: 02/07/2023] Open
Abstract
CCCTC-binding factor (CTCF) is a conserved zinc finger transcription factor implicated in a wide range of functions, including genome organization, transcription activation, and elongation. To explore the basis for CTCF functional diversity, we coupled an auxin-induced degron system with precision nuclear run-on. Unexpectedly, oriented CTCF motifs in gene bodies are associated with transcriptional stalling in a manner independent of bound CTCF. Moreover, CTCF at different binding sites (CBSs) displays highly variable resistance to degradation. Motif sequence does not significantly predict degradation behavior, but location at chromatin boundaries and chromatin loop anchors, as well as co-occupancy with cohesin, are associated with delayed degradation. Single-molecule tracking experiments link chromatin residence time to CTCF degradation kinetics, which has ramifications regarding architectural CTCF functions. Our study highlights the heterogeneity of CBSs, uncovers properties specific to architecturally important CBSs, and provides insights into the basic processes of genome organization and transcription regulation.
Collapse
Affiliation(s)
- Jing Luan
- Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Guanjue Xiang
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Pablo Aurelio Gómez-García
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Jacob M Tome
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Zhe Zhang
- Department of Biomedical and Health Informatics, Children's Hospital of Pennsylvania, Philadelphia, PA, USA
| | - Marit W Vermunt
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
| | - Haoyue Zhang
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
| | - Anran Huang
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
| | - Cheryl A Keller
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Belinda M Giardine
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Yu Zhang
- Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA
| | - Yemin Lan
- Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - John T Lis
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Melike Lakadamyali
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Gerd A Blobel
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
21
|
Xiang G, Keller CA, Heuston E, Giardine BM, An L, Wixom AQ, Miller A, Cockburn A, Sauria MEG, Weaver K, Lichtenberg J, Göttgens B, Li Q, Bodine D, Mahony S, Taylor J, Blobel GA, Weiss MJ, Cheng Y, Yue F, Hughes J, Higgs DR, Zhang Y, Hardison RC. An integrative view of the regulatory and transcriptional landscapes in mouse hematopoiesis. Genome Res 2020; 30:472-484. [PMID: 32132109 PMCID: PMC7111515 DOI: 10.1101/gr.255760.119] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 02/21/2020] [Indexed: 01/29/2023]
Abstract
Thousands of epigenomic data sets have been generated in the past decade, but it is difficult for researchers to effectively use all the data relevant to their projects. Systematic integrative analysis can help meet this need, and the VISION project was established for validated systematic integration of epigenomic data in hematopoiesis. Here, we systematically integrated extensive data recording epigenetic features and transcriptomes from many sources, including individual laboratories and consortia, to produce a comprehensive view of the regulatory landscape of differentiating hematopoietic cell types in mouse. By using IDEAS as our integrative and discriminative epigenome annotation system, we identified and assigned epigenetic states simultaneously along chromosomes and across cell types, precisely and comprehensively. Combining nuclease accessibility and epigenetic states produced a set of more than 200,000 candidate cis-regulatory elements (cCREs) that efficiently capture enhancers and promoters. The transitions in epigenetic states of these cCREs across cell types provided insights into mechanisms of regulation, including decreases in numbers of active cCREs during differentiation of most lineages, transitions from poised to active or inactive states, and shifts in nuclease accessibility of CTCF-bound elements. Regression modeling of epigenetic states at cCREs and gene expression produced a versatile resource to improve selection of cCREs potentially regulating target genes. These resources are available from our VISION website to aid research in genomics and hematopoiesis.
Collapse
Affiliation(s)
- Guanjue Xiang
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Cheryl A Keller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Elisabeth Heuston
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Belinda M Giardine
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Lin An
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Alexander Q Wixom
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Amber Miller
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - April Cockburn
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Michael E G Sauria
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Kathryn Weaver
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Jens Lichtenberg
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Berthold Göttgens
- Welcome and MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 1TN, United Kingdom
| | - Qunhua Li
- Department of Statistics, Program in Bioinformatics and Genomics, Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - David Bodine
- NHGRI Hematopoiesis Section, Genetics and Molecular Biology Branch, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Shaun Mahony
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - James Taylor
- Departments of Biology and Computer Science, Johns Hopkins University, Baltimore, Maryland 20218, USA
| | - Gerd A Blobel
- Department of Pediatrics, Children's Hospital of Philadelphia and University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | - Mitchell J Weiss
- Department of Hematology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Yong Cheng
- Department of Hematology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Feng Yue
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Jim Hughes
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford OX3 9DS, United Kingdom
| | - Douglas R Higgs
- MRC Weatherall Institute of Molecular Medicine, Oxford University, Oxford OX3 9DS, United Kingdom
| | - Yu Zhang
- Department of Statistics, Program in Bioinformatics and Genomics, Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|