1
|
Reuter LM, Khadayate SP, Mossler A, Liebl K, Faull SV, Karimi MM, Speck C. MCM2-7 loading-dependent ORC release ensures genome-wide origin licensing. Nat Commun 2024; 15:7306. [PMID: 39181881 PMCID: PMC11344781 DOI: 10.1038/s41467-024-51538-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/09/2024] [Indexed: 08/27/2024] Open
Abstract
Origin recognition complex (ORC)-dependent loading of the replicative helicase MCM2-7 onto replication origins in G1-phase forms the basis of replication fork establishment in S-phase. However, how ORC and MCM2-7 facilitate genome-wide DNA licensing is not fully understood. Mapping the molecular footprints of budding yeast ORC and MCM2-7 genome-wide, we discovered that MCM2-7 loading is associated with ORC release from origins and redistribution to non-origin sites. Our bioinformatic analysis revealed that origins are compact units, where a single MCM2-7 double hexamer blocks repetitive loading through steric ORC binding site occlusion. Analyses of A-elements and an improved B2-element consensus motif uncovered that DNA shape, DNA flexibility, and the correct, face-to-face spacing of the two DNA elements are hallmarks of ORC-binding and efficient helicase loading sites. Thus, our work identified fundamental principles for MCM2-7 helicase loading that explain how origin licensing is realised across the genome.
Collapse
Affiliation(s)
- L Maximilian Reuter
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom.
- Institute of Molecular Biology (IMB) gGmbH, Ackermannweg 4, Mainz, Germany.
| | | | - Audrey Mossler
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Korbinian Liebl
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, IL, USA
| | - Sarah V Faull
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Mohammad M Karimi
- MRC London Institute of Medical Sciences (LMS), London, United Kingdom
- Comprehensive Cancer Centre, School of Cancer & Pharmaceutical Sciences, Faculty of Life Sciences & Medicine, King's College London, London, United Kingdom
| | - Christian Speck
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom.
- MRC London Institute of Medical Sciences (LMS), London, United Kingdom.
| |
Collapse
|
2
|
Li NN, Lun DX, Gong N, Meng G, Du XY, Wang H, Bao X, Li XY, Song JW, Hu K, Li L, Li SY, Liu W, Zhu W, Zhang Y, Li J, Yao T, Mou L, Han X, Hao F, Hu Y, Liu L, Zhu H, Wu Y, Liu B. Targeting the chromatin structural changes of antitumor immunity. J Pharm Anal 2024; 14:100905. [PMID: 38665224 PMCID: PMC11043877 DOI: 10.1016/j.jpha.2023.11.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 09/28/2023] [Accepted: 11/21/2023] [Indexed: 04/28/2024] Open
Abstract
Epigenomic imbalance drives abnormal transcriptional processes, promoting the onset and progression of cancer. Although defective gene regulation generally affects carcinogenesis and tumor suppression networks, tumor immunogenicity and immune cells involved in antitumor responses may also be affected by epigenomic changes, which may have significant implications for the development and application of epigenetic therapy, cancer immunotherapy, and their combinations. Herein, we focus on the impact of epigenetic regulation on tumor immune cell function and the role of key abnormal epigenetic processes, DNA methylation, histone post-translational modification, and chromatin structure in tumor immunogenicity, and introduce these epigenetic research methods. We emphasize the value of small-molecule inhibitors of epigenetic modulators in enhancing antitumor immune responses and discuss the challenges of developing treatment plans that combine epigenetic therapy and immunotherapy through the complex interaction between cancer epigenetics and cancer immunology.
Collapse
Affiliation(s)
- Nian-nian Li
- Weifang People's Hospital, Weifang, Shandong, 261000, China
- School of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Deng-xing Lun
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Ningning Gong
- Weifang Traditional Chinese Medicine Hospital, Weifang, Shandong, 261000, China
| | - Gang Meng
- Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, Shaanxi, 725000, China
| | - Xin-ying Du
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - He Wang
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Xiangxiang Bao
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Xin-yang Li
- Guizhou Education University, Guiyang, 550018, China
| | - Ji-wu Song
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Kewei Hu
- Weifang Traditional Chinese Medicine Hospital, Weifang, Shandong, 261000, China
| | - Lala Li
- Guizhou Normal University, Guiyang, 550025, China
| | - Si-ying Li
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Wenbo Liu
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Wanping Zhu
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Yunlong Zhang
- School of Medical Imaging, Weifang Medical University, Weifang, Shandong, 261053, China
| | - Jikai Li
- Department of Bone and Soft Tissue Oncology, Tianjin Hospital, Tianjin, 300299, China
| | - Ting Yao
- School of Life Sciences, Nankai University, Tianjin, 300071, China
- Teda Institute of Biological Sciences & Biotechnology, Nankai University, Tianjin, 300457, China
| | - Leming Mou
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Xiaoqing Han
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Furong Hao
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Yongcheng Hu
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Lin Liu
- School of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Hongguang Zhu
- Weifang People's Hospital, Weifang, Shandong, 261000, China
| | - Yuyun Wu
- Xinqiao Hospital of Army Military Medical University, Chongqing, 400038, China
| | - Bin Liu
- Weifang People's Hospital, Weifang, Shandong, 261000, China
- School of Life Sciences, Nankai University, Tianjin, 300071, China
- Teda Institute of Biological Sciences & Biotechnology, Nankai University, Tianjin, 300457, China
| |
Collapse
|
3
|
Arora S, Yang J, Akiyama T, James DQ, Morrissey A, Blanda TR, Badjatia N, Lai WK, Ko MS, Pugh BF, Mahony S. Joint sequence & chromatin neural networks characterize the differential abilities of Forkhead transcription factors to engage inaccessible chromatin. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.06.561228. [PMID: 37873361 PMCID: PMC10592618 DOI: 10.1101/2023.10.06.561228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The DNA-binding activities of transcription factors (TFs) are influenced by both intrinsic sequence preferences and extrinsic interactions with cell-specific chromatin landscapes and other regulatory proteins. Disentangling the roles of these binding determinants remains challenging. For example, the FoxA subfamily of Forkhead domain (Fox) TFs are known pioneer factors that can bind to relatively inaccessible sites during development. Yet FoxA TF binding also varies across cell types, pointing to a combination of intrinsic and extrinsic forces guiding their binding. While other Forkhead domain TFs are often assumed to have pioneering abilities, how sequence and chromatin features influence the binding of related Fox TFs has not been systematically characterized. Here, we present a principled approach to compare the relative contributions of intrinsic DNA sequence preference and cell-specific chromatin environments to a TF's DNA-binding activities. We apply our approach to investigate how a selection of Fox TFs (FoxA1, FoxC1, FoxG1, FoxL2, and FoxP3) vary in their binding specificity. We over-express the selected Fox TFs in mouse embryonic stem cells, which offer a platform to contrast each TF's binding activity within the same preexisting chromatin background. By applying a convolutional neural network to interpret the Fox TF binding patterns, we evaluate how sequence and preexisting chromatin features jointly contribute to induced TF binding. We demonstrate that Fox TFs bind different DNA targets, and drive differential gene expression patterns, even when induced in identical chromatin settings. Despite the association between Forkhead domains and pioneering activities, the selected Fox TFs display a wide range of affinities for preexiting chromatin states. Using sequence and chromatin feature attribution techniques to interpret the neural network predictions, we show that differential sequence preferences combined with differential abilities to engage relatively inaccessible chromatin together explain Fox TF binding patterns at individual sites and genome-wide.
Collapse
Affiliation(s)
- Sonny Arora
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Jianyu Yang
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Tomohiko Akiyama
- Department of Systems Medicine, Keio University School of Medicine, Tokyo, Japan
- Current address: School of Medicine, Yokohama City University, Japan
| | - Daniela Q. James
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Alexis Morrissey
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Thomas R. Blanda
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Nitika Badjatia
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - William K.M. Lai
- Department of Molecular Biology and Genetics, Cornell University, NY, USA
| | - Minoru S.H. Ko
- Department of Systems Medicine, Keio University School of Medicine, Tokyo, Japan
| | - B. Franklin Pugh
- Department of Molecular Biology and Genetics, Cornell University, NY, USA
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| |
Collapse
|
4
|
Dang T, Kumaishi K, Usui E, Kobori S, Sato T, Toda Y, Yamasaki Y, Tsujimoto H, Ichihashi Y, Iwata H. Stochastic variational variable selection for high-dimensional microbiome data. MICROBIOME 2022; 10:236. [PMID: 36566203 PMCID: PMC9789572 DOI: 10.1186/s40168-022-01439-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 11/28/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND The rapid and accurate identification of a minimal-size core set of representative microbial species plays an important role in the clustering of microbial community data and interpretation of clustering results. However, the huge dimensionality of microbial metagenomics datasets is a major challenge for the existing methods such as Dirichlet multinomial mixture (DMM) models. In the approach of the existing methods, the computational burden of identifying a small number of representative species from a large number of observed species remains a challenge. RESULTS We propose a novel approach to improve the performance of the widely used DMM approach by combining three ideas: (i) we propose an indicator variable to identify representative operational taxonomic units that substantially contribute to the differentiation among clusters; (ii) to address the computational burden of high-dimensional microbiome data, we propose a stochastic variational inference, which approximates the posterior distribution using a controllable distribution called variational distribution, and stochastic optimization algorithms for fast computation; and (iii) we extend the finite DMM model to an infinite case by considering Dirichlet process mixtures and estimating the number of clusters as a variational parameter. Using the proposed method, stochastic variational variable selection (SVVS), we analyzed the root microbiome data collected in our soybean field experiment, the human gut microbiome data from three published datasets of large-scale case-control studies and the healthy human microbiome data from the Human Microbiome Project. CONCLUSIONS SVVS demonstrates a better performance and significantly faster computation than those of the existing methods in all cases of testing datasets. In particular, SVVS is the only method that can analyze massive high-dimensional microbial data with more than 50,000 microbial species and 1000 samples. Furthermore, a core set of representative microbial species is identified using SVVS that can improve the interpretability of Bayesian mixture models for a wide range of microbiome studies. Video Abstract.
Collapse
Affiliation(s)
- Tung Dang
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Kie Kumaishi
- RIKEN BioResource Research Center, Tsukuba, Ibaraki, Japan
| | - Erika Usui
- RIKEN BioResource Research Center, Tsukuba, Ibaraki, Japan
| | - Shungo Kobori
- RIKEN BioResource Research Center, Tsukuba, Ibaraki, Japan
| | - Takumi Sato
- RIKEN BioResource Research Center, Tsukuba, Ibaraki, Japan
| | - Yusuke Toda
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Yuji Yamasaki
- Arid Land Research Center, Tottori University, Tottori, Japan
| | | | | | - Hiroyoshi Iwata
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
5
|
Osmala M, Eraslan G, Lähdesmäki H. ChromDMM: a Dirichlet-multinomial mixture model for clustering heterogeneous epigenetic data. Bioinformatics 2022; 38:3863-3870. [PMID: 35786716 PMCID: PMC9364382 DOI: 10.1093/bioinformatics/btac444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/20/2022] [Accepted: 06/30/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Research on epigenetic modifications and other chromatin features at genomic regulatory elements elucidates essential biological mechanisms including the regulation of gene expression. Despite the growing number of epigenetic datasets, new tools are still needed to discover novel distinctive patterns of heterogeneous epigenetic signals at regulatory elements. RESULTS We introduce ChromDMM, a product Dirichlet-multinomial mixture model for clustering genomic regions that are characterized by multiple chromatin features. ChromDMM extends the mixture model framework by profile shifting and flipping that can probabilistically account for inaccuracies in the position and strand-orientation of the genomic regions. Owing to hyper-parameter optimization, ChromDMM can also regularize the smoothness of the epigenetic profiles across the consecutive genomic regions. With simulated data, we demonstrate that ChromDMM clusters, shifts and strand-orients the profiles more accurately than previous methods. With ENCODE data, we show that the clustering of enhancer regions in the human genome reveals distinct patterns in several chromatin features. We further validate the enhancer clusters by their enrichment for transcriptional regulatory factor binding sites. AVAILABILITY AND IMPLEMENTATION ChromDMM is implemented as an R package and is available at https://github.com/MariaOsmala/ChromDMM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, Espoo 02150, Finland
| |
Collapse
|
6
|
Lai WKM, Mariani L, Rothschild G, Smith ER, Venters BJ, Blanda TR, Kuntala PK, Bocklund K, Mairose J, Dweikat SN, Mistretta K, Rossi MJ, James D, Anderson JT, Phanor SK, Zhang W, Zhao Z, Shah AP, Novitzky K, McAnarney E, Keogh MC, Shilatifard A, Basu U, Bulyk ML, Pugh BF. A ChIP-exo screen of 887 Protein Capture Reagents Program transcription factor antibodies in human cells. Genome Res 2021; 31:1663-1679. [PMID: 34426512 PMCID: PMC8415381 DOI: 10.1101/gr.275472.121] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 07/07/2021] [Indexed: 12/22/2022]
Abstract
Antibodies offer a powerful means to interrogate specific proteins in a complex milieu. However, antibody availability and reliability can be problematic, whereas epitope tagging can be impractical in many cases. To address these limitations, the Protein Capture Reagents Program (PCRP) generated over a thousand renewable monoclonal antibodies (mAbs) against human presumptive chromatin proteins. However, these reagents have not been widely field-tested. We therefore performed a screen to test their ability to enrich genomic regions via chromatin immunoprecipitation (ChIP) and a variety of orthogonal assays. Eight hundred eighty-seven unique antibodies against 681 unique human transcription factors (TFs) were assayed by ultra-high-resolution ChIP-exo/seq, generating approximately 1200 ChIP-exo data sets, primarily in a single pass in one cell type (K562). Subsets of PCRP mAbs were further tested in ChIP-seq, CUT&RUN, STORM super-resolution microscopy, immunoblots, and protein binding microarray (PBM) experiments. About 5% of the tested antibodies displayed high-confidence target (i.e., cognate antigen) enrichment across at least one assay and are strong candidates for additional validation. An additional 34% produced ChIP-exo data that were distinct from background and thus warrant further testing. The remaining 61% were not substantially different from background, and likely require consideration of a much broader survey of cell types and/or assay optimizations. We show and discuss the metrics and challenges to antibody validation in chromatin-based assays.
Collapse
Affiliation(s)
- William K M Lai
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.,Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Luca Mariani
- Division of Genetics, Department of Medicine; Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Gerson Rothschild
- Department of Microbiology and Immunology, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York 10032, USA
| | - Edwin R Smith
- Simpson Querrey Institute for Epigenetics and the Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, Illinois 60611, USA
| | | | - Thomas R Blanda
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Prashant K Kuntala
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Kylie Bocklund
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Joshua Mairose
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Sarah N Dweikat
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Katelyn Mistretta
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Matthew J Rossi
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Daniela James
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - James T Anderson
- Division of Genetics, Department of Medicine; Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Sabrina K Phanor
- Division of Genetics, Department of Medicine; Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Wanwei Zhang
- Department of Microbiology and Immunology, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York 10032, USA
| | - Zibo Zhao
- Simpson Querrey Institute for Epigenetics and the Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, Illinois 60611, USA
| | - Avani P Shah
- Simpson Querrey Institute for Epigenetics and the Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, Illinois 60611, USA
| | | | | | | | - Ali Shilatifard
- Simpson Querrey Institute for Epigenetics and the Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, Illinois 60611, USA
| | - Uttiya Basu
- Department of Microbiology and Immunology, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York 10032, USA
| | - Martha L Bulyk
- Division of Genetics, Department of Medicine; Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA.,Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - B Franklin Pugh
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.,Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
7
|
Biswas A, Narlikar L. Resolving diverse protein-DNA footprints from exonuclease-based ChIP experiments. Bioinformatics 2021; 37:i367-i375. [PMID: 34252930 PMCID: PMC8275329 DOI: 10.1093/bioinformatics/btab274] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
MOTIVATION High-throughput chromatin immunoprecipitation (ChIP) sequencing-based assays capture genomic regions associated with the profiled transcription factor (TF). ChIP-exo is a modified protocol, which uses lambda exonuclease to digest DNA close to the TF-DNA complex, in order to improve on the positional resolution of the TF-DNA contact. Because the digestion occurs in the 5'-3' orientation, the protocol produces directional footprints close to the complex, on both sides of the double stranded DNA. Like all ChIP-based methods, ChIP-exo reports a mixture of different regions associated with the TF: those bound directly to the TF as well as via intermediaries. However, the distribution of footprints are likely to be indicative of the complex forming at the DNA. RESULTS We present ExoDiversity, which uses a model-based framework to learn a joint distribution over footprints and motifs, thus resolving the mixture of ChIP-exo footprints into diverse binding modes. It uses no prior motif or TF information and automatically learns the number of different modes from the data. We show its application on a wide range of TFs and organisms/cell-types. Because its goal is to explain the complete set of reported regions, it is able to identify co-factor TF motifs that appear in a small fraction of the dataset. Further, ExoDiversity discovers small nucleotide variations within and outside canonical motifs, which co-occur with variations in footprints, suggesting that the TF-DNA structural configuration at those regions is likely to be different. Finally, we show that detected modes have specific DNA shape features and conservation signals, giving insights into the structure and function of the putative TF-DNA complexes. AVAILABILITY AND IMPLEMENTATION The code for ExoDiversity is available on https://github.com/NarlikarLab/exoDIVERSITY. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anushua Biswas
- Department of Chemical Engineering, CSIR-National Chemical Laboratory, Pune 411008, India.,Academy of Scientific and Innovative Research, Ghaziabad 201002, India
| | - Leelavati Narlikar
- Department of Chemical Engineering, CSIR-National Chemical Laboratory, Pune 411008, India.,Academy of Scientific and Innovative Research, Ghaziabad 201002, India
| |
Collapse
|
8
|
Puig RR, Boddie P, Khan A, Castro-Mondragon JA, Mathelier A. UniBind: maps of high-confidence direct TF-DNA interactions across nine species. BMC Genomics 2021; 22:482. [PMID: 34174819 PMCID: PMC8236138 DOI: 10.1186/s12864-021-07760-6] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 05/27/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Transcription factors (TFs) bind specifically to TF binding sites (TFBSs) at cis-regulatory regions to control transcription. It is critical to locate these TF-DNA interactions to understand transcriptional regulation. Efforts to predict bona fide TFBSs benefit from the availability of experimental data mapping DNA binding regions of TFs (chromatin immunoprecipitation followed by sequencing - ChIP-seq). RESULTS In this study, we processed ~ 10,000 public ChIP-seq datasets from nine species to provide high-quality TFBS predictions. After quality control, it culminated with the prediction of ~ 56 million TFBSs with experimental and computational support for direct TF-DNA interactions for 644 TFs in > 1000 cell lines and tissues. These TFBSs were used to predict > 197,000 cis-regulatory modules representing clusters of binding events in the corresponding genomes. The high-quality of the TFBSs was reinforced by their evolutionary conservation, enrichment at active cis-regulatory regions, and capacity to predict combinatorial binding of TFs. Further, we confirmed that the cell type and tissue specificity of enhancer activity was correlated with the number of TFs with binding sites predicted in these regions. All the data is provided to the community through the UniBind database that can be accessed through its web-interface ( https://unibind.uio.no/ ), a dedicated RESTful API, and as genomic tracks. Finally, we provide an enrichment tool, available as a web-service and an R package, for users to find TFs with enriched TFBSs in a set of provided genomic regions. CONCLUSIONS UniBind is the first resource of its kind, providing the largest collection of high-confidence direct TF-DNA interactions in nine species.
Collapse
Affiliation(s)
- Rafael Riudavets Puig
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0349, Oslo, Norway
| | - Paul Boddie
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0349, Oslo, Norway
| | - Aziz Khan
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0349, Oslo, Norway
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | | | - Anthony Mathelier
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0349, Oslo, Norway.
- Department of Medical Genetics, Oslo University Hospital, Oslo, 0424, Norway.
| |
Collapse
|
9
|
Rossi MJ, Kuntala PK, Lai WKM, Yamada N, Badjatia N, Mittal C, Kuzu G, Bocklund K, Farrell NP, Blanda TR, Mairose JD, Basting AV, Mistretta KS, Rocco DJ, Perkinson ES, Kellogg GD, Mahony S, Pugh BF. A high-resolution protein architecture of the budding yeast genome. Nature 2021; 592:309-314. [PMID: 33692541 PMCID: PMC8035251 DOI: 10.1038/s41586-021-03314-8] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Accepted: 01/29/2021] [Indexed: 01/31/2023]
Abstract
The genome-wide architecture of chromatin-associated proteins that maintains chromosome integrity and gene regulation is not well defined. Here we use chromatin immunoprecipitation, exonuclease digestion and DNA sequencing (ChIP-exo/seq)1,2 to define this architecture in Saccharomyces cerevisiae. We identify 21 meta-assemblages consisting of roughly 400 different proteins that are related to DNA replication, centromeres, subtelomeres, transposons and transcription by RNA polymerase (Pol) I, II and III. Replication proteins engulf a nucleosome, centromeres lack a nucleosome, and repressive proteins encompass three nucleosomes at subtelomeric X-elements. We find that most promoters associated with Pol II evolved to lack a regulatory region, having only a core promoter. These constitutive promoters comprise a short nucleosome-free region (NFR) adjacent to a +1 nucleosome, which together bind the transcription-initiation factor TFIID to form a preinitiation complex. Positioned insulators protect core promoters from upstream events. A small fraction of promoters evolved an architecture for inducibility, whereby sequence-specific transcription factors (ssTFs) create a nucleosome-depleted region (NDR) that is distinct from an NFR. We describe structural interactions among ssTFs, their cognate cofactors and the genome. These interactions include the nucleosomal and transcriptional regulators RPD3-L, SAGA, NuA4, Tup1, Mediator and SWI-SNF. Surprisingly, we do not detect interactions between ssTFs and TFIID, suggesting that such interactions do not stably occur. Our model for gene induction involves ssTFs, cofactors and general factors such as TBP and TFIIB, but not TFIID. By contrast, constitutive transcription involves TFIID but not ssTFs engaged with their cofactors. From this, we define a highly integrated network of gene regulation by ssTFs.
Collapse
Affiliation(s)
- Matthew J Rossi
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Prashant K Kuntala
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - William K M Lai
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Naomi Yamada
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Nitika Badjatia
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Chitvan Mittal
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Guray Kuzu
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Kylie Bocklund
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Nina P Farrell
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Thomas R Blanda
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Joshua D Mairose
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Ann V Basting
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Katelyn S Mistretta
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - David J Rocco
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Emily S Perkinson
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Gretta D Kellogg
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - B Franklin Pugh
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA.
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| |
Collapse
|
10
|
Lee BH, Rhie SK. Molecular and computational approaches to map regulatory elements in 3D chromatin structure. Epigenetics Chromatin 2021; 14:14. [PMID: 33741028 PMCID: PMC7980343 DOI: 10.1186/s13072-021-00390-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 03/08/2021] [Indexed: 12/19/2022] Open
Abstract
Epigenetic marks do not change the sequence of DNA but affect gene expression in a cell-type specific manner by altering the activities of regulatory elements. Development of new molecular biology assays, sequencing technologies, and computational approaches enables us to profile the human epigenome in three-dimensional structure genome-wide. Here we describe various molecular biology techniques and bioinformatic tools that have been developed to measure the activities of regulatory elements and their chromatin interactions. Moreover, we list currently available three-dimensional epigenomic data sets that are generated in various human cell types and tissues to assist in the design and analysis of research projects.
Collapse
Affiliation(s)
- Beoung Hun Lee
- Department of Biochemistry and Molecular Medicine and the Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, 90089, USA
| | - Suhn K Rhie
- Department of Biochemistry and Molecular Medicine and the Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, 90089, USA.
| |
Collapse
|
11
|
Avsec Ž, Weilert M, Shrikumar A, Krueger S, Alexandari A, Dalal K, Fropf R, McAnany C, Gagneur J, Kundaje A, Zeitlinger J. Base-resolution models of transcription-factor binding reveal soft motif syntax. Nat Genet 2021; 53:354-366. [PMID: 33603233 PMCID: PMC8812996 DOI: 10.1038/s41588-021-00782-6] [Citation(s) in RCA: 317] [Impact Index Per Article: 79.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 01/07/2021] [Indexed: 01/30/2023]
Abstract
The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.
Collapse
Affiliation(s)
- Žiga Avsec
- Department of Informatics, Technical University of Munich, Garching, Germany,Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Munich, Germany,Currently at DeepMind, London, UK
| | - Melanie Weilert
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Sabrina Krueger
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Amr Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Khyati Dalal
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA
| | - Robin Fropf
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Charles McAnany
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Julien Gagneur
- Department of Informatics, Technical University of Munich, Garching, Germany
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA,Department of Genetics, Stanford University, Stanford, CA, USA,correspondence: ,
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA,correspondence: ,
| |
Collapse
|
12
|
Srivastava D, Aydin B, Mazzoni EO, Mahony S. An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding. Genome Biol 2021; 22:20. [PMID: 33413545 PMCID: PMC7788824 DOI: 10.1186/s13059-020-02218-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 12/03/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Transcription factor (TF) binding specificity is determined via a complex interplay between the transcription factor's DNA binding preference and cell type-specific chromatin environments. The chromatin features that correlate with transcription factor binding in a given cell type have been well characterized. For instance, the binding sites for a majority of transcription factors display concurrent chromatin accessibility. However, concurrent chromatin features reflect the binding activities of the transcription factor itself and thus provide limited insight into how genome-wide TF-DNA binding patterns became established in the first place. To understand the determinants of transcription factor binding specificity, we therefore need to examine how newly activated transcription factors interact with sequence and preexisting chromatin landscapes. RESULTS Here, we investigate the sequence and preexisting chromatin predictors of TF-DNA binding by examining the genome-wide occupancy of transcription factors that have been induced in well-characterized chromatin environments. We develop Bichrom, a bimodal neural network that jointly models sequence and preexisting chromatin data to interpret the genome-wide binding patterns of induced transcription factors. We find that the preexisting chromatin landscape is a differential global predictor of TF-DNA binding; incorporating preexisting chromatin features improves our ability to explain the binding specificity of some transcription factors substantially, but not others. Furthermore, by analyzing site-level predictors, we show that transcription factor binding in previously inaccessible chromatin tends to correspond to the presence of more favorable cognate DNA sequences. CONCLUSIONS Bichrom thus provides a framework for modeling, interpreting, and visualizing the joint sequence and chromatin landscapes that determine TF-DNA binding dynamics.
Collapse
Affiliation(s)
- Divyanshi Srivastava
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, Pennsylvania State University, University Park, PA, USA
| | - Begüm Aydin
- Department of Biology, New York University, New York, NY, USA
| | | | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
13
|
Yamada N, Rossi MJ, Farrell N, Pugh BF, Mahony S. Alignment and quantification of ChIP-exo crosslinking patterns reveal the spatial organization of protein-DNA complexes. Nucleic Acids Res 2020; 48:11215-11226. [PMID: 32747934 PMCID: PMC7672471 DOI: 10.1093/nar/gkaa618] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 06/25/2020] [Accepted: 07/13/2020] [Indexed: 12/12/2022] Open
Abstract
The ChIP-exo assay precisely delineates protein-DNA crosslinking patterns by combining chromatin immunoprecipitation with 5' to 3' exonuclease digestion. Within a regulatory complex, the physical distance of a regulatory protein to DNA affects crosslinking efficiencies. Therefore, the spatial organization of a protein-DNA complex could potentially be inferred by analyzing how crosslinking signatures vary between its subunits. Here, we present a computational framework that aligns ChIP-exo crosslinking patterns from multiple proteins across a set of coordinately bound regulatory regions, and which detects and quantifies protein-DNA crosslinking events within the aligned profiles. By producing consistent measurements of protein-DNA crosslinking strengths across multiple proteins, our approach enables characterization of relative spatial organization within a regulatory complex. Applying our approach to collections of ChIP-exo data, we demonstrate that it can recover aspects of regulatory complex spatial organization at yeast ribosomal protein genes and yeast tRNA genes. We also demonstrate the ability to quantify changes in protein-DNA complex organization across conditions by applying our approach to analyze Drosophila Pol II transcriptional components. Our results suggest that principled analyses of ChIP-exo crosslinking patterns enable inference of spatial organization within protein-DNA complexes.
Collapse
Affiliation(s)
- Naomi Yamada
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Matthew J Rossi
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Nina Farrell
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - B Franklin Pugh
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
14
|
Srivastava D, Mahony S. Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020; 1863:194443. [PMID: 31639474 PMCID: PMC7166147 DOI: 10.1016/j.bbagrm.2019.194443] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 09/21/2019] [Accepted: 10/06/2019] [Indexed: 12/14/2022]
Abstract
Transcription factors (TFs) selectively bind distinct sets of sites in different cell types. Such cell type-specific binding specificity is expected to result from interplay between the TF's intrinsic sequence preferences, cooperative interactions with other regulatory proteins, and cell type-specific chromatin landscapes. Cell type-specific TF binding events are highly correlated with patterns of chromatin accessibility and active histone modifications in the same cell type. However, since concurrent chromatin may itself be a consequence of TF binding, chromatin landscapes measured prior to TF activation provide more useful insights into how cell type-specific TF binding events became established in the first place. Here, we review the various sequence and chromatin determinants of cell type-specific TF binding specificity. We identify the current challenges and opportunities associated with computational approaches to characterizing, imputing, and predicting cell type-specific TF binding patterns. We further focus on studies that characterize TF binding in dynamic regulatory settings, and we discuss how these studies are leading to a more complex and nuanced understanding of dynamic protein-DNA binding activities. We propose that TF binding activities at individual sites can be viewed along a two-dimensional continuum of local sequence and chromatin context. Under this view, cell type-specific TF binding activities may result from either strongly favorable sequence features or strongly favorable chromatin context.
Collapse
Affiliation(s)
- Divyanshi Srivastava
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA, United States of America
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, PA, United States of America.
| |
Collapse
|
15
|
Zhang Y, Chan HL, Garcia-Martinez L, Karl DL, Weich N, Slingerland JM, Verdun RE, Morey L. Estrogen induces dynamic ERα and RING1B recruitment to control gene and enhancer activities in luminal breast cancer. SCIENCE ADVANCES 2020; 6:eaaz7249. [PMID: 32548262 PMCID: PMC7274770 DOI: 10.1126/sciadv.aaz7249] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 04/02/2020] [Indexed: 05/04/2023]
Abstract
RING1B, a core Polycomb repressive complex 1 subunit, is a histone H2A ubiquitin ligase essential for development. RING1B is overexpressed in patients with luminal breast cancer (BC) and recruited to actively transcribed genes and enhancers co-occupied by the estrogen receptor α (ERα). Whether ERα-induced transcriptional programs are mediated by RING1B is not understood. We show that prolonged estrogen administration induces transcriptional output and chromatin landscape fluctuations. RING1B loss impairs full estrogen-mediated gene expression and chromatin accessibility for key BC transcription factors. These effects were mediated, in part, by RING1B enzymatic activity and nucleosome binding functions. RING1B is recruited in a cyclic manner to ERα, FOXA1, and GRHL2 cobound sites and regulates estrogen-induced enhancers and ERα recruitment. Last, ChIP exo revealed multiple binding events of these factors at single-nucleotide resolution, including RING1B occupancy approximately 10 base pairs around ERα bound sites. We propose RING1B as a key regulator of the dynamic, liganded-ERα transcriptional regulatory circuit in luminal BC.
Collapse
Affiliation(s)
- Yusheng Zhang
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Department of Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Ho Lam Chan
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Department of Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Liliana Garcia-Martinez
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Department of Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Daniel L. Karl
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
| | - Natalia Weich
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Division of Hematology, Department of Medicine, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Joyce M. Slingerland
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Division of Hematology, Department of Medicine, University of Miami Miller School of Medicine, Miami, FL 33136, USA
- Department of Biochemistry and Molecular Biology, University of Miami Miller School of Medicine, Miami, FL 33136, USA
- Braman Family Breast Cancer Institute, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Ramiro E. Verdun
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Division of Hematology, Department of Medicine, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Lluis Morey
- Sylvester Comprehensive Cancer Center, Miami, FL 33136, USA
- Department of Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building, 1501 NW 10th Avenue, Miami, FL 33136, USA
- Corresponding author.
| |
Collapse
|
16
|
Elsarraj HS, Hong Y, Limback D, Zhao R, Berger J, Bishop SC, Sabbagh A, Oppenheimer L, Harper HE, Tsimelzon A, Huang S, Hilsenbeck SG, Edwards DP, Fontes J, Fan F, Madan R, Fangman B, Ellis A, Tawfik O, Persons DL, Fields T, Godwin AK, Hagan CR, Swenson-Fields K, Coarfa C, Thompson J, Behbod F. BCL9/STAT3 regulation of transcriptional enhancer networks promote DCIS progression. NPJ Breast Cancer 2020; 6:12. [PMID: 32352029 PMCID: PMC7181646 DOI: 10.1038/s41523-020-0157-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 03/04/2020] [Indexed: 12/21/2022] Open
Abstract
The molecular processes by which some human ductal carcinoma in situ (DCIS) lesions advance to the more aggressive form, while others remain indolent, are largely unknown. Experiments utilizing a patient-derived (PDX) DCIS Mouse INtraDuctal (MIND) animal model combined with ChIP-exo and RNA sequencing revealed that the formation of protein complexes between B Cell Lymphoma-9 (BCL9), phosphoserine 727 STAT3 (PS-727-STAT3) and non-STAT3 transcription factors on chromatin enhancers lead to subsequent transcription of key drivers of DCIS malignancy. Downregulation of two such targets, integrin β3 and its associated metalloproteinase, MMP16, resulted in a significant inhibition of DCIS invasive progression. Finally, in vivo targeting of BCL9, using rosemary extract, resulted in significant inhibition of DCIS malignancy in both cell line and PDX DCIS MIND animal models. As such, our studies provide compelling evidence for future testing of rosemary extract as a chemopreventive agent in breast cancer.
Collapse
Affiliation(s)
- Hanan S. Elsarraj
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Yan Hong
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Darlene Limback
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Ruonan Zhao
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Jenna Berger
- Warren Alpert Medical School of Brown University, Providence, RI 02912 USA
| | - Stephanie C. Bishop
- Department of Pharmaceutical Sciences, South University, 709 Mall Blvd, Savannah, GA 31406 USA
| | - Aria Sabbagh
- McGovern Medical School, The University of Texas Health Science Center, Houston, TX 77030 USA
| | - Linzi Oppenheimer
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Haleigh E. Harper
- University of Kansas School of Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Anna Tsimelzon
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Shixia Huang
- Dan L. Duncan Cancer Center and Department of Molecular & Cellular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Susan G. Hilsenbeck
- Lester and Sue Smith Breast Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX C30 USA
| | - Dean P. Edwards
- Dan L. Duncan Cancer Center and Department of Molecular & Cellular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Joseph Fontes
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Fang Fan
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Rashna Madan
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Ben Fangman
- University of Kansas School of Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Ashley Ellis
- University of Kansas School of Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Ossama Tawfik
- MAWD Pathology Group, St Luke’s Health System of Kansas City, 2750 Clay Edwards Dr, Kansas City, MO 64116 USA
| | - Diane L. Persons
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Timothy Fields
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Andrew K. Godwin
- Department of Pathology and Laboratory Medicine, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Christy R. Hagan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Katherine Swenson-Fields
- Department of Anatomy and Cell Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Cristian Coarfa
- Dan L. Duncan Cancer Center and Department of Molecular & Cellular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Jeffrey Thompson
- Department of Biostatistics, The University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA
| | - Fariba Behbod
- Department of Pathology and Laboratory Medicine, MS 3045, The University of Kansas Medical Center, Kansas City, KS 66160 USA
| |
Collapse
|
17
|
Sharma V, Majumdar S. Comparative analysis of ChIP-exo peak-callers: impact of data quality, read duplication and binding subtypes. BMC Bioinformatics 2020; 21:65. [PMID: 32085702 PMCID: PMC7035708 DOI: 10.1186/s12859-020-3403-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Accepted: 02/10/2020] [Indexed: 01/26/2023] Open
Abstract
Background ChIP (Chromatin immunoprecipitation)-exo has emerged as an important and versatile improvement over conventional ChIP-seq as it reduces the level of noise, maps the transcription factor (TF) binding location in a very precise manner, upto single base-pair resolution, and enables binding mode prediction. Availability of numerous peak-callers for analyzing ChIP-exo reads has motivated the need to assess their performance and report which tool executes reasonably well for the task. Results This study has focussed on comparing peak-callers that report direct binding events with those that report indirect binding events. The effect of strandedness of reads and duplication of data on the performance of peak-callers has been investigated. The number of peaks reported by each peak-caller is compared followed by a comparison of the annotated motifs present in the reported peaks. The significance of peaks is assessed based on the presence of a motif in top peaks. Indirect binding tools have been compared on the basis of their ability to identify annotated motifs and predict mode of protein-DNA interaction. Conclusion By studying the output of the peak-callers investigated in this study, it is concluded that the tools that use self-learning algorithms, i.e. the tools that estimate all the essential parameters from the aligned reads, perform better than the algorithms which require formation of peak-pairs. The latest tools that account for indirect binding of TFs appear to be an upgrade over the available tools, as they are able to reveal valuable information about the mode of binding in addition to direct binding. Furthermore, the quality of ChIP-exo reads have important consequences on the output of data analysis.
Collapse
Affiliation(s)
- Vasudha Sharma
- Discipline of Biological Engineering, Indian Institute of Technology Gandhinagar, Palaj, Gujarat, 382355, India
| | - Sharmistha Majumdar
- Discipline of Biological Engineering, Indian Institute of Technology Gandhinagar, Palaj, Gujarat, 382355, India.
| |
Collapse
|
18
|
Yamada N, Kuntala PK, Pugh BF, Mahony S. ChExMix: A Method for Identifying and Classifying Protein-DNA Interaction Subtypes. J Comput Biol 2020; 27:429-435. [PMID: 32023130 DOI: 10.1089/cmb.2019.0466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Regulatory proteins can employ multiple direct and indirect modes of interaction with the genome. The ChIP-exo mixture model (ChExMix) provides a principled approach to detecting multiple protein-DNA interaction modes in a single ChIP-exo experiment. ChExMix discovers and characterizes binding event subtypes in ChIP-exo data by leveraging both protein-DNA cross-linking signatures and DNA motifs. In this study, we present a summary of the major features and applications of ChExMix. We demonstrate that ChExMix does not require high-resolution protein-DNA binding assay data to detect binding event subtypes. Specifically, we apply ChExMix to analyze 393 ChIP-seq data profiles in K562 cells. Similar binding event subtypes are discovered across multiple proteins, suggesting the existence of colocalized regulatory protein modules that are recruited to DNA through a particular sequence-specific transcription factor. Our results thus suggest that ChExMix can characterize protein-DNA binding interaction modes using data from multiple types of protein-DNA interaction assays.
Collapse
Affiliation(s)
- Naomi Yamada
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania
| | - Prashant Kumar Kuntala
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania
| | - B Franklin Pugh
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania
| |
Collapse
|
19
|
Gheorghe M, Sandve GK, Khan A, Chèneby J, Ballester B, Mathelier A. A map of direct TF-DNA interactions in the human genome. Nucleic Acids Res 2019; 47:e21. [PMID: 30517703 PMCID: PMC6393237 DOI: 10.1093/nar/gky1210] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2018] [Revised: 10/31/2018] [Accepted: 11/20/2018] [Indexed: 12/11/2022] Open
Abstract
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most popular assay to identify genomic regions, called ChIP-seq peaks, that are bound in vivo by transcription factors (TFs). These regions are derived from direct TF-DNA interactions, indirect binding of the TF to the DNA (through a co-binding partner), nonspecific binding to the DNA, and noise/bias/artifacts. Delineating the bona fide direct TF-DNA interactions within the ChIP-seq peaks remains challenging. We developed a dedicated software, ChIP-eat, that combines computational TF binding models and ChIP-seq peaks to automatically predict direct TF-DNA interactions. Our work culminated with predicted interactions covering >4% of the human genome, obtained by uniformly processing 1983 ChIP-seq peak data sets from the ReMap database for 232 unique TFs. The predictions were a posteriori assessed using protein binding microarray and ChIP-exo data, and were predominantly found in high quality ChIP-seq peaks. The set of predicted direct TF-DNA interactions suggested that high-occupancy target regions are likely not derived from direct binding of the TFs to the DNA. Our predictions derived co-binding TFs supported by protein-protein interaction data and defined cis-regulatory modules enriched for disease- and trait-associated SNPs. We provide this collection of direct TF-DNA interactions and cis-regulatory modules through the UniBind web-interface (http://unibind.uio.no).
Collapse
Affiliation(s)
- Marius Gheorghe
- Centre for Molecular Medicine Norway (NCMM), University of Oslo, Oslo, Norway
| | | | - Aziz Khan
- Centre for Molecular Medicine Norway (NCMM), University of Oslo, Oslo, Norway
| | - Jeanne Chèneby
- Aix Marseille Université, INSERM, TAGC, Marseille, France
| | | | - Anthony Mathelier
- Centre for Molecular Medicine Norway (NCMM), University of Oslo, Oslo, Norway.,Department of Cancer Genetics, Institute for Cancer Research, Radiumhospitalet, Oslo, Norway
| |
Collapse
|
20
|
Datta V, Hannenhalli S, Siddharthan R. ChIPulate: A comprehensive ChIP-seq simulation pipeline. PLoS Comput Biol 2019; 15:e1006921. [PMID: 30897079 PMCID: PMC6445533 DOI: 10.1371/journal.pcbi.1006921] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 04/02/2019] [Accepted: 03/04/2019] [Indexed: 12/17/2022] Open
Abstract
ChIP-seq (Chromatin Immunoprecipitation followed by sequencing) is a high-throughput technique to identify genomic regions that are bound in vivo by a particular protein, e.g., a transcription factor (TF). Biological factors, such as chromatin state, indirect and cooperative binding, as well as experimental factors, such as antibody quality, cross-linking, and PCR biases, are known to affect the outcome of ChIP-seq experiments. However, the relative impact of these factors on inferences made from ChIP-seq data is not entirely clear. Here, via a detailed ChIP-seq simulation pipeline, ChIPulate, we assess the impact of various biological and experimental sources of variation on several outcomes of a ChIP-seq experiment, viz., the recoverability of the TF binding motif, accuracy of TF-DNA binding detection, the sensitivity of inferred TF-DNA binding strength, and number of replicates needed to confidently infer binding strength. We find that the TF motif can be recovered despite poor and non-uniform extraction and PCR amplification efficiencies. The recovery of the motif is, however, affected to a larger extent by the fraction of sites that are either cooperatively or indirectly bound. Importantly, our simulations reveal that the number of ChIP-seq replicates needed to accurately measure in vivo occupancy at high-affinity sites is larger than the recommended community standards. Our results establish statistical limits on the accuracy of inferences of protein-DNA binding from ChIP-seq and suggest that increasing the mean extraction efficiency, rather than amplification efficiency, would better improve sensitivity. The source code and instructions for running ChIPulate can be found at https://github.com/vishakad/chipulate. DNA-binding proteins perform many key roles in biology, such as transcriptional regulation of gene expression and chromatin modification. ChIP-seq (Chromatin immunoprecipitation followed by high-throughput sequencing) is a widely used experimental technique to identify DNA-binding sites of specific proteins of interest, within cells, genome-wide. DNA fragments from genomic regions that are bound by a protein of interest, often a transcription factor (TF), are selectively extracted using specific antibodies, amplified using PCR, and sequenced. The sequences are mapped to the reference genome. Regions where many sequences map, called “peaks”, are used to infer the location of TF-bound loci (peaks), in vivo occupancy at those loci, and the sequence pattern (motif) to which the TF shows a binding affinity. But measurements of TF occupancy and motif inference are vulnerable to several biological and experimental sources of variation that are poorly understood and difficult to assess directly. Here, we simulate key steps of the ChIP-seq protocol with the aim of estimating the relative effects of various sources of variations on motif inference and binding affinity estimations. Besides providing specific insights and recommendations, we provide a general framework to simulate sequence reads in a ChIP-seq experiment, which should considerably aid in the development of software aimed at analyzing ChIP-seq data.
Collapse
Affiliation(s)
- Vishaka Datta
- Simons Centre for the Study of Living Machines, National Centre for Biological Sciences, TIFR, Bengaluru, Karnataka, India
- * E-mail:
| | - Sridhar Hannenhalli
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Rahul Siddharthan
- The Institute of Mathematical Sciences/HBNI, Taramani, Chennai, India
| |
Collapse
|