1
|
Chromatin accessibility variation provides insights into missing regulation underlying immune-mediated diseases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.12.589213. [PMID: 38659802 PMCID: PMC11042205 DOI: 10.1101/2024.04.12.589213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Most genetic loci associated with complex traits and diseases through genome-wide association studies (GWAS) are noncoding, suggesting that the causal variants likely have gene regulatory effects. However, only a small number of loci have been linked to expression quantitative trait loci (eQTLs) detected currently. To better understand the potential reasons for many trait-associated loci lacking eQTL colocalization, we investigated whether chromatin accessibility QTLs (caQTLs) in lymphoblastoid cell lines (LCLs) explain immune-mediated disease associations that eQTLs in LCLs did not. The power to detect caQTLs was greater than that of eQTLs and was less affected by the distance from the transcription start site of the associated gene. Meta-analyzing LCL eQTL data to increase the sample size to over a thousand led to additional loci with eQTL colocalization, demonstrating that insufficient statistical power is still likely to be a factor. Moreover, further eQTL colocalization loci were uncovered by surveying eQTLs of other immune cell types. Altogether, insufficient power and context-specificity of eQTLs both contribute to the 'missing regulation.'
Collapse
|
2
|
Widespread variation in molecular interactions and regulatory properties among transcription factor isoforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.12.584681. [PMID: 38617209 PMCID: PMC11014633 DOI: 10.1101/2024.03.12.584681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Most human Transcription factors (TFs) genes encode multiple protein isoforms differing in DNA binding domains, effector domains, or other protein regions. The global extent to which this results in functional differences between isoforms remains unknown. Here, we systematically compared 693 isoforms of 246 TF genes, assessing DNA binding, protein binding, transcriptional activation, subcellular localization, and condensate formation. Relative to reference isoforms, two-thirds of alternative TF isoforms exhibit differences in one or more molecular activities, which often could not be predicted from sequence. We observed two primary categories of alternative TF isoforms: "rewirers" and "negative regulators", both of which were associated with differentiation and cancer. Our results support a model wherein the relative expression levels of, and interactions involving, TF isoforms add an understudied layer of complexity to gene regulatory networks, demonstrating the importance of isoform-aware characterization of TF functions and providing a rich resource for further studies.
Collapse
|
3
|
DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues. Nat Commun 2024; 15:3110. [PMID: 38600112 PMCID: PMC11006913 DOI: 10.1038/s41467-024-47396-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 03/29/2024] [Indexed: 04/12/2024] Open
Abstract
Homeodomains (HDs) are the second largest class of DNA binding domains (DBDs) among eukaryotic sequence-specific transcription factors (TFs) and are the TF structural class with the largest number of disease-associated mutations in the Human Gene Mutation Database (HGMD). Despite numerous structural studies and large-scale analyses of HD DNA binding specificity, HD-DNA recognition is still not fully understood. Here, we analyze 92 human HD mutants, including disease-associated variants and variants of uncertain significance (VUS), for their effects on DNA binding activity. Many of the variants alter DNA binding affinity and/or specificity. Detailed biochemical analysis and structural modeling identifies 14 previously unknown specificity-determining positions, 5 of which do not contact DNA. The same missense substitution at analogous positions within different HDs often exhibits different effects on DNA binding activity. Variant effect prediction tools perform moderately well in distinguishing variants with altered DNA binding affinity, but poorly in identifying those with altered binding specificity. Our results highlight the need for biochemical assays of TF coding variants and prioritize dozens of variants for further investigations into their pathogenicity and the development of clinical diagnostics and precision therapies.
Collapse
|
4
|
Overlapping binding sites underlie TF genomic occupancy. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.05.583629. [PMID: 38496549 PMCID: PMC10942454 DOI: 10.1101/2024.03.05.583629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Sequence-specific DNA binding by transcription factors (TFs) is a crucial step in gene regulation. However, current high-throughput in vitro approaches cannot reliably detect lower affinity TF-DNA interactions, which play key roles in gene regulation. Here, we developed PADIT-seq ( p rotein a ffinity to D NA by in vitro transcription and RNA seq uencing) to assay TF binding preferences to all 10-bp DNA sequences at far greater sensitivity than prior approaches. The expanded catalogs of low affinity DNA binding sites for the human TFs HOXD13 and EGR1 revealed that nucleotides flanking high affinity DNA binding sites create overlapping lower affinity sites that together modulate TF genomic occupancy in vivo . Formation of such extended recognition sequences stems from an inherent property of TF binding sites to interweave each other and expands the genomic sequence space for identifying noncoding variants that directly alter TF binding. One-Sentence Summary Overlapping DNA binding sites underlie TF genomic occupancy through their inherent propensity to interweave each other.
Collapse
|
5
|
Pioneer factors - key regulators of chromatin and gene expression. Nat Rev Genet 2023; 24:809-815. [PMID: 37740118 DOI: 10.1038/s41576-023-00648-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/02/2023] [Indexed: 09/24/2023]
|
6
|
Altered binding affinity of SIX1-Q177R correlates with enhanced WNT5A and WNT pathway effector expression in Wilms tumor. Dis Model Mech 2023; 16:dmm050208. [PMID: 37815464 PMCID: PMC10668032 DOI: 10.1242/dmm.050208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 09/27/2023] [Indexed: 10/11/2023] Open
Abstract
Wilms tumors present as an amalgam of varying proportions of tissues located within the developing kidney, one being the nephrogenic blastema comprising multipotent nephron progenitor cells (NPCs). The recurring missense mutation Q177R in NPC transcription factors SIX1 and SIX2 is most correlated with tumors of blastemal histology and is significantly associated with relapse. Yet, the transcriptional regulatory consequences of SIX1/2-Q177R that might promote tumor progression and recurrence have not been investigated extensively. Utilizing multiple Wilms tumor transcriptomic datasets, we identified upregulation of the gene encoding non-canonical WNT ligand WNT5A in addition to other WNT pathway effectors in SIX1/2-Q177R mutant tumors. SIX1 ChIP-seq datasets from Wilms tumors revealed shared binding sites for SIX1/SIX1-Q177R within a promoter of WNT5A and at putative distal cis-regulatory elements (CREs). We demonstrate colocalization of SIX1 and WNT5A in Wilms tumor tissue and utilize in vitro assays that support SIX1 and SIX1-Q177R activation of expression from the WNT5A CREs, as well as enhanced binding affinity within the WNT5A promoter that may promote the differential expression of WNT5A and other WNT pathway effectors associated with SIX1-Q177R tumors.
Collapse
|
7
|
A stem cell epigenome is associated with primary nonresponse to CD19 CAR T cells in pediatric acute lymphoblastic leukemia. Blood Adv 2023; 7:4218-4232. [PMID: 36607839 PMCID: PMC10440404 DOI: 10.1182/bloodadvances.2022008977] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 12/19/2022] [Accepted: 12/28/2022] [Indexed: 01/07/2023] Open
Abstract
CD19 chimeric antigen receptor T-cell therapy (CD19-CAR) has changed the treatment landscape and outcomes for patients with pre-B-cell acute lymphoblastic leukemia (B-ALL). Unfortunately, primary nonresponse (PNR), sustained CD19+ disease, and concurrent expansion of CD19-CAR occur in 20% of the patients and is associated with adverse outcomes. Although some failures may be attributable to CD19 loss, mechanisms of CD19-independent, leukemia-intrinsic resistance to CD19-CAR remain poorly understood. We hypothesize that PNR leukemias are distinct compared with primary sensitive (PS) leukemias and that these differences are present before treatment. We used a multiomic approach to investigate this in 14 patients (7 with PNR and 7 with PS) enrolled in the PLAT-02 trial at Seattle Children's Hospital. Long-read PacBio sequencing helped identify 1 PNR in which 47% of CD19 transcripts had exon 2 skipping, but other samples lacked CD19 transcript abnormalities. Epigenetic profiling discovered DNA hypermethylation at genes targeted by polycomb repressive complex 2 (PRC2) in embryonic stem cells. Similarly, assays of transposase-accessible chromatin-sequencing revealed reduced accessibility at these PRC2 target genes, with a gain in accessibility of regions characteristic of hematopoietic stem cells and multilineage progenitors in PNR. Single-cell RNA sequencing and cytometry by time of flight analyses identified leukemic subpopulations expressing multilineage markers and decreased antigen presentation in PNR. We thus describe the association of a stem cell epigenome with primary resistance to CD19-CAR therapy. Future trials incorporating these biomarkers, with the addition of multispecific CAR T cells targeting against leukemic stem cell or myeloid antigens, and/or combined epigenetic therapy to disrupt this distinct stem cell epigenome may improve outcomes of patients with B-ALL.
Collapse
|
8
|
Blood cell traits' GWAS loci colocalization with variation in PU.1 genomic occupancy prioritizes causal noncoding regulatory variants. CELL GENOMICS 2023; 3:100327. [PMID: 37492098 PMCID: PMC10363807 DOI: 10.1016/j.xgen.2023.100327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 02/10/2023] [Accepted: 04/25/2023] [Indexed: 07/27/2023]
Abstract
Genome-wide association studies (GWASs) have uncovered numerous trait-associated loci across the human genome, most of which are located in noncoding regions, making interpretation difficult. Moreover, causal variants are hard to statistically fine-map at many loci because of widespread linkage disequilibrium. To address this challenge, we present a strategy utilizing transcription factor (TF) binding quantitative trait loci (bQTLs) for colocalization analysis to identify trait associations likely mediated by TF occupancy variation and to pinpoint likely causal variants using motif scores. We applied this approach to PU.1 bQTLs in lymphoblastoid cell lines and blood cell trait GWAS data. Colocalization analysis revealed 69 blood cell trait GWAS loci putatively driven by PU.1 occupancy variation. We nominate PU.1 motif-altering variants as the likely shared causal variants at 51 loci. Such integration of TF bQTL data with other GWAS data may reveal transcriptional regulatory mechanisms and causal noncoding variants underlying additional complex traits.
Collapse
|
9
|
Colocalization of blood cell traits GWAS associations and variation in PU.1 genomic occupancy prioritizes causal noncoding regulatory variants. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.29.534582. [PMID: 37034747 PMCID: PMC10081269 DOI: 10.1101/2023.03.29.534582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Genome-wide association studies (GWAS) have uncovered numerous trait-associated loci across the human genome, most of which are located in noncoding regions, making interpretations difficult. Moreover, causal variants are hard to statistically fine-map at many loci because of widespread linkage disequilibrium. To address this challenge, we present a strategy utilizing transcription factor (TF) binding quantitative trait loci (bQTLs) for colocalization analysis to identify trait associations likely mediated by TF occupancy variation and to pinpoint likely causal variants using motif scores. We applied this approach to PU.1 bQTLs in lymphoblastoid cell lines and blood cell traits GWAS data. Colocalization analysis revealed 69 blood cell trait GWAS loci putatively driven by PU.1 occupancy variation. We nominate PU.1 motif-altering variants as the likely shared causal variants at 51 loci. Such integration of TF bQTL data with other GWAS data may reveal transcriptional regulatory mechanisms and causal noncoding variants underlying additional complex traits.
Collapse
|
10
|
MORF and MOZ acetyltransferases target unmethylated CpG islands through the winged helix domain. Nat Commun 2023; 14:697. [PMID: 36754959 PMCID: PMC9908889 DOI: 10.1038/s41467-023-36368-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 01/26/2023] [Indexed: 02/10/2023] Open
Abstract
Human acetyltransferases MOZ and MORF are implicated in chromosomal translocations associated with aggressive leukemias. Oncogenic translocations involve the far amino terminus of MOZ/MORF, the function of which remains unclear. Here, we identified and characterized two structured winged helix (WH) domains, WH1 and WH2, in MORF and MOZ. WHs bind DNA in a cooperative manner, with WH1 specifically recognizing unmethylated CpG sequences. Structural and genomic analyses show that the DNA binding function of WHs targets MORF/MOZ to gene promoters, stimulating transcription and H3K23 acetylation, and WH1 recruits oncogenic fusions to HOXA genes that trigger leukemogenesis. Cryo-EM, NMR, mass spectrometry and mutagenesis studies provide mechanistic insight into the DNA-binding mechanism, which includes the association of WH1 with the CpG-containing linker DNA and binding of WH2 to the dyad of the nucleosome. The discovery of WHs in MORF and MOZ and their DNA binding functions could open an avenue in developing therapeutics to treat diseases associated with aberrant MOZ/MORF acetyltransferase activities.
Collapse
|
11
|
Comparative chromatin accessibility upon BDNF stimulation delineates neuronal regulatory elements. Mol Syst Biol 2022; 18:e10473. [PMID: 35996956 PMCID: PMC9396287 DOI: 10.15252/msb.202110473] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 07/28/2022] [Accepted: 08/01/2022] [Indexed: 12/30/2022] Open
Abstract
Neuronal stimulation induced by the brain-derived neurotrophic factor (BDNF) triggers gene expression, which is crucial for neuronal survival, differentiation, synaptic plasticity, memory formation, and neurocognitive health. However, its role in chromatin regulation is unclear. Here, using temporal profiling of chromatin accessibility and transcription in mouse primary cortical neurons upon either BDNF stimulation or depolarization (KCl), we identify features that define BDNF-specific chromatin-to-gene expression programs. Enhancer activation is an early event in the regulatory control of BDNF-treated neurons, where the bZIP motif-binding Fos protein pioneered chromatin opening and cooperated with co-regulatory transcription factors (Homeobox, EGRs, and CTCF) to induce transcription. Deleting cis-regulatory sequences affect BDNF-mediated Arc expression, a regulator of synaptic plasticity. BDNF-induced accessible regions are linked to preferential exon usage by neurodevelopmental disorder-related genes and the heritability of neuronal complex traits, which were validated in human iPSC-derived neurons. Thus, we provide a comprehensive view of BDNF-mediated genome regulatory features using comparative genomic approaches to dissect mammalian neuronal stimulation.
Collapse
|
12
|
Abstract 3581: Multi-omic analysis identifies mechanisms of resistance to CD19 CAR T-cell therapy in children with acute lymphoblastic leukemia. Cancer Res 2022. [DOI: 10.1158/1538-7445.am2022-3581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Acute lymphoblastic leukemia (ALL) is the most common childhood cancer. Despite the survival rate of 90% for newly diagnosed children with ALL, the outcome for relapsed patients is historically poor with a less than 30% survival. CD19 CAR T-cell therapy (CART19) has shown remarkable response rates, between 80-90% in relapsed/refractory disease. Little is known about antigen-independent factors that predict initial resistance to CART19. We hypothesized that leukemias that are resistant to CART19 are distinct from sensitive leukemias and that these differences can be detected prior to therapy.
Methods: To interrogate differences between resistant and sensitive leukemias, we obtained pre-treatment bone marrow aspirates (BMAs) from patients enrolled in a clinical trial at Seattle Children’s Hospital (PLAT-02). Samples were categorized based on patient response, with non-response defined as not achieving and maintaining minimal residual disease negativity at Day +63. Our study included 7 resistant and 7 sensitive leukemias as controls. We performed whole exome sequencing, bulk RNA-seq, PacBio-seq of the CD19 locus, array-based methylation, ATAC-seq, scRNA-seq, and CyTOF.
Results: We found that non-response to CART19 is independent of leukemic subtype. Despite blasts being CD19+ in all patients by flow cytometry, we identified alternative splicing of CD19 in one non-responder, while the remaining non-responders expressed high levels of wildtype CD19. We discovered a distinctive DNA methylation pattern in the non-responders characterized by hypermethylation of PRC2 targets in embryonic and cancer stem cells (p = 8.15E-25) Furthermore, using gene set enrichment analysis of ATAC-seq data, we found increased accessibility of chromatin at regions associated with stem cell proliferation (NES = 2.31; p < 0.0001) and cell cycling (NES = 2.27; p < 0.0001). We found a greater similarity between accessibility patterns of non-responders to hematopoietic progenitors, including hematopoietic stem cells (p = 0.037) and common myeloid progenitors (p = 0.047). These findings were supported by an increased frequency of cell subpopulations expressing a multi-lineage phenotype (CD19, CD20, CD33, CD34; p = 0.009). Moreover, we find decreased expression of antigen presentation and processing pathways across all leukemic cells relative to responders (p = 0.0001).
Conclusions: This study, one of the most comprehensive multi-omic analyses of samples from patients treated with CAR T-cells, identified resistance mechanisms that can be detected prior to treatment. We report the novel association of a stem cell phenotype, lineage plasticity, and decreased antigen presentation with resistance. These results support further refinement of eligibility for CART19 for children with leukemia and highlights the need for alternative of complimentary approaches for these patients.
Citation Format: Katherine E. Masih, Rebecca Gardner, Hsien-Chao Chou, Abdalla Abdelmaksoud, Young K. Song, Luca Mariani, Vineela Gangalapudi, Berkley E. Gryder, Ashley Wilson, Serifat O. Adebola, Benjamin Z. Stanton, Chaoyu Wang, Xinyu Wen, Gregoire Altan-Bonnet, Michael C. Kelly, Jun S. Wei, Martha L. Bulyk, Michael C. Jensen, Rimas J. Orentas, Javed Khan. Multi-omic analysis identifies mechanisms of resistance to CD19 CAR T-cell therapy in children with acute lymphoblastic leukemia [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 3581.
Collapse
|
13
|
Trans-omics analysis of insulin action reveals a cell growth subnetwork which co-regulates anabolic processes. iScience 2022; 25:104231. [PMID: 35494245 PMCID: PMC9044165 DOI: 10.1016/j.isci.2022.104231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 03/09/2022] [Accepted: 04/06/2022] [Indexed: 12/16/2022] Open
Abstract
Insulin signaling promotes anabolic metabolism to regulate cell growth through multi-omic interactions. To obtain a comprehensive view of the cellular responses to insulin, we constructed a trans-omic network of insulin action in Drosophila cells that involves the integration of multi-omic data sets. In this network, 14 transcription factors, including Myc, coordinately upregulate the gene expression of anabolic processes such as nucleotide synthesis, transcription, and translation, consistent with decreases in metabolites such as nucleotide triphosphates and proteinogenic amino acids required for transcription and translation. Next, as cell growth is required for cell proliferation and insulin can stimulate proliferation in a context-dependent manner, we integrated the trans-omic network with results from a CRISPR functional screen for cell proliferation. This analysis validates the role of a Myc-mediated subnetwork that coordinates the activation of genes involved in anabolic processes required for cell growth. A trans-omic network of insulin action in Drosophila cells was constructed Insulin co-regulates various anabolic processes in a time-dependent manner The trans-omic network and a CRISPR screen for cell proliferation were integrated A Myc-mediated subnetwork promoting anabolic processes is required for cell growth
Collapse
|
14
|
EP300 Selectively Controls the Enhancer Landscape of MYCN-Amplified Neuroblastoma. Cancer Discov 2022; 12:730-751. [PMID: 34772733 PMCID: PMC8904277 DOI: 10.1158/2159-8290.cd-21-0385] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 08/25/2021] [Accepted: 11/08/2021] [Indexed: 01/09/2023]
Abstract
Gene expression is regulated by promoters and enhancers marked by histone H3 lysine 27 acetylation (H3K27ac), which is established by the paralogous histone acetyltransferases (HAT) EP300 and CBP. These enzymes display overlapping regulatory roles in untransformed cells, but less characterized roles in cancer cells. We demonstrate that the majority of high-risk pediatric neuroblastoma (NB) depends on EP300, whereas CBP has a limited role. EP300 controls enhancer acetylation by interacting with TFAP2β, a transcription factor member of the lineage-defining transcriptional core regulatory circuitry (CRC) in NB. To disrupt EP300, we developed a proteolysis-targeting chimera (PROTAC) compound termed "JQAD1" that selectively targets EP300 for degradation. JQAD1 treatment causes loss of H3K27ac at CRC enhancers and rapid NB apoptosis, with limited toxicity to untransformed cells where CBP may compensate. Furthermore, JQAD1 activity is critically determined by cereblon (CRBN) expression across NB cells. SIGNIFICANCE EP300, but not CBP, controls oncogenic CRC-driven transcription in high-risk NB by binding TFAP2β. We developed JQAD1, a CRBN-dependent PROTAC degrader with preferential activity against EP300 and demonstrated its activity in NB. JQAD1 has limited toxicity to untransformed cells and is effective in vivo in a CRBN-dependent manner. This article is highlighted in the In This Issue feature, p. 587.
Collapse
|
15
|
Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2022; 27:223-230. [PMID: 34890151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The continued generation of large amounts of data within healthcare-from imaging to electronic medical health records to genomics and multi-omics -necessitates tools and methods to parse and interpret these data to improve healthcare outcomes. Artificial intelligence, and in particular deep learning, has enabled researchers to gain new insights from large scale and multimodal data. At the 2022 Pacific Symposium on Biocomputing (PSB) session entitled "Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare", we showcase the latest research, influenced and inspired by the idea of using technology to build a more fair, tailored, and cost-effective healthcare system after the COVID-19 pandemic.
Collapse
|
16
|
Quantitative-enhancer-FACS-seq (QeFS) reveals epistatic interactions among motifs within transcriptional enhancers in developing Drosophila tissue. Genome Biol 2021; 22:348. [PMID: 34930411 PMCID: PMC8686523 DOI: 10.1186/s13059-021-02574-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 12/10/2021] [Indexed: 11/16/2022] Open
Abstract
Understanding the contributions of transcription factor DNA binding sites to transcriptional enhancers is a significant challenge. We developed Quantitative enhancer-FACS-Seq for highly parallel quantification of enhancer activities from a genomically integrated reporter in Drosophila melanogaster embryos. We investigate the contributions of the DNA binding motifs of four poorly characterized TFs to the activities of twelve embryonic mesodermal enhancers. We measure quantitative changes in enhancer activity and discover a range of epistatic interactions among the motifs, both synergistic and alleviating. We find that understanding the regulatory consequences of TF binding motifs requires that they be investigated in combination across enhancer contexts.
Collapse
|
17
|
Lineage-specific control of convergent differentiation by a Forkhead repressor. Development 2021; 148:272306. [PMID: 34423346 DOI: 10.1242/dev.199493] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 08/17/2021] [Indexed: 12/14/2022]
Abstract
During convergent differentiation, multiple developmental lineages produce a highly similar or identical cell type. However, few molecular players that drive convergent differentiation are known. Here, we show that the C. elegans Forkhead transcription factor UNC-130 is required in only one of three convergent lineages that produce the same glial cell type. UNC-130 acts transiently as a repressor in progenitors and newly-born terminal cells to allow the proper specification of cells related by lineage rather than by cell type or function. Specification defects correlate with UNC-130:DNA binding, and UNC-130 can be functionally replaced by its human homolog, the neural crest lineage determinant FoxD3. We propose that, in contrast to terminal selectors that activate cell type-specific transcriptional programs in terminally differentiating cells, UNC-130 acts early and specifically in one convergent lineage to produce a cell type that also arises from molecularly distinct progenitors in other lineages.
Collapse
|
18
|
A ChIP-exo screen of 887 Protein Capture Reagents Program transcription factor antibodies in human cells. Genome Res 2021; 31:1663-1679. [PMID: 34426512 PMCID: PMC8415381 DOI: 10.1101/gr.275472.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 07/07/2021] [Indexed: 12/22/2022]
Abstract
Antibodies offer a powerful means to interrogate specific proteins in a complex milieu. However, antibody availability and reliability can be problematic, whereas epitope tagging can be impractical in many cases. To address these limitations, the Protein Capture Reagents Program (PCRP) generated over a thousand renewable monoclonal antibodies (mAbs) against human presumptive chromatin proteins. However, these reagents have not been widely field-tested. We therefore performed a screen to test their ability to enrich genomic regions via chromatin immunoprecipitation (ChIP) and a variety of orthogonal assays. Eight hundred eighty-seven unique antibodies against 681 unique human transcription factors (TFs) were assayed by ultra-high-resolution ChIP-exo/seq, generating approximately 1200 ChIP-exo data sets, primarily in a single pass in one cell type (K562). Subsets of PCRP mAbs were further tested in ChIP-seq, CUT&RUN, STORM super-resolution microscopy, immunoblots, and protein binding microarray (PBM) experiments. About 5% of the tested antibodies displayed high-confidence target (i.e., cognate antigen) enrichment across at least one assay and are strong candidates for additional validation. An additional 34% produced ChIP-exo data that were distinct from background and thus warrant further testing. The remaining 61% were not substantially different from background, and likely require consideration of a much broader survey of cell types and/or assay optimizations. We show and discuss the metrics and challenges to antibody validation in chromatin-based assays.
Collapse
|
19
|
The SAM domain-containing protein 1 (SAMD1) acts as a repressive chromatin regulator at unmethylated CpG islands. SCIENCE ADVANCES 2021; 7:7/20/eabf2229. [PMID: 33980486 PMCID: PMC8115922 DOI: 10.1126/sciadv.abf2229] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 03/25/2021] [Indexed: 05/06/2023]
Abstract
CpG islands (CGIs) are key regulatory DNA elements at most promoters, but how they influence the chromatin status and transcription remains elusive. Here, we identify and characterize SAMD1 (SAM domain-containing protein 1) as an unmethylated CGI-binding protein. SAMD1 has an atypical winged-helix domain that directly recognizes unmethylated CpG-containing DNA via simultaneous interactions with both the major and the minor groove. The SAM domain interacts with L3MBTL3, but it can also homopolymerize into a closed pentameric ring. At a genome-wide level, SAMD1 localizes to H3K4me3-decorated CGIs, where it acts as a repressor. SAMD1 tethers L3MBTL3 to chromatin and interacts with the KDM1A histone demethylase complex to modulate H3K4me2 and H3K4me3 levels at CGIs, thereby providing a mechanism for SAMD1-mediated transcriptional repression. The absence of SAMD1 impairs ES cell differentiation processes, leading to misregulation of key biological pathways. Together, our work establishes SAMD1 as a newly identified chromatin regulator acting at unmethylated CGIs.
Collapse
|
20
|
Zinc Finger Protein SALL4 Functions through an AT-Rich Motif to Regulate Gene Expression. Cell Rep 2021; 34:108574. [PMID: 33406418 PMCID: PMC8197658 DOI: 10.1016/j.celrep.2020.108574] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Revised: 10/29/2020] [Accepted: 12/08/2020] [Indexed: 11/19/2022] Open
Abstract
The zinc finger transcription factor SALL4 is highly expressed in embryonic stem cells, downregulated in most adult tissues, but reactivated in many aggressive cancers. This unique expression pattern makes SALL4 an attractive therapeutic target. However, whether SALL4 binds DNA directly to regulate gene expression is unclear, and many of its targets in cancer cells remain elusive. Here, through an unbiased screen of protein binding microarray (PBM) and cleavage under targets and release using nuclease (CUT&RUN) experiments, we identify and validate the DNA binding domain of SALL4 and its consensus binding sequence. Combined with RNA sequencing (RNA-seq) analyses after SALL4 knockdown, we discover hundreds of new SALL4 target genes that it directly regulates in aggressive liver cancer cells, including genes encoding a family of histone 3 lysine 9-specific demethylases (KDMs). Taken together, these results elucidate the mechanism of SALL4 DNA binding and reveal pathways and molecules to target in SALL4-dependent tumors. In this paper, Kong et al. elucidate the DNA binding mechanisms of the transcription factor SALL4 and an epigenetic pathway that it regulates. Due to its important role in driving aggressive cancers, better understanding of SALL4 function will lead to strategies to target this protein in cancer.
Collapse
|
21
|
Common variants in signaling transcription-factor-binding sites drive phenotypic variability in red blood cell traits. Nat Genet 2020; 52:1333-1345. [PMID: 33230299 DOI: 10.1038/s41588-020-00738-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 10/14/2020] [Indexed: 12/13/2022]
Abstract
Genome-wide association studies identify genomic variants associated with human traits and diseases. Most trait-associated variants are located within cell-type-specific enhancers, but the molecular mechanisms governing phenotypic variation are less well understood. Here, we show that many enhancer variants associated with red blood cell (RBC) traits map to enhancers that are co-bound by lineage-specific master transcription factors (MTFs) and signaling transcription factors (STFs) responsive to extracellular signals. The majority of enhancer variants reside on STF and not MTF motifs, perturbing DNA binding by various STFs (BMP/TGF-β-directed SMADs or WNT-induced TCFs) and affecting target gene expression. Analyses of engineered human blood cells and expression quantitative trait loci verify that disrupted STF binding leads to altered gene expression. Our results propose that the majority of the RBC-trait-associated variants that reside on transcription-factor-binding sequences fall in STF target sequences, suggesting that the phenotypic variation of RBC traits could stem from altered responsiveness to extracellular stimuli.
Collapse
|
22
|
A Comprehensive Drosophila melanogaster Transcription Factor Interactome. Cell Rep 2020; 27:955-970.e7. [PMID: 30995488 PMCID: PMC6485956 DOI: 10.1016/j.celrep.2019.03.071] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 02/04/2019] [Accepted: 03/18/2019] [Indexed: 12/14/2022] Open
Abstract
Combinatorial interactions among transcription factors (TFs) play essential roles in generating gene expression specificity and diversity in metazoans. Using yeast 2-hybrid (Y2H) assays on nearly all sequence-specific Drosophila TFs, we identified 1,983 protein-protein interactions (PPIs), more than doubling the number of currently known PPIs among Drosophila TFs. For quality assessment, we validated a subset of our interactions using MITOMI and bimolecular fluorescence complementation assays. We combined our interactome with prior PPI data to generate an integrated Drosophila TF-TF binary interaction network. Our analysis of ChIP-seq data, integrating PPI and gene expression information, uncovered different modes by which interacting TFs are recruited to DNA. We further demonstrate the utility of our Drosophila interactome in shedding light on human TF-TF interactions. This study reveals how TFs interact to bind regulatory elements in vivo and serves as a resource of Drosophila TF-TF binary PPIs for understanding tissue-specific gene regulation. Combinatorial regulation by transcription factors (TFs) is one mechanism for achieving condition and tissue-specific gene regulation. Shokri et al. mapped TF-TF interactions between most Drosophila TFs, reporting a comprehensive TF-TF network integrated with previously known interactions. They used this network to discern distinct TF-DNA binding modes.
Collapse
|
23
|
MEDEA: analysis of transcription factor binding motifs in accessible chromatin. Genome Res 2020; 30:736-748. [PMID: 32424069 PMCID: PMC7263192 DOI: 10.1101/gr.260877.120] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Accepted: 04/10/2020] [Indexed: 12/15/2022]
Abstract
Deciphering the interplay between chromatin accessibility and transcription factor (TF) binding is fundamental to understanding transcriptional regulation, control of cellular states, and the establishment of new phenotypes. Recent genome-wide chromatin accessibility profiling studies have provided catalogs of putative open regions, where TFs can recognize their motifs and regulate gene expression programs. Here, we present motif enrichment in differential elements of accessibility (MEDEA), a computational tool that analyzes high-throughput chromatin accessibility genomic data to identify cell-type-specific accessible regions and lineage-specific motifs associated with TF binding therein. To benchmark MEDEA, we used a panel of reference cell lines profiled by ENCODE and curated by the ENCODE Project Consortium for the ENCODE-DREAM Challenge. By comparing results with RNA-seq data, ChIP-seq peaks, and DNase-seq footprints, we show that MEDEA improves the detection of motifs associated with known lineage specifiers. We then applied MEDEA to 610 ENCODE DNase-seq data sets, where it revealed significant motifs even when absolute enrichment was low and where it identified novel regulators, such as NRF1 in kidney development. Finally, we show that MEDEA performs well on both bulk and single-cell ATAC-seq data. MEDEA is publicly available as part of our Glossary-GENRE suite for motif enrichment analysis.
Collapse
|
24
|
Context and number of noncanonical repeat variable diresidues impede the design of TALE proteins with improved DNA targeting. Protein Sci 2019; 29:606-616. [PMID: 31833142 DOI: 10.1002/pro.3801] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 11/27/2019] [Accepted: 12/02/2019] [Indexed: 12/18/2022]
Abstract
Transcription activator-like effector (TALE) proteins have been used extensively for targeted binding of fusion proteins to loci of interest in (epi)genome engineering. Such approaches typically utilize four canonical TALE repeat variable diresidue (RVD) types, corresponding to the identities of two key amino acids, to target each nucleotide. Alternate RVDs with improved specificity are desired. Here, we focused on seven noncanonical RVDs that have been suggested to have improved specificity for their target nucleotides. We used custom protein binding microarrays to characterize the DNA-binding activity of 65 TALEs containing these alternate or corresponding canonical RVDs at multiple positions to ~5,000 unique DNA sequences per protein. We found that none of the noncanonical thymine-targeting RVDs displayed stronger preference for thymine than did the canonical RVD. Of the noncanonical RVDs with putatively improved specificity for guanine, only EN and NH showed greater discrimination of guanine over adenine. This improved specificity, however, comes at a cost: more substitutions of a noncanonical RVD for a canonical RVD generally decreased the protein's DNA-binding activity. Our results highlight the need to investigate RVD-nucleotide specificities in multiple protein contexts and suggest that a balance between canonical and noncanonical RVDs is needed to build TALEs with improved specificity.
Collapse
|
25
|
Transcriptional Silencers in Drosophila Serve a Dual Role as Transcriptional Enhancers in Alternate Cellular Contexts. Mol Cell 2019; 77:324-337.e8. [PMID: 31704182 DOI: 10.1016/j.molcel.2019.10.004] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Revised: 08/15/2019] [Accepted: 10/01/2019] [Indexed: 12/26/2022]
Abstract
A major challenge in biology is to understand how complex gene expression patterns are encoded in the genome. While transcriptional enhancers have been studied extensively, few transcriptional silencers have been identified, and they remain poorly understood. Here, we used a novel strategy to screen hundreds of sequences for tissue-specific silencer activity in whole Drosophila embryos. Almost all of the transcriptional silencers that we identified were also active enhancers in other cellular contexts. These elements are bound by more transcription factors than non-silencers. A subset of these silencers forms long-range contacts with promoters. Deletion of a silencer caused derepression of its target gene. Our results challenge the common practice of treating enhancers and silencers as separate classes of regulatory elements and suggest the possibility that thousands or more bifunctional CRMs remain to be discovered in Drosophila and 104-105 in humans.
Collapse
|
26
|
Interspecies analysis of MYC targets identifies tRNA synthetases as mediators of growth and survival in MYC-overexpressing cells. Proc Natl Acad Sci U S A 2019; 116:14614-14619. [PMID: 31262815 PMCID: PMC6642371 DOI: 10.1073/pnas.1821863116] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Aberrant MYC oncogene activation is one of the most prevalent characteristics of cancer. By overlapping datasets of Drosophila genes that are insulin-responsive and also regulate nucleolus size, we enriched for Myc target genes required for cellular biosynthesis. Among these, we identified the aminoacyl tRNA synthetases (aaRSs) as essential mediators of Myc growth control in Drosophila and found that their pharmacologic inhibition is sufficient to kill MYC-overexpressing human cells, indicating that aaRS inhibitors might be used to selectively target MYC-driven cancers. We suggest a general principle in which oncogenic increases in cellular biosynthesis sensitize cells to disruption of protein homeostasis.
Collapse
|
27
|
Identification of Human Lineage-Specific Transcriptional Coregulators Enabled by a Glossary of Binding Modules and Tunable Genomic Backgrounds. Cell Syst 2019; 5:187-201.e7. [PMID: 28957653 DOI: 10.1016/j.cels.2017.06.015] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Revised: 06/03/2017] [Accepted: 06/29/2017] [Indexed: 01/08/2023]
Abstract
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs.
Collapse
|
28
|
Bispecific Forkhead Transcription Factor FoxN3 Recognizes Two Distinct Motifs with Different DNA Shapes. Mol Cell 2019; 74:245-253.e6. [PMID: 30826165 PMCID: PMC6474805 DOI: 10.1016/j.molcel.2019.01.019] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Revised: 12/17/2018] [Accepted: 01/11/2019] [Indexed: 12/13/2022]
Abstract
Transcription factors (TFs) control gene expression by binding DNA recognition sites in genomic regulatory regions. Although most forkhead TFs recognize a canonical forkhead (FKH) motif, RYAAAYA, some forkheads recognize a completely different (FHL) motif, GACGC. Bispecific forkhead proteins recognize both motifs, but the molecular basis for bispecific DNA recognition is not understood. We present co-crystal structures of the FoxN3 DNA binding domain bound to the FKH and FHL sites, respectively. FoxN3 adopts a similar conformation to recognize both motifs, making contacts with different DNA bases using the same amino acids. However, the DNA structure is different in the two complexes. These structures reveal how a single TF binds two unrelated DNA sequences and the importance of DNA shape in the mechanism of bispecific recognition.
Collapse
|
29
|
|
30
|
Workshop during the Pacific Symposium of Biocomputing, Jan 3-7, 2019: Reading between the genes: interpreting non-coding DNA in high-throughput. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019; 24:444-448. [PMID: 30864345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Identifying functional elements and predicting mechanistic insight from non-coding DNA and noncoding variation remains a challenge. Advances in genome-scale, high-throughput technology, however, have brought these answers closer within reach than ever, though there is still a need for new computational approaches to analysis and integration. This workshop aims to explore these resources and new computational methods applied to regulatory elements, chromatin interactions, non-protein-coding genes, and other non-coding DNA.
Collapse
|
31
|
Ancient mechanisms for the evolution of the bicoid homeodomain's function in fly development. eLife 2018; 7:e34594. [PMID: 30298815 PMCID: PMC6177261 DOI: 10.7554/elife.34594] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 07/28/2018] [Indexed: 12/14/2022] Open
Abstract
The ancient mechanisms that caused developmental gene regulatory networks to diversify among distantly related taxa are not well understood. Here we use ancestral protein reconstruction, biochemical experiments, and developmental assays of transgenic animals carrying reconstructed ancestral genes to investigate how the transcription factor Bicoid (Bcd) evolved its central role in anterior-posterior patterning in flies. We show that most of Bcd's derived functions are attributable to evolutionary changes within its homeodomain (HD) during a phylogenetic interval >140 million years ago. A single substitution from this period (Q50K) accounts almost entirely for the evolution of Bcd's derived DNA specificity in vitro. In transgenic embryos expressing the reconstructed ancestral HD, however, Q50K confers activation of only a few of Bcd's transcriptional targets and yields a very partial rescue of anterior development. Adding a second historical substitution (M54R) confers regulation of additional Bcd targets and further rescues anterior development. These results indicate that two epistatically interacting mutations played a major role in the evolution of Bcd's controlling regulatory role in early development. They also show how ancestral sequence reconstruction can be combined with in vivo characterization of transgenic animals to illuminate the historical mechanisms of developmental evolution.
Collapse
|
32
|
Diversification of transcription factor-DNA interactions and the evolution of gene regulatory networks. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2018; 10:e1423. [PMID: 29694718 PMCID: PMC6202284 DOI: 10.1002/wsbm.1423] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Revised: 02/23/2018] [Accepted: 03/11/2018] [Indexed: 01/17/2023]
Abstract
Sequence-specific transcription factors (TFs) bind short DNA sequences in the genome to regulate the expression of target genes. In the last decade, numerous technical advances have enabled the determination of the DNA-binding specificities of many of these factors. Large-scale screens of many TFs enabled the creation of databases of TF DNA-binding specificities, typically represented as position weight matrices (PWMs). Although great progress has been made in determining and predicting binding specificities systematically, there are still many surprises to be found when studying a particular TF's interactions with DNA in detail. Paralogous TFs' binding specificities can differ in subtle ways, in a manner that is not immediately apparent from looking at their PWMs. These differences affect gene regulatory outputs and enable TFs to rewire transcriptional networks over evolutionary time. This review discusses recent observations made in the study of TF-DNA interactions that highlight the importance of continued in-depth analysis of TF-DNA interactions and their inherent complexity. This article is categorized under: Biological Mechanisms > Regulatory Biology.
Collapse
|
33
|
A feed-forward relay integrates the regulatory activities of Bicoid and Orthodenticle via sequential binding to suboptimal sites. Genes Dev 2018; 32:723-736. [PMID: 29764918 PMCID: PMC6004077 DOI: 10.1101/gad.311985.118] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Accepted: 04/17/2018] [Indexed: 11/25/2022]
Abstract
Datta et al. define three major classes of enhancers that are differentially sensitive to binding and transcriptional activation by Bicoid (Bcd) and Orthodenticle (Otd). The specific activities of enhancers in each class are mediated by DNA motif variants preferentially bound by Bcd or Otd and the presence or absence of sites for cofactors that interact with these proteins. The K50 (lysine at amino acid position 50) homeodomain (HD) protein Orthodenticle (Otd) is critical for anterior patterning and brain and eye development in most metazoans. In Drosophila melanogaster, another K50HD protein, Bicoid (Bcd), has evolved to replace Otd's ancestral function in embryo patterning. Bcd is distributed as a long-range maternal gradient and activates transcription of a large number of target genes, including otd. Otd and Bcd bind similar DNA sequences in vitro, but how their transcriptional activities are integrated to pattern anterior regions of the embryo is unknown. Here we define three major classes of enhancers that are differentially sensitive to binding and transcriptional activation by Bcd and Otd. Class 1 enhancers are initially activated by Bcd, and activation is transferred to Otd via a feed-forward relay (FFR) that involves sequential binding of the two proteins to the same DNA motif. Class 2 enhancers are activated by Bcd and maintained by an Otd-independent mechanism. Class 3 enhancers are never bound by Bcd, but Otd binds and activates them in a second wave of zygotic transcription. The specific activities of enhancers in each class are mediated by DNA motif variants preferentially bound by Bcd or Otd and the presence or absence of sites for cofactors that interact with these proteins. Our results define specific patterning roles for Bcd and Otd and provide mechanisms for coordinating the precise timing of gene expression patterns during embryonic development.
Collapse
|
34
|
Direct Promoter Repression by BCL11A Controls the Fetal to Adult Hemoglobin Switch. Cell 2018; 173:430-442.e17. [PMID: 29606353 DOI: 10.1016/j.cell.2018.03.016] [Citation(s) in RCA: 270] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Revised: 01/16/2018] [Accepted: 03/06/2018] [Indexed: 01/06/2023]
Abstract
Fetal hemoglobin (HbF, α2γ2) level is genetically controlled and modifies severity of adult hemoglobin (HbA, α2β2) disorders, sickle cell disease, and β-thalassemia. Common genetic variation affects expression of BCL11A, a regulator of HbF silencing. To uncover how BCL11A supports the developmental switch from γ- to β- globin, we use a functional assay and protein binding microarray to establish a requirement for a zinc-finger cluster in BCL11A in repression and identify a preferred DNA recognition sequence. This motif appears in embryonic and fetal-expressed globin promoters and is duplicated in γ-globin promoters. The more distal of the duplicated motifs is mutated in individuals with hereditary persistence of HbF. Using the CUT&RUN approach to map protein binding sites in erythroid cells, we demonstrate BCL11A occupancy preferentially at the distal motif, which can be disrupted by editing the promoter. Our findings reveal that direct γ-globin gene promoter repression by BCL11A underlies hemoglobin switching.
Collapse
|
35
|
Differential Occupancy of Two GA-Binding Proteins Promotes Targeting of the Drosophila Dosage Compensation Complex to the Male X Chromosome. Cell Rep 2018; 22:3227-3239. [PMID: 29562179 PMCID: PMC6402580 DOI: 10.1016/j.celrep.2018.02.098] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 01/09/2018] [Accepted: 02/25/2018] [Indexed: 01/28/2023] Open
Abstract
Little is known about how variation in sequence composition alters transcription factor occupancy to precisely recruit large transcription complexes. A key model for understanding how transcription complexes are targeted is the Drosophila dosage compensation system in which the male-specific lethal (MSL) transcription complex specifically identifies and regulates the male X chromosome. The chromatin-linked adaptor for MSL proteins (CLAMP) zinc-finger protein targets MSL to the X chromosome but also binds to GA-rich sequence elements throughout the genome. Furthermore, the GAGA-associated factor (GAF) transcription factor also recognizes GA-rich sequences but does not associate with the MSL complex. Here, we demonstrate that MSL complex recruitment sites are optimal CLAMP targets. Specificity for CLAMP binding versus GAF binding is driven by variability in sequence composition within similar GA-rich motifs. Therefore, variation within seemingly similar cis elements drives the context-specific targeting of a large transcription complex.
Collapse
|
36
|
Identification of Human Lineage-Specific Transcriptional Coregulators Enabled by a Glossary of Binding Modules and Tunable Genomic Backgrounds. Cell Syst 2017; 5:654. [PMID: 29284131 DOI: 10.1016/j.cels.2017.12.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
37
|
p53 pulses lead to distinct patterns of gene expression albeit similar DNA-binding dynamics. Nat Struct Mol Biol 2017; 24:840-847. [PMID: 28825732 DOI: 10.1038/nsmb.3452] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Accepted: 07/20/2017] [Indexed: 02/07/2023]
Abstract
The dynamics of transcription factors play important roles in a variety of biological systems. However, the mechanisms by which these dynamics are decoded into different transcriptional responses are not well understood. Here we focus on the dynamics of the tumor-suppressor protein p53, which exhibits a series of pulses in response to DNA damage. We performed time course RNA sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) measurements to determine how p53 oscillations are linked with gene expression genome wide. We discovered multiple distinct patterns of gene expression in response to p53 pulses. Surprisingly, p53-binding dynamics were uniform across all genomic loci, even for genes that exhibited distinct mRNA dynamics. Using a mathematical model, supported by additional experimental measurements in response to sustained p53 input, we determined that p53 binds to and activates transcription of its target genes uniformly, whereas post-transcriptional mechanisms are responsible for the differences in gene expression dynamics.
Collapse
|
38
|
Transcription factor-DNA binding: beyond binding site motifs. Curr Opin Genet Dev 2017; 43:110-119. [PMID: 28359978 PMCID: PMC5447501 DOI: 10.1016/j.gde.2017.02.007] [Citation(s) in RCA: 180] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Revised: 02/02/2017] [Accepted: 02/07/2017] [Indexed: 12/12/2022]
Abstract
Sequence-specific transcription factors (TFs) regulate gene expression by binding to cis-regulatory elements in promoter and enhancer DNA. While studies of TF-DNA binding have focused on TFs' intrinsic preferences for primary nucleotide sequence motifs, recent studies have elucidated additional layers of complexity that modulate TF-DNA binding. In this review, we discuss technological developments for identifying TF binding preferences and highlight recent discoveries that elaborate how TF interactions, local DNA structure, and genomic features influence TF-DNA binding. We highlight novel approaches for characterizing functional binding site motifs that promise to inform our understanding of how TF binding controls gene expression and ultimately contributes to phenotype.
Collapse
|
39
|
CellMapper: rapid and accurate inference of gene expression in difficult-to-isolate cell types. Genome Biol 2016; 17:201. [PMID: 27687735 PMCID: PMC5043525 DOI: 10.1186/s13059-016-1062-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2016] [Accepted: 09/13/2016] [Indexed: 02/25/2023] Open
Abstract
We present a sensitive approach to predict genes expressed selectively in specific cell types, by searching publicly available expression data for genes with a similar expression profile to known cell-specific markers. Our method, CellMapper, strongly outperforms previous computational algorithms to predict cell type-specific expression, especially for rare and difficult-to-isolate cell types. Furthermore, CellMapper makes accurate predictions for human brain cell types that have never been isolated, and can be rapidly applied to diverse cell types from many tissues. We demonstrate a clinically relevant application to prioritize candidate genes in disease susceptibility loci identified by GWAS.
Collapse
|
40
|
Expansion of GA Dinucleotide Repeats Increases the Density of CLAMP Binding Sites on the X-Chromosome to Promote Drosophila Dosage Compensation. PLoS Genet 2016; 12:e1006120. [PMID: 27414415 PMCID: PMC4945028 DOI: 10.1371/journal.pgen.1006120] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 05/23/2016] [Indexed: 12/15/2022] Open
Abstract
Dosage compensation is an essential process that equalizes transcript levels of X-linked genes between sexes by forming a domain of coordinated gene expression. Throughout the evolution of Diptera, many different X-chromosomes acquired the ability to be dosage compensated. Once each newly evolved X-chromosome is targeted for dosage compensation in XY males, its active genes are upregulated two-fold to equalize gene expression with XX females. In Drosophila melanogaster, the CLAMP zinc finger protein links the dosage compensation complex to the X-chromosome. However, the mechanism for X-chromosome identification has remained unknown. Here, we combine biochemical, genomic and evolutionary approaches to reveal that expansion of GA-dinucleotide repeats likely accumulated on the X-chromosome over evolutionary time to increase the density of CLAMP binding sites, thereby driving the evolution of dosage compensation. Overall, we present new insight into how subtle changes in genomic architecture, such as expansions of a simple sequence repeat, promote the evolution of coordinated gene expression.
Collapse
|
41
|
Survey of variation in human transcription factors reveals prevalent DNA binding changes. Science 2016; 351:1450-1454. [PMID: 27013732 PMCID: PMC4825693 DOI: 10.1126/science.aad2257] [Citation(s) in RCA: 100] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Accepted: 02/18/2016] [Indexed: 12/13/2022]
Abstract
Sequencing of exomes and genomes has revealed abundant genetic variation affecting the coding sequences of human transcription factors (TFs), but the consequences of such variation remain largely unexplored. We developed a computational, structure-based approach to evaluate TF variants for their impact on DNA binding activity and used universal protein-binding microarrays to assay sequence-specific DNA binding activity across 41 reference and 117 variant alleles found in individuals of diverse ancestries and families with Mendelian diseases. We found 77 variants in 28 genes that affect DNA binding affinity or specificity and identified thousands of rare alleles likely to alter the DNA binding activity of human sequence-specific TFs. Our results suggest that most individuals have unique repertoires of TF DNA binding activities, which may contribute to phenotypic variation.
Collapse
|
42
|
Phosphorylation of the chromatin remodeling factor DPF3a induces cardiac hypertrophy through releasing HEY repressors from DNA. Nucleic Acids Res 2015; 44:2538-53. [PMID: 26582913 PMCID: PMC4824069 DOI: 10.1093/nar/gkv1244] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Accepted: 11/01/2015] [Indexed: 01/09/2023] Open
Abstract
DPF3 (BAF45c) is a member of the BAF chromatin remodeling complex. Two isoforms have been described, namely DPF3a and DPF3b. The latter binds to acetylated and methylated lysine residues of histones. Here, we elaborate on the role of DPF3a and describe a novel pathway of cardiac gene transcription leading to pathological cardiac hypertrophy. Upon hypertrophic stimuli, casein kinase 2 phosphorylates DPF3a at serine 348. This initiates the interaction of DPF3a with the transcriptional repressors HEY, followed by the release of HEY from the DNA. Moreover, BRG1 is bound by DPF3a, and is thus recruited to HEY genomic targets upon interaction of the two components. Consequently, the transcription of downstream targets such as NPPA and GATA4 is initiated and pathological cardiac hypertrophy is established. In human, DPF3a is significantly up-regulated in hypertrophic hearts of patients with hypertrophic cardiomyopathy or aortic stenosis. Taken together, we show that activation of DPF3a upon hypertrophic stimuli switches cardiac fetal gene expression from being silenced by HEY to being activated by BRG1. Thus, we present a novel pathway for pathological cardiac hypertrophy, whose inhibition is a long-term therapeutic goal for the treatment of the course of heart failure.
Collapse
|
43
|
Abstract
The mitochondrial deacetylase SIRT3 regulates several important metabolic processes. SIRT3 is transcriptionally upregulated in multiple tissues during nutrient stresses such as dietary restriction and fasting, but the molecular mechanism of this induction is unclear. We conducted a bioinformatic study to identify transcription factor(s) involved in SIRT3 induction. Our analysis identified an enrichment of binding sites for nuclear respiratory factor 2 (NRF-2), a transcription factor known to play a role in the expression of mitochondrial genes, in the DNA sequences of SIRT3 and genes with closely correlated expression patterns. In vitro, knockdown or overexpression of NRF-2 modulated SIRT3 levels, and the NRF-2α subunit directly bound to the SIRT3 promoter. Our results suggest that NRF-2 is a regulator of SIRT3 expression and may shed light on how SIRT3 is upregulated during nutrient stress.
Collapse
|
44
|
Grhl2 is required in nonneural tissues for neural progenitor survival and forebrain development. Genesis 2015; 53:573-582. [PMID: 26177923 PMCID: PMC4713386 DOI: 10.1002/dvg.22875] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Revised: 07/06/2015] [Accepted: 07/07/2015] [Indexed: 11/06/2022]
Abstract
Grainyhead-like genes are part of a highly conserved gene family that play a number of roles in ectoderm development and maintenance in mammals. Here we identify a novel allele of Grhl2, cleft-face 3 (clft3), in a mouse line recovered from an ENU mutagenesis screen for organogenesis defects. Homozygous clft3 mutants have a number of phenotypes in common with other alleles of Grhl2. We note a significant effect of genetic background on the clft3 phenotype. One of these is a reduction in size of the telencephalon where we find abnormal patterns of neural progenitor mitosis and apoptosis in mutant brains. Interestingly, Grhl2 is not expressed in the developing forebrain, suggesting this is a survival factor for neural progenitors exerting a paracrine effect on the neural tissue from the overlying ectoderm where Grhl2 is highly expressed. genesis 53:573-582, 2015. © 2015 Wiley Periodicals, Inc.
Collapse
|
45
|
A direct fate exclusion mechanism by Sonic hedgehog-regulated transcriptional repressors. Development 2015; 142:3286-93. [PMID: 26293298 DOI: 10.1242/dev.124636] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 08/04/2015] [Indexed: 01/19/2023]
Abstract
Sonic hedgehog (Shh) signaling patterns the vertebrate spinal cord by activating a group of transcriptional repressors in distinct neural progenitors of somatic motor neuron and interneuron subtypes. To identify the action of this network, we performed a genome-wide analysis of the regulatory actions of three key ventral determinants in mammalian neural tube patterning: Nkx2.2, Nkx6.1 and Olig2. Previous studies have demonstrated that each factor acts predominantly as a transcriptional repressor, at least in part, to inhibit alternative progenitor fate choices. Here, we reveal broad and direct repression of multiple alternative fates as a general mechanism of repressor action. Additionally, the repressor network targets multiple Shh signaling components providing negative feedback to ongoing Shh signaling. Analysis of chromatin organization around Nkx2.2-, Nkx6.1- and Olig2-bound regions, together with co-analysis of engagement of the transcriptional activator Sox2, indicate that repressors bind to, and probably modulate the action of, neural enhancers. Together, the data suggest a model for neural progenitor specification downstream of Shh signaling, in which Nkx2.2 and Olig2 direct repression of alternative neural progenitor fate determinants, an action augmented by the overlapping activity of Nkx6.1 in each cell type. Integration of repressor and activator inputs, notably activator inputs mediated by Sox2, is probably a key mechanism in achieving cell type-specific transcriptional outcomes in mammalian neural progenitor fate specification.
Collapse
|
46
|
Context influences on TALE-DNA binding revealed by quantitative profiling. Nat Commun 2015; 6:7440. [PMID: 26067805 PMCID: PMC4467457 DOI: 10.1038/ncomms8440] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 05/08/2015] [Indexed: 12/13/2022] Open
Abstract
Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE-DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000-20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE-DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design.
Collapse
|
47
|
UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res 2014; 43:D117-22. [PMID: 25378322 PMCID: PMC4383892 DOI: 10.1093/nar/gku1045] [Citation(s) in RCA: 202] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The Universal PBM Resource for Oligonucleotide Binding Evaluation (UniPROBE) serves as a convenient source of information on published data generated using universal protein-binding microarray (PBM) technology, which provides in vitro data about the relative DNA-binding preferences of transcription factors for all possible sequence variants of a length k (‘k-mers’). The database displays important information about the proteins and displays their DNA-binding specificity data in terms of k-mers, position weight matrices and graphical sequence logos. This update to the database documents the growth of UniPROBE since the last update 4 years ago, and introduces a variety of new features and tools, including a new streamlined pipeline that facilitates data deposition by universal PBM data generators in the research community, a tool that generates putative nonbinding (i.e. negative control) DNA sequences for one or more proteins and novel motifs obtained by analyzing the PBM data using the BEEML-PBM algorithm for motif inference. The UniPROBE database is available at http://uniprobe.org.
Collapse
|
48
|
Modular evolution of DNA-binding preference of a Tbrain transcription factor provides a mechanism for modifying gene regulatory networks. Mol Biol Evol 2014; 31:2672-88. [PMID: 25016582 PMCID: PMC4166925 DOI: 10.1093/molbev/msu213] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Gene regulatory networks (GRNs) describe the progression of transcriptional states that take a single-celled zygote to a multicellular organism. It is well documented that GRNs can evolve extensively through mutations to cis-regulatory modules (CRMs). Transcription factor proteins that bind these CRMs may also evolve to produce novelty. Coding changes are considered to be rarer, however, because transcription factors are multifunctional and hence are more constrained to evolve in ways that will not produce widespread detrimental effects. Recent technological advances have unearthed a surprising variation in DNA-binding abilities, such that individual transcription factors may recognize both a preferred primary motif and an additional secondary motif. This provides a source of modularity in function. Here, we demonstrate that orthologous transcription factors can also evolve a changed preference for a secondary binding motif, thereby offering an unexplored mechanism for GRN evolution. Using protein-binding microarray, surface plasmon resonance, and in vivo reporter assays, we demonstrate an important difference in DNA-binding preference between Tbrain protein orthologs in two species of echinoderms, the sea star, Patiria miniata, and the sea urchin, Strongylocentrotus purpuratus. Although both orthologs recognize the same primary motif, only the sea star Tbr also has a secondary binding motif. Our in vivo assays demonstrate that this difference may allow for greater evolutionary change in timing of regulatory control. This uncovers a layer of transcription factor binding divergence that could exist for many pairs of orthologs. We hypothesize that this divergence provides modularity that allows orthologous transcription factors to evolve novel roles in GRNs through modification of binding to secondary sites.
Collapse
|
49
|
The NF-κB genomic landscape in lymphoblastoid B cells. Cell Rep 2014; 8:1595-606. [PMID: 25159142 DOI: 10.1016/j.celrep.2014.07.037] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Revised: 06/09/2014] [Accepted: 07/21/2014] [Indexed: 01/17/2023] Open
Abstract
The nuclear factor κB (NF-κΒ) subunits RelA, RelB, cRel, p50, and p52 are each critical for B cell development and function. To systematically characterize their responses to canonical and noncanonical NF-κB pathway activity, we performed chromatin immunoprecipitation followed by high-throughput DNA sequencing (ChIP-seq) analysis in lymphoblastoid B cell lines (LCLs). We found a complex NF-κB-binding landscape, which did not readily reflect the two NF-κB pathway paradigms. Instead, 10 subunit-binding patterns were observed at promoters and 11 at enhancers. Nearly one-third of NF-κB-binding sites lacked κB motifs and were instead enriched for alternative motifs. The oncogenic forkhead box protein FOXM1 co-occupied nearly half of NF-κB-binding sites and was identified in protein complexes with NF-κB on DNA. FOXM1 knockdown decreased NF-κB target gene expression and ultimately induced apoptosis, highlighting FOXM1 as a synthetic lethal target in B cell malignancy. These studies provide a resource for understanding mechanisms that underlie NF-κB nuclear activity and highlight opportunities for selective NF-κB blockade.
Collapse
|
50
|
Diversification of transcription factor paralogs via noncanonical modularity in C2H2 zinc finger DNA binding. Mol Cell 2014; 55:640-8. [PMID: 25042805 DOI: 10.1016/j.molcel.2014.06.019] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 05/27/2014] [Accepted: 06/09/2014] [Indexed: 12/25/2022]
Abstract
A major challenge in obtaining a full molecular description of evolutionary adaptation is to characterize how transcription factor (TF) DNA-binding specificity can change. To identify mechanisms of TF diversification, we performed detailed comparisons of yeast C2H2 ZF proteins with identical canonical recognition residues that are expected to bind the same DNA sequences. Unexpectedly, we found that ZF proteins can adapt to recognize new binding sites in a modular fashion whereby binding to common core sites remains unaffected. We identified two distinct mechanisms, conserved across multiple Ascomycota species, by which this molecular adaptation occurred. Our results suggest a route for TF evolution that alleviates negative pleiotropic effects by modularly gaining new binding sites. These findings expand our current understanding of ZF DNA binding and provide evidence for paralogous ZFs utilizing alternate modes of DNA binding to recognize unique sets of noncanonical binding sites.
Collapse
|