1
|
Due AD, Davey NE, Thomasen FE, Morffy N, Prestel A, Brakti I, O'Shea C, Strader LC, Lindorff‐Larsen K, Skriver K, Kragelund BB. Hierarchy in regulator interactions with distant transcriptional activation domains empowers rheostatic regulation. Protein Sci 2025; 34:e70142. [PMID: 40371733 PMCID: PMC12079402 DOI: 10.1002/pro.70142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2025] [Revised: 04/14/2025] [Accepted: 04/15/2025] [Indexed: 05/16/2025]
Abstract
Transcription factors carry long intrinsically disordered regions often containing multiple activation domains. Despite numerous recent high-throughput identifications and characterizations of activation domains, the interplay between sequence motifs, activation domains, and regulator binding in intrinsically disordered transcription factor regions remains unresolved. Here, we map sequence motifs and activation domains in an Arabidopsis thaliana NAC transcription factor clade, revealing that although sequence motifs and activation domains often coincide, no systematic overlap exists. Biophysical analyses using NMR spectroscopy show that the long intrinsically disordered region of senescence-associated transcription factor ANAC046 is devoid of residual structure. We identify two activation domain/sequence motif regions, one at each end that both bind a panel of six positive and negative regulator domains from biologically relevant regulators promiscuously. Binding affinities measured using isothermal titration calorimetry reveal a hierarchy for regulator binding of the two ANAC046 activation domain/sequence motif regions defining these as regulatory hotspots. Despite extensive dynamic intramolecular contacts along the disordered chain revealed using paramagnetic relaxation enhancement experiments and simulations, the regions remain uncoupled in binding. Together, the results imply rheostatic regulation by ANAC046 through concentration-dependent regulator competition, a mechanism likely mirrored in other transcription factors with distantly located activation domains.
Collapse
Affiliation(s)
- Amanda D. Due
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Norman E. Davey
- Division of Cancer BiologyThe Institute of Cancer ResearchLondonUK
| | - F. Emil Thomasen
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | | | - Andreas Prestel
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Inna Brakti
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Charlotte O'Shea
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
| | | | - Kresten Lindorff‐Larsen
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Karen Skriver
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
| | - Birthe B. Kragelund
- REPINUniversity of CopenhagenCopenhagenDenmark
- Linderstrøm‐Lang Centre for Protein ScienceUniversity of CopenhagenCopenhagenDenmark
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| |
Collapse
|
2
|
Lyons H, Pradhan P, Prakasam G, Vashishtha S, Li X, Eppert M, Fornero C, Tcheuyap VT, McGlynn K, Yu Z, Raju DR, Koduru PR, Xing C, Kapur P, Brugarolas J, Sabari BR. RNA polymerase II partitioning is a shared feature of diverse oncofusion condensates. Cell 2025:S0092-8674(25)00404-0. [PMID: 40286793 DOI: 10.1016/j.cell.2025.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 12/12/2024] [Accepted: 04/01/2025] [Indexed: 04/29/2025]
Abstract
Condensates regulate transcription by selectively compartmentalizing biomolecules, yet the rules of specificity and their relationship to function remain enigmatic. To identify rules linked to function, we leverage the genetic selection bias of condensate-promoting oncofusions. Focusing on the three most frequent oncofusions driving translocation renal cell carcinoma, we find that they promote the formation of condensates that activate transcription by gain-of-function RNA polymerase II partitioning through a shared signature of elevated π and π-interacting residues and depletion of aliphatic residues. This signature is shared among a broad set of DNA-binding oncofusions associated with diverse cancers. We find that this signature is necessary and sufficient for RNA polymerase II partitioning, gene activation, and cancer cell phenotypes. Our results reveal that dysregulated condensate specificity is a shared molecular mechanism of diverse oncofusions, highlighting the functional role of condensate composition and the power of disease genetics in investigating relationships between condensate specificity and function.
Collapse
Affiliation(s)
- Heankel Lyons
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Prashant Pradhan
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Gopinath Prakasam
- Kidney Cancer Program, Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Hematology-Oncology Division, Department of Internal Medicine, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Shubham Vashishtha
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Xiang Li
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Mikayla Eppert
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Christy Fornero
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Vanina T Tcheuyap
- Kidney Cancer Program, Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Hematology-Oncology Division, Department of Internal Medicine, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Kathleen McGlynn
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Ze Yu
- Eugene McDermott Center for Human Growth and Development, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Dinesh Ravindra Raju
- Eugene McDermott Center for Human Growth and Development, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Prasad R Koduru
- Department of Pathology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Chao Xing
- Eugene McDermott Center for Human Growth and Development, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Lyda Hill Department of Bioinformatics, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Peter O'Donnell School of Public Health, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Payal Kapur
- Kidney Cancer Program, Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Pathology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Urology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - James Brugarolas
- Kidney Cancer Program, Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Hematology-Oncology Division, Department of Internal Medicine, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Benjamin R Sabari
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Department of Molecular Biology, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Hamon Center for Regenerative Science and Medicine, The University of Texas Southwestern Medical Center, Dallas, TX 75390, USA.
| |
Collapse
|
3
|
Flores E, Acharya N, Castañeda CA, Sukenik S. Single-point mutations in disordered proteins: Linking sequence, ensemble, and function. Curr Opin Struct Biol 2025; 91:102987. [PMID: 39914051 DOI: 10.1016/j.sbi.2025.102987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 01/02/2025] [Accepted: 01/03/2025] [Indexed: 03/08/2025]
Abstract
Mutations in genomic DNA often result in single-point missense mutations in proteins. For folded proteins, the functional effect of these missense mutations can often be understood by their impact on structure. However, missense mutations in intrinsically disordered protein regions (IDRs) remain poorly understood. In IDRs, function can depend on the structural ensemble- the collection of accessible, interchanging conformations that is encoded in their amino acid sequence. We argue that, analogously to folded proteins, single-point mutations in IDRs can alter their structural ensemble, and consequently alter their biological function. To make this argument, we first provide experimental evidence from the literature showcasing how single-point missense mutations in IDRs affect their ensemble dimensions. Then, we use genomic data from patients to show that disease-linked missense mutations occurring in IDRs can, in many cases, significantly alter IDR structural ensembles. We hope this analysis prompts further study of disease-linked, single-point mutations in IDRs.
Collapse
Affiliation(s)
- Eduardo Flores
- Department of Chemistry and Biochemistry, UC Merced, United States
| | | | - Carlos A Castañeda
- Department of Chemistry, Syracuse University, United States; Department of Biology, Syracuse University, United States; Bioinspired Institute, Syracuse University, United States.
| | - Shahar Sukenik
- Department of Chemistry and Biochemistry, UC Merced, United States; Department of Chemistry, Syracuse University, United States; Bioinspired Institute, Syracuse University, United States.
| |
Collapse
|
4
|
Flores E, Camacho AR, Cuevas-Zepeda E, McCoy MB, Yu F, Staller MV, Sukenik S. Correlating disordered activation domain ensembles with gene expression levels. BIOPHYSICAL REPORTS 2025; 5:100195. [PMID: 39755236 PMCID: PMC11791265 DOI: 10.1016/j.bpr.2024.100195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2024] [Revised: 12/14/2024] [Accepted: 12/31/2024] [Indexed: 01/06/2025]
Abstract
Transcription factor proteins bind to specific DNA promoter sequences and initiate gene transcription. These proteins often contain intrinsically disordered activation domains (ADs) that regulate their transcriptional activity. Like other disordered protein regions, ADs do not have a fixed three-dimensional structure and instead exist in an ensemble of conformations. Disordered ensembles contain sequence-encoded structural preferences that are often linked to their function. We hypothesize that this link exists between the structural preferences of AD ensembles and their ability to induce gene expression. To test this, we measured the ensemble dimensions of two ADs, HIF-1α and CITED2, in live cells using fluorescence resonance energy transfer microscopy and correlated this structural information with their transcriptional activity. We find that mutations that expanded the ensemble of HIF-1α increased transcriptional activity, while compacting mutations reduced it, highlighting the critical role of structural plasticity in regulating HIF-1α function. Conversely, CITED2 showed no correlation between ensemble dimensions and activity. Our results highlight a possible link between AD ensemble dimensions and their transcriptional activity, with implications for transcriptional regulation and dysfunction.
Collapse
Affiliation(s)
- Eduardo Flores
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California
| | - Aleah R Camacho
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California
| | - Estefania Cuevas-Zepeda
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California
| | - Mary B McCoy
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California
| | - Feng Yu
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California; Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, California
| | - Max V Staller
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California; Center for Computational Biology, University of California, Berkeley, Berkeley, California; Chan Zuckerberg Biohub-San Francisco, San Francisco, California
| | - Shahar Sukenik
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California; Department of Chemistry, Syracuse University, Syracuse, New York.
| |
Collapse
|
5
|
Vashishtha S, Sabari BR. Disordered Regions of Condensate-promoting Proteins Have Distinct Molecular Signatures Associated with Cellular Function. J Mol Biol 2025; 437:168953. [PMID: 39826710 DOI: 10.1016/j.jmb.2025.168953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 12/23/2024] [Accepted: 01/10/2025] [Indexed: 01/22/2025]
Abstract
Disordered regions of proteins play crucial roles in cellular functions through diverse mechanisms. Some disordered regions function by promoting the formation of biomolecular condensates through dynamic multivalent interactions. While many have assumed that interactions among these condensate-promoting disordered regions are non-specific, recent studies have shown that distinct sequence compositions and patterning lead to specific condensate compositions associated with cellular function. Despite in-depth characterization of several key examples, the full chemical diversity of condensate-promoting disordered regions has not been surveyed. Here, we define a list of disordered regions of condensate-promoting proteins to survey the relationship between sequence and function. We find that these disordered regions show amino acid biases associated with different cellular functions. These amino acid biases are evolutionarily conserved in the absence of positional sequence conservation. Overall, our analysis highlights the relationship between sequence features and function for condensate-promoting disordered regions. This analysis suggests that molecular signatures encoded within disordered regions could impart functional specificity.
Collapse
Affiliation(s)
- Shubham Vashishtha
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Benjamin R Sabari
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA.
| |
Collapse
|
6
|
Jonas F, Navon Y, Barkai N. Intrinsically disordered regions as facilitators of the transcription factor target search. Nat Rev Genet 2025:10.1038/s41576-025-00816-3. [PMID: 39984675 DOI: 10.1038/s41576-025-00816-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/14/2025] [Indexed: 02/23/2025]
Abstract
Transcription factors (TFs) contribute to organismal development and function by regulating gene expression. Despite decades of research, the factors determining the specificity and speed at which eukaryotic TFs detect their target binding sites remain poorly understood. Recent studies have pointed to intrinsically disordered regions (IDRs) within TFs as key regulators of the process by which TFs find their target sites on DNA (the TF target search). However, IDRs are challenging to study because they can confer specificity despite low sequence complexity and can be functionally conserved despite rapid sequence divergence. Nevertheless, emerging computational and experimental approaches are beginning to elucidate the sequence-function relationship within the IDRs of TFs. Additional insights are informing potential mechanisms underlying the IDR-directed search for the DNA targets of TFs, including incorporation into biomolecular condensates, facilitating TF co-localization, and the hypothesis that IDRs recognize and directly interact with specific genomic regions.
Collapse
Affiliation(s)
- Felix Jonas
- School of Science, Constructor University, Bremen, Germany.
| | - Yoav Navon
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
7
|
He C, Liang Y, Chen R, Shen Y, Li R, Sun T, Du X, Ni X, Shang J, He Y, Bao M, Luo H, Wang J, Liao P, Kang C, Yuan YW, Ning G. Boosting transcriptional activities by employing repeated activation domains in transcription factors. THE PLANT CELL 2025; 37:koae315. [PMID: 39657052 PMCID: PMC11823830 DOI: 10.1093/plcell/koae315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 10/24/2024] [Accepted: 10/28/2024] [Indexed: 12/17/2024]
Abstract
Enhancing the transcriptional activation activity of transcription factors (TFs) has multiple applications in organism improvement, metabolic engineering, and other aspects of plant science, but the approaches remain unclear. Here, we used gene activation assays and genetic transformation to investigate the transcriptional activities of two MYB TFs, PRODUCTION OF ANTHOCYANIN PIGMENT 1 (AtPAP1) from Arabidopsis (Arabidopsis thaliana) and EsMYBA1 from Epimedium (Epimedium sagittatum), and their synthetic variants in a range of plant species from several families. Using anthocyanin biosynthesis as a convenient readout, we discovered that homologous naturally occurring TFs showed differences in the transcriptional activation ability and that similar TFs induced large changes in the genetic program when heterologously expressed in different species. In some cases, shuffling the DNA-binding domains and transcriptional activation domains (ADs) between homologous TFs led to synthetic TFs that had stronger activation potency than the original TFs. More importantly, synthetic TFs derived from MYB, NAC, bHLH, and ethylene-insensitive3-like (EIL) family members containing tandemly repeated ADs had greatly enhanced activity compared to their natural counterparts. These findings enhance our understanding of TF activity and demonstrate that employing tandemly repeated ADs from natural TFs is a simple and widely applicable strategy to enhance the activation potency of synthetic TFs.
Collapse
Affiliation(s)
- Chaochao He
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Yue Liang
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Runzhou Chen
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuxiao Shen
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Runhui Li
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Tingting Sun
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Xing Du
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiaomei Ni
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Junzhong Shang
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Yanhong He
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Manzhu Bao
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| | - Hong Luo
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA
| | - Jihua Wang
- Flower Research Institute of Yunnan Academy of Agricultural Sciences, National Engineering Research Center for Ornamental Horticulture, Kunming 650205, China
| | - Pan Liao
- Department of Biology, Hong Kong Baptist University, Kowloon Tong, Hong Kong SAR 999077, China
| | - Chunying Kang
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
| | - Yao-Wu Yuan
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
| | - Guogui Ning
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan 430070, China
- The Institute of Flowers Research, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
8
|
Delaforge E, Due A, Theisen F, Morffy N, O’Shea C, Blackledge M, Strader L, Skriver K, Kragelund B. Allovalent scavenging of activation domains in the transcription factor ANAC013 gears transcriptional regulation. Nucleic Acids Res 2025; 53:gkaf065. [PMID: 39933695 PMCID: PMC11811731 DOI: 10.1093/nar/gkaf065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 01/18/2025] [Accepted: 01/23/2025] [Indexed: 02/13/2025] Open
Abstract
Transcriptional regulation involves interactions between transcription factors, coregulators, and DNA. Intrinsic disorder is a major player in this regulation, but mechanisms driven by disorder remain elusive. Here, we address molecular communication within the stress-regulating Arabidopsis thaliana transcription factor ANAC013. Through high-throughput screening of ANAC013 for transcriptional activation activity, we identify three activation domains within its C-terminal intrinsically disordered region. Two of these overlap with acidic islands and form dynamic interactions with the DNA-binding domain and are released, not only upon binding of target promoter DNA, but also by nonspecific DNA. We show that independently of DNA binding, the RST (RCD--SRO--TAF4) domain of the negative regulator RCD1 (Radical-induced Cell Death1) scavenges the two acidic activation domains positioned vis-à-vis through allovalent binding, leading to dynamic occupation at enhanced affinity. We propose an allovalency model for transcriptional regulation, where sequentially close activation domains in both DNA-bound and DNA-free states allow for efficient regulation. The model is likely relevant for many transcription factor systems, explaining the functional advantage of carrying sequentially close activation domains.
Collapse
Affiliation(s)
- Elise Delaforge
- REPIN, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| | - Amanda D Due
- REPIN, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| | - Frederik Friis Theisen
- REPIN, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| | - Nicolas Morffy
- Department of Biology, Duke University, 27708 Durham, NC, United States
| | - Charlotte O’Shea
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| | - Martin Blackledge
- Université Grenoble Alpes, Le Centre National de la Recherche Scientifique, Commissariat à l’Energie Atomique et aux Energies Alternatives, Institut de Biologie Structurale, 38000 Grenoble, France
| | - Lucia C Strader
- Department of Biology, Duke University, 27708 Durham, NC, United States
| | - Karen Skriver
- REPIN, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| | - Birthe B Kragelund
- REPIN, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Linderstrøm-Lang Centre for Protein Science and Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaløes vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
9
|
Jemth P. Protein binding and folding through an evolutionary lens. Curr Opin Struct Biol 2025; 90:102980. [PMID: 39817990 DOI: 10.1016/j.sbi.2024.102980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 12/18/2024] [Accepted: 12/19/2024] [Indexed: 01/18/2025]
Abstract
Protein-protein associations are often mediated by an intrinsically disordered protein region interacting with a folded domain in a coupled binding and folding reaction. Classic physical organic chemistry approaches together with structural biology have shed light on mechanistic aspects of such reactions. Further insight into general principles may be obtained by interpreting the results through an evolutionary lens. This review attempts to provide an overview on how the analysis of binding and folding reactions can benefit from an evolutionary approach, and is aimed at protein scientists without a background in evolution. Evolution constantly reshapes existing proteins by sampling more or less fit variants. Most new variants are weeded out as generations and new species come and go over hundreds to hundreds of millions of years. The huge ongoing genome sequencing efforts have provided us with a snapshot of existing adapted fit-for-purpose protein homologs in thousands of different organisms. Comparison of present-day orthologs and paralogs highlights general principles of the evolution of coupled binding and folding reactions and demonstrate a great potential for evolution to operate on disordered regions and modulate affinity and specificity of the interactions.
Collapse
Affiliation(s)
- Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Box 582, SE-75123 Uppsala, Sweden.
| |
Collapse
|
10
|
Datta RR, Akdogan D, Tezcan EB, Onal P. Versatile roles of disordered transcription factor effector domains in transcriptional regulation. FEBS J 2025. [PMID: 39888268 DOI: 10.1111/febs.17424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2024] [Revised: 11/25/2024] [Accepted: 01/21/2025] [Indexed: 02/01/2025]
Abstract
Transcription, a crucial step in the regulation of gene expression, is tightly controlled and involves several essential processes, such as chromatin organization, recognition of the specific genomic sequences, DNA binding, and ultimately recruiting the transcriptional machinery to facilitate transcript synthesis. At the center of this regulation are transcription factors (TFs), which comprise at least one DNA-binding domain (DBD) and an effector domain (ED). Although the structure and function of DBDs have been well studied, our knowledge of the structure and function of effector domains is limited. EDs are of particular importance in generating distinct transcriptional responses between protein members of the same TF family that have similar DBDs and specificities. The study of transcriptional activity conferred by effector domains has traditionally been conducted through examining protein-protein interactions. However, recent research has uncovered alternative mechanisms by which EDs regulate gene expression, such as the formation of condensates that increase the local concentration of transcription factors, cofactors, and coregulated genes, as well as DNA binding. Here, we provide a comprehensive overview of the known roles of transcription factor EDs, with a specific focus on disordered regions. Additionally, we emphasize the significance of intrinsically disordered regions (IDRs) during transcriptional regulation. We examine the mechanisms underlying the establishment and maintenance of transcriptional specificity through the structural properties of predominantly disordered EDs. We then provide a comprehensive overview of the current understanding of these domains, including their physical and chemical characteristics, as well as their functional roles.
Collapse
Affiliation(s)
| | - Dilan Akdogan
- Molecular Biology and Genetics Department, Ihsan Dogramaci Bilkent University, Ankara, Turkey
| | - Elif B Tezcan
- Molecular Biology and Genetics Department, Ihsan Dogramaci Bilkent University, Ankara, Turkey
| | - Pinar Onal
- Molecular Biology and Genetics Department, Ihsan Dogramaci Bilkent University, Ankara, Turkey
| |
Collapse
|
11
|
Falo-Sanjuan J, Diaz-Tirado Y, Turner MA, Rourke O, Davis J, Medrano C, Haines J, McKenna J, Karshenas A, Eisen MB, Garcia HG. Targeted mutagenesis of specific genomic DNA sequences in animals for the in vivo generation of variant libraries. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.10.598328. [PMID: 38915503 PMCID: PMC11195090 DOI: 10.1101/2024.06.10.598328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Understanding how the number, placement and affinity of transcription factor binding sites dictates gene regulatory programs remains a major unsolved challenge in biology, particularly in the context of multicellular organisms. To uncover these rules, it is first necessary to find the binding sites within a regulatory region with high precision, and then to systematically modulate this binding site arrangement while simultaneously measuring the effect of this modulation on output gene expression. Massively parallel reporter assays (MPRAs), where the gene expression stemming from 10,000s of in vitro-generated regulatory sequences is measured, have made this feat possible in high-throughput in single cells in culture. However, because of lack of technologies to incorporate DNA libraries, MPRAs are limited in whole organisms. To enable MPRAs in multicellular organisms, we generated tools to create a high degree of mutagenesis in specific genomic loci in vivo using base editing. Targeting GFP integrated in the genome of Drosophila cell culture and whole animals as a case study, we show that the base editor AIDevoCDA1 stemming from sea lamprey fused to nCas9 is highly mutagenic. Surprisingly, longer gRNAs increase mutation efficiency and expand the mutating window, which can allow the introduction of mutations in previously untargetable sequences. Finally, we demonstrate arrays of >20 gRNAs that can efficiently introduce mutations along a 200bp sequence, making it a promising tool to test enhancer function in vivo in a high throughput manner.
Collapse
Affiliation(s)
- Julia Falo-Sanjuan
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Yuliana Diaz-Tirado
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Meghan A. Turner
- Biophysics Graduate Group, University of California, Berkeley, CA, USA
| | - Olivia Rourke
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Julian Davis
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Claudia Medrano
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Jenna Haines
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Joey McKenna
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Arman Karshenas
- Biophysics Graduate Group, University of California, Berkeley, CA, USA
| | - Michael B. Eisen
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
- Department of Integrative Biology, University of California, Berkeley, CA, USA
- Howard Hughes Medical Institute, University of California, Berkeley, CA, USA
| | - Hernan G. Garcia
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
- Department of Physics, University of California, Berkeley, CA, USA
- Institute for Quantitative Biosciences-QB3, University of California, Berkeley, CA, USA
- Chan Zuckerberg Biohub – San Francisco, San Francisco, CA, USA
- Biophysics Graduate Group, University of California, Berkeley, CA, USA
| |
Collapse
|
12
|
LeBlanc C, Stefani J, Soriano M, Lam A, Zintel MA, Kotha SR, Chase E, Pimentel-Solorio G, Vunnum A, Flug K, Fultineer A, Hummel N, Staller MV. Conservation of function without conservation of amino acid sequence in intrinsically disordered transcriptional activation domains. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.03.626510. [PMID: 39677729 PMCID: PMC11642888 DOI: 10.1101/2024.12.03.626510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/17/2024]
Abstract
Protein function is canonically believed to be more conserved than amino acid sequence, but this idea is only well supported in folded domains, where highly diverged sequences can fold into equivalent 3D structures. In contrast, intrinsically disordered protein regions (IDRs) do not fold into a stable 3D structure, thus it remains unknown when and how function is conserved for IDRs that experience rapid amino acid sequence divergence. As a model system for studying the evolution of IDRs, we examined transcriptional activation domains, the regions of transcription factors that bind to coactivator complexes. We systematically identified activation domains on 502 orthologs of the transcriptional activator Gcn4 spanning 600 MY of fungal evolution. We find that the central activation domain shows strong conservation of function without conservation of sequence. This conservation of function without conservation of sequence is facilitated by evolutionary turnover (gain and loss) of key acidic and aromatic residues, the positions most important for function. This high sequence flexibility of functional orthologs mirrors the physical flexibility of the activation domain coactivator interaction interface, suggesting that physical flexibility enables evolutionary plasticity. We propose that turnover of short functional elements, sometimes individual amino acids, is a general mechanism for conservation of function without conservation of sequence during IDR evolution.
Collapse
Affiliation(s)
- Claire LeBlanc
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Jordan Stefani
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Melvin Soriano
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Angelica Lam
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Marissa A. Zintel
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
| | - Sanjana R. Kotha
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Emily Chase
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Giovani Pimentel-Solorio
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
| | - Aditya Vunnum
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
| | - Katherine Flug
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
| | - Aaron Fultineer
- Department of Physics, University of California Berkeley, Berkeley, 94720
| | - Niklas Hummel
- Department of Biology, Technische Universität Darmstadt, Darmstadt, Germany
| | - Max V. Staller
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
- Chan Zuckerberg Biohub–San Francisco, San Francisco, CA 94158
| |
Collapse
|
13
|
Bremer A, Lang WH, Kempen RP, Sweta K, Taylor AB, Borgia MB, Ansari AZ, Mittag T. Reconciling competing models on the roles of condensates and soluble complexes in transcription factor function. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.21.624739. [PMID: 39605529 PMCID: PMC11601617 DOI: 10.1101/2024.11.21.624739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Phase separation explains the exquisite spatial and temporal regulation of many biological processes, but the role of transcription factor-mediated condensates in gene regulation is contentious, requiring head-to-head comparison of competing models. Here, we focused on the prototypical yeast transcription factor Gcn4 and assessed two models for gene transcription activation, i.e., mediated via soluble complexes or transcriptional condensates. Both models rely on the ability of transcription factors and coactivators to engage in multivalent interactions. Unexpectedly, we found that propensity to form homotypic Gcn4 condensates does not correlate well with transcriptional activity. Contrary to prevailing models, binding to DNA suppresses Gcn4 phase separation. Notably, the ability of Gcn4 to form soluble complexes with coactivator subunit Med15 closely mirrored the propensity to recruit Med15 into condensates, indicating that these properties are intertwined and cautioning against interpretation of mutational data without head-to-head comparisons. However, Gcn4 variants with the highest affinity for Med15 do not function as well as expected and instead have activities that reflect their abilities to phase separate with Med15. These variants therefore indeed form cellular condensates, and those attenuate activity. Our results show that transcription factors can function as soluble complexes as well as condensates, reconciling two seemingly opposing models, and have implications for other phase-separating systems.
Collapse
Affiliation(s)
- Anne Bremer
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Walter H. Lang
- Department of Chemical Biology & Therapeutics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Ryan P. Kempen
- Department of Chemical Biology & Therapeutics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Kumari Sweta
- Department of Chemical Biology & Therapeutics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Aaron B. Taylor
- Cellular Imaging Shared Resource, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Madeleine B. Borgia
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Aseem Z. Ansari
- Department of Chemical Biology & Therapeutics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Tanja Mittag
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| |
Collapse
|
14
|
Cooper DG, Erkina TY, Broyles BK, Class CA, Erkine AM. Grammar rules and exceptions for the language of transcriptional activation domains. iScience 2024; 27:111057. [PMID: 39524347 PMCID: PMC11546935 DOI: 10.1016/j.isci.2024.111057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 07/11/2024] [Accepted: 09/24/2024] [Indexed: 11/16/2024] Open
Abstract
Transcriptional activation domains (ADs) of gene activators have remained enigmatic for decades as short, extremely variable, and structurally disordered sequences. Using a rational design and high throughput in vivo experimentation, we determine the grammar rules and exceptions for the language of ADs. According to identified rules, billions of highly active ADs can be composed of balanced amounts of acidic/aromatic amino acids, with either mixed composition of aromatic residues, or using only one aromatic residue mixed with acidic residues. However, equally active sequences can be composed of only aliphatic leucine and aspartic acid residues. The much rarer LD exceptions have a higher ratio of hydrophobic/acidic balance and display a specific LDL(L/D)DLL motif. For aromatic/acidic Ads, the intermixing of proline residues in context of amphipathic α-helix structures significantly increases the AD activity. The identified grammar rules and exceptions are interpreted in application to the biochemistry of AD function and eukaryotic gene expression.
Collapse
Affiliation(s)
- David G. Cooper
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Tamara Y. Erkina
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Bradley K. Broyles
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Caleb A. Class
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Alexandre M. Erkine
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| |
Collapse
|
15
|
Mindel V, Brodsky S, Yung H, Manadre W, Barkai N. Revisiting the model for coactivator recruitment: Med15 can select its target sites independent of promoter-bound transcription factors. Nucleic Acids Res 2024; 52:12093-12111. [PMID: 39187372 PMCID: PMC11551773 DOI: 10.1093/nar/gkae718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 07/08/2024] [Accepted: 08/09/2024] [Indexed: 08/28/2024] Open
Abstract
Activation domains (ADs) within transcription factors (TFs) induce gene expression by recruiting coactivators such as the Mediator complex. Coactivators lack DNA binding domains (DBDs) and are assumed to passively follow their recruiting TFs. This is supported by direct AD-coactivator interactions seen in vitro but has not yet been tested in living cells. To examine that, we targeted two Med15-recruiting ADs to a range of budding yeast promoters through fusion with different DBDs. The DBD-AD fusions localized to hundreds of genomic sites but recruited Med15 and induced transcription in only a subset of bound promoters, characterized by a fuzzy-nucleosome architecture. Direct DBD-Med15 fusions shifted DBD localization towards fuzzy-nucleosome promoters, including promoters devoid of the endogenous Mediator. We propose that Med15, and perhaps other coactivators, possess inherent promoter preference and thus actively contribute to the selection of TF-induced genes.
Collapse
Affiliation(s)
- Vladimir Mindel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sagie Brodsky
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Hadas Yung
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Wajd Manadre
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
16
|
Shepherdson JL, Granas DM, Li J, Shariff Z, Plassmeyer SP, Holehouse AS, White MA, Cohen BA. Mutational scanning of CRX classifies clinical variants and reveals biochemical properties of the transcriptional effector domain. Genome Res 2024; 34:1540-1552. [PMID: 39322280 PMCID: PMC11529990 DOI: 10.1101/gr.279415.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 09/11/2024] [Indexed: 09/27/2024]
Abstract
The transcription factor (TF) cone-rod homeobox (CRX) is essential for the differentiation and maintenance of photoreceptor cell identity. Several human CRX variants cause degenerative retinopathies, but most are variants of uncertain significance. We performed a deep mutational scan (DMS) of nearly all possible single amino acid substitutions in CRX using a cell-based transcriptional reporter assay, curating a high-confidence list of nearly 2000 variants with altered transcriptional activity. In the structured homeodomain, activity scores closely aligned to a predicted structure and demonstrated position-specific constraints on amino acid substitution. In contrast, the intrinsically disordered transcriptional effector domain displayed a qualitatively different pattern of substitution effects, following compositional constraints without specific residue position requirements in the peptide chain. These compositional constraints were consistent with the acidic exposure model of transcriptional activation. We evaluated the performance of the DMS assay as a clinical variant classification tool using gold-standard classified human variants from ClinVar, identifying pathogenic variants with high specificity and moderate sensitivity. That this performance could be achieved using a synthetic reporter assay in a foreign cell type, even for a highly cell type-specific TF like CRX, suggests that this approach shows promise for DMS of other TFs that function in cell types that are not easily accessible. Together, the results of the CRX DMS identify molecular features of the CRX effector domain and demonstrate utility for integration into the clinical variant classification pipeline.
Collapse
Affiliation(s)
- James L Shepherdson
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - David M Granas
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Jie Li
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Zara Shariff
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Stephen P Plassmeyer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Michael A White
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| | - Barak A Cohen
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA;
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA
| |
Collapse
|
17
|
Flores E, Camacho AR, Cuevas-Zepeda E, McCoy MB, Yu F, Staller MV, Sukenik S. Correlating Disordered Activation Domain Ensembles with Gene Expression Levels. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.19.619222. [PMID: 39484498 PMCID: PMC11527027 DOI: 10.1101/2024.10.19.619222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Transcription factor proteins bind to specific DNA promoter sequences and initiate gene transcription. In eukaryotes, most transcription factors contain intrinsically disordered activation domains (ADs) that regulate their transcriptional activity. Like other disordered protein regions, ADs do not have a fixed three-dimensional structure and instead exist in an ensemble of conformations. Disordered ensembles contain sequence-encoded structural preferences which are often linked to their function. We hypothesize this link exists between the structural preferences of disordered AD ensembles and their ability to induce gene expression. To test this, we used FRET microscopy to measure the ensemble dimensions of two activation domains, HIF-1α and CITED2, in live cells, and correlate this structural information with transcriptional activity. We find that point mutations that expanded the HIF-1α ensemble increased transcriptional activity, while those that compacted it reduced activity. Conversely, CITED2 showed no correlation between ensemble dimensions and activity. Our results reveal a sequence-dependent relationship between AD ensemble dimensions and their transcriptional activity.
Collapse
Affiliation(s)
- Eduardo Flores
- Department of Chemistry and Biochemistry, University of California Merced, Merced, 95343
| | - Aleah R Camacho
- Department of Chemistry and Biochemistry, University of California Merced, Merced, 95343
| | | | - Mary B McCoy
- Department of Chemistry and Biochemistry, University of California Merced, Merced, 95343
| | - Feng Yu
- Department of Chemistry and Biochemistry, University of California Merced, Merced, 95343
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 94720 Berkeley, CA, USA
| | - Max V Staller
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, 94720
- Center for Computational Biology, University of California Berkeley, Berkeley, 94720
- Chan Zuckerberg Biohub-San Francisco, San Francisco, CA 94158
| | - Shahar Sukenik
- Department of Chemistry and Biochemistry, University of California Merced, Merced, 95343
- Department of Chemistry, Syracuse University, Syracuse, 13244
| |
Collapse
|
18
|
Kribelbauer-Swietek JF, Pushkarev O, Gardeux V, Faltejskova K, Russeil J, van Mierlo G, Deplancke B. Context transcription factors establish cooperative environments and mediate enhancer communication. Nat Genet 2024; 56:2199-2212. [PMID: 39363017 PMCID: PMC11525195 DOI: 10.1038/s41588-024-01892-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/01/2024] [Indexed: 10/05/2024]
Abstract
Many enhancers control gene expression by assembling regulatory factor clusters, also referred to as condensates. This process is vital for facilitating enhancer communication and establishing cellular identity. However, how DNA sequence and transcription factor (TF) binding instruct the formation of high regulatory factor environments remains poorly understood. Here we developed a new approach leveraging enhancer-centric chromatin accessibility quantitative trait loci (caQTLs) to nominate regulatory factor clusters genome-wide. By analyzing TF-binding signatures within the context of caQTLs and comparing episomal versus endogenous enhancer activities, we discovered a class of regulators, 'context-only' TFs, that amplify the activity of cell type-specific caQTL-binding TFs, that is, 'context-initiator' TFs. Similar to super-enhancers, enhancers enriched for context-only TF-binding sites display high coactivator binding and sensitivity to bromodomain-inhibiting molecules. We further show that binding sites for context-only and context-initiator TFs underlie enhancer coordination, providing a mechanistic rationale for how a loose TF syntax confers regulatory specificity.
Collapse
Affiliation(s)
- Judith F Kribelbauer-Swietek
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| | - Olga Pushkarev
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Vincent Gardeux
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Katerina Faltejskova
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic
- Computer Science Institute, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
| | - Julie Russeil
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Guido van Mierlo
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
19
|
Valbuena R, Nigam A, Tycko J, Suzuki P, Spees K, Aradhana, Arana S, Du P, Patel RA, Bintu L, Kundaje A, Bassik MC. Prediction and design of transcriptional repressor domains with large-scale mutational scans and deep learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.21.614253. [PMID: 39386603 PMCID: PMC11463546 DOI: 10.1101/2024.09.21.614253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
Regulatory proteins have evolved diverse repressor domains (RDs) to enable precise context-specific repression of transcription. However, our understanding of how sequence variation impacts the functional activity of RDs is limited. To address this gap, we generated a high-throughput mutational scanning dataset measuring the repressor activity of 115,000 variant sequences spanning more than 50 RDs in human cells. We identified thousands of clinical variants with loss or gain of repressor function, including TWIST1 HLH variants associated with Saethre-Chotzen syndrome and MECP2 domain variants associated with Rett syndrome. We also leveraged these data to annotate short linear interacting motifs (SLiMs) that are critical for repression in disordered RDs. Then, we designed a deep learning model called TENet ( T ranscriptional E ffector Net work) that integrates sequence, structure and biochemical representations of sequence variants to accurately predict repressor activity. We systematically tested generalization within and across domains with varying homology using the mutational scanning dataset. Finally, we employed TENet within a directed evolution sequence editing framework to tune the activity of both structured and disordered RDs and experimentally test thousands of designs. Our work highlights critical considerations for future dataset design and model training strategies to improve functional variant prioritization and precision design of synthetic regulatory proteins.
Collapse
|
20
|
Folimonova V, Chen X, Negi H, Schwieters CD, Li J, Byrd RA, Taylor N, Youkharibache P, Walters KJ. CD28 hinge used in chimeric antigen receptor (CAR) T-cells exhibits local structure and conformational exchange amidst global disorder. Commun Biol 2024; 7:1072. [PMID: 39217198 PMCID: PMC11365992 DOI: 10.1038/s42003-024-06770-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 08/21/2024] [Indexed: 09/04/2024] Open
Abstract
T-cell therapies based on chimeric antigen receptor (CAR) targeting of a tumor-specific antigen offer hope for patients with relapsed or refractory cancers. CAR hinge and transmembrane regions link antigen recognition domains to intracellular signal transduction domains. Here, we apply biophysical methods to characterize the structure and dynamic properties of the CD28 CAR hinge (CD28H) used in an FDA-approved CD19 CAR for the treatment of B-lineage leukemia/lymphoma. By using nuclear Overhauser effect spectroscopy (NOESY), which detects even transiently occupied structural motifs, we observed otherwise elusive local structural elements amidst overall disorder in CD28H, including a conformational switch from a native β-strand to a 310-helix and polyproline II helix-like structure. These local structural motifs contribute to an overall loosely formed extended geometry that could be captured by NOESY data. All FDA-approved CARs use prolines in the hinge region, which we find in CD28, and previously in CD8α, isomerize to promote structural plasticity and dynamics. These local structural elements may function in recognition and signaling events and constrain the spacing between the transmembrane and antigen recognition domains. Our study thus demonstrates a method for detecting local and transient structure within intrinsically disordered systems and moreover, our CD28H findings may inform future CAR design.
Collapse
Affiliation(s)
- Varvara Folimonova
- Protein Processing Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA
| | - Xiang Chen
- Protein Processing Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA
| | - Hitendra Negi
- Protein Processing Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA
| | - Charles D Schwieters
- Computational Biomolecular Magnetic Resonance Core, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Jess Li
- Macromolecular NMR Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA
| | - R Andrew Byrd
- Macromolecular NMR Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA
| | - Naomi Taylor
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Philippe Youkharibache
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Kylie J Walters
- Protein Processing Section, Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, USA.
| |
Collapse
|
21
|
Rubin AJ, Dao TT, Schueppert AV, Regev A, Shalek AK. LAT encodes T cell activation pathway balance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.26.609683. [PMID: 39253472 PMCID: PMC11383308 DOI: 10.1101/2024.08.26.609683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Immune cells transduce environmental stimuli into responses essential for host health via complex signaling cascades. T cells, in particular, leverage their unique T cell receptors (TCRs) to detect specific Human Leukocyte Antigen (HLA)-presented peptides. TCR activation is then relayed via linker for activation of T cells (LAT), a TCR-proximal disordered adapter protein, which organizes protein partners and mediates the propagation of signals down diverse pathways including NFAT and AP-1. Here, we studied how balanced downstream pathway activation is encoded in the amino acid sequence of LAT. To comprehensively profile the sequence-function relationship of LAT, we developed a pooled, single-cell, high-content screening approach in which a large series of mutants in the LAT protein were analyzed to characterize their effects on T cell activation. Measuring epigenetic, transcriptomic, and cell surface protein dynamics of single cells harboring distinct LAT mutants, we found functional regions spanning over 40% of the LAT amino acid sequence. Conserved sequence motifs for protein interactions along with charge distribution are critical sequence features, and contribute to interpretation of human genetic variation in LAT. While mutant defect severity spans from moderate to complete loss of function, nearly all defective mutants, irrespective of their position in LAT, confer balanced defects across all downstream pathways. To understand the molecular basis for this observation, we performed proximal protein labeling which demonstrated that disruption of LAT interaction with a single partner protein indirectly disrupts other partner interactions, likely through the dual roles of these proteins as effectors of downstream pathways and bridging factors between LAT molecules. Overall, we report widely distributed functional regions throughout a disordered adapter and a precise physical organization of LAT and interacting molecules which constrains signaling outputs. More broadly, we describe an approach for interrogating sequence-function relationships for proteins with complex activities across regulatory layers of the cell.
Collapse
Affiliation(s)
- Adam J. Rubin
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Medical Engineering & Science, Department of Chemistry, and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Ragon Institute of MIT, MGH, and Harvard, Cambridge, MA 02139, USA
| | - Tyler T. Dao
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Medical Engineering & Science, Department of Chemistry, and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Ragon Institute of MIT, MGH, and Harvard, Cambridge, MA 02139, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Amelia V. Schueppert
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Medical Engineering & Science, Department of Chemistry, and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Ragon Institute of MIT, MGH, and Harvard, Cambridge, MA 02139, USA
| | - Aviv Regev
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Current address: Genentech, South San Francisco, CA, 94080
| | - Alex K. Shalek
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Medical Engineering & Science, Department of Chemistry, and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Ragon Institute of MIT, MGH, and Harvard, Cambridge, MA 02139, USA
| |
Collapse
|
22
|
DelRosso N, Suzuki PH, Griffith D, Lotthammer JM, Novak B, Kocalar S, Sheth MU, Holehouse AS, Bintu L, Fordyce P. High-throughput affinity measurements of direct interactions between activation domains and co-activators. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.19.608698. [PMID: 39229005 PMCID: PMC11370418 DOI: 10.1101/2024.08.19.608698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Sequence-specific activation by transcription factors is essential for gene regulation1,2. Key to this are activation domains, which often fall within disordered regions of transcription factors3,4 and recruit co-activators to initiate transcription5. These interactions are difficult to characterize via most experimental techniques because they are typically weak and transient6,7. Consequently, we know very little about whether these interactions are promiscuous or specific, the mechanisms of binding, and how these interactions tune the strength of gene activation. To address these questions, we developed a microfluidic platform for expression and purification of hundreds of activation domains in parallel followed by direct measurement of co-activator binding affinities (STAMMPPING, for Simultaneous Trapping of Affinity Measurements via a Microfluidic Protein-Protein INteraction Generator). By applying STAMMPPING to quantify direct interactions between eight co-activators and 204 human activation domains (>1,500 K ds), we provide the first quantitative map of these interactions and reveal 334 novel binding pairs. We find that the metazoan-specific co-activator P300 directly binds >100 activation domains, potentially explaining its widespread recruitment across the genome to influence transcriptional activation. Despite sharing similar molecular properties (e.g. enrichment of negative and hydrophobic residues), activation domains utilize distinct biophysical properties to recruit certain co-activator domains. Co-activator domain affinity and occupancy are well-predicted by analytical models that account for multivalency, and in vitro affinities quantitatively predict activation in cells with an ultrasensitive response. Not only do our results demonstrate the ability to measure affinities between even weak protein-protein interactions in high throughput, but they also provide a necessary resource of over 1,500 activation domain/co-activator affinities which lays the foundation for understanding the molecular basis of transcriptional activation.
Collapse
Affiliation(s)
| | - Peter H Suzuki
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Jeffrey M Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Borna Novak
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Selin Kocalar
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Maya U Sheth
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Lacramioara Bintu
- Biophysics Program, Stanford University, Stanford, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Polly Fordyce
- Biophysics Program, Stanford University, Stanford, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Sarafan ChEM-H Institute, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub San Francisco, CA, USA
| |
Collapse
|
23
|
Morffy N, Van den Broeck L, Miller C, Emenecker RJ, Bryant JA, Lee TM, Sageman-Furnas K, Wilkinson EG, Pathak S, Kotha SR, Lam A, Mahatma S, Pande V, Waoo A, Wright RC, Holehouse AS, Staller MV, Sozzani R, Strader LC. Identification of plant transcriptional activation domains. Nature 2024; 632:166-173. [PMID: 39020176 PMCID: PMC11589624 DOI: 10.1038/s41586-024-07707-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 06/12/2024] [Indexed: 07/19/2024]
Abstract
Gene expression in Arabidopsis is regulated by more than 1,900 transcription factors (TFs), which have been identified genome-wide by the presence of well-conserved DNA-binding domains. Activator TFs contain activation domains (ADs) that recruit coactivator complexes; however, for nearly all Arabidopsis TFs, we lack knowledge about the presence, location and transcriptional strength of their ADs1. To address this gap, here we use a yeast library approach to experimentally identify Arabidopsis ADs on a proteome-wide scale, and find that more than half of the Arabidopsis TFs contain an AD. We annotate 1,553 ADs, the vast majority of which are, to our knowledge, previously unknown. Using the dataset generated, we develop a neural network to accurately predict ADs and to identify sequence features that are necessary to recruit coactivator complexes. We uncover six distinct combinations of sequence features that result in activation activity, providing a framework to interrogate the subfunctionalization of ADs. Furthermore, we identify ADs in the ancient AUXIN RESPONSE FACTOR family of TFs, revealing that AD positioning is conserved in distinct clades. Our findings provide a deep resource for understanding transcriptional activation, a framework for examining function in intrinsically disordered regions and a predictive model of ADs.
Collapse
Affiliation(s)
| | - Lisa Van den Broeck
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Caelan Miller
- Department of Biology, Duke University, Durham, NC, USA
| | - Ryan J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - John A Bryant
- Biological Systems Engineering, Virginia Tech, Blacksburg, VA, USA
| | - Tyler M Lee
- Department of Biology, Duke University, Durham, NC, USA
| | | | | | - Sunita Pathak
- Department of Biology, Duke University, Durham, NC, USA
| | - Sanjana R Kotha
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Angelica Lam
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Saloni Mahatma
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Vikram Pande
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Aman Waoo
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - R Clay Wright
- Biological Systems Engineering, Virginia Tech, Blacksburg, VA, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Max V Staller
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Rosangela Sozzani
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | | |
Collapse
|
24
|
Naderi J, Magalhaes AP, Kibar G, Stik G, Zhang Y, Mackowiak SD, Wieler HM, Rossi F, Buschow R, Christou-Kent M, Alcoverro-Bertran M, Graf T, Vingron M, Hnisz D. An activity-specificity trade-off encoded in human transcription factors. Nat Cell Biol 2024; 26:1309-1321. [PMID: 38969762 PMCID: PMC11321997 DOI: 10.1038/s41556-024-01411-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 03/20/2024] [Indexed: 07/07/2024]
Abstract
Transcription factors (TFs) control specificity and activity of gene transcription, but whether a relationship between these two features exists is unclear. Here we provide evidence for an evolutionary trade-off between the activity and specificity in human TFs encoded as submaximal dispersion of aromatic residues in their intrinsically disordered protein regions. We identified approximately 500 human TFs that encode short periodic blocks of aromatic residues in their intrinsically disordered regions, resembling imperfect prion-like sequences. Mutation of periodic aromatic residues reduced transcriptional activity, whereas increasing the aromatic dispersion of multiple human TFs enhanced transcriptional activity and reprogramming efficiency, promoted liquid-liquid phase separation in vitro and more promiscuous DNA binding in cells. Together with recent work on enhancer elements, these results suggest an important evolutionary role of suboptimal features in transcriptional control. We propose that rational engineering of amino acid features that alter phase separation may be a strategy to optimize TF-dependent processes, including cellular reprogramming.
Collapse
Affiliation(s)
- Julian Naderi
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Institute of Chemistry and Biochemistry, Department of Biology, Chemistry and Pharmacy, Freie Universität Berlin, Berlin, Germany
| | - Alexandre P Magalhaes
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Gözde Kibar
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Gregoire Stik
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Josep Carreras Leukaemia Research Institute, Badalona, Spain
| | - Yaotian Zhang
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Sebastian D Mackowiak
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Hannah M Wieler
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Francesca Rossi
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Rene Buschow
- Microscopy Core Facility, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Marie Christou-Kent
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Marc Alcoverro-Bertran
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Thomas Graf
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Denes Hnisz
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany.
| |
Collapse
|
25
|
Hummel NFC, Markel K, Stefani J, Staller MV, Shih PM. Systematic identification of transcriptional activation domains from non-transcription factor proteins in plants and yeast. Cell Syst 2024; 15:662-672.e4. [PMID: 38866009 DOI: 10.1016/j.cels.2024.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 04/26/2024] [Accepted: 05/22/2024] [Indexed: 06/14/2024]
Abstract
Transcription factors can promote gene expression through activation domains. Whole-genome screens have systematically mapped activation domains in transcription factors but not in non-transcription factor proteins (e.g., chromatin regulators and coactivators). To fill this knowledge gap, we employed the activation domain predictor PADDLE to analyze the proteomes of Arabidopsis thaliana and Saccharomyces cerevisiae. We screened 18,000 predicted activation domains from >800 non-transcription factor genes in both species, confirming that 89% of candidate proteins contain active fragments. Our work enables the annotation of hundreds of nuclear proteins as putative coactivators, many of which have never been ascribed any function in plants. Analysis of peptide sequence compositions reveals how the distribution of key amino acids dictates activity. Finally, we validated short, "universal" activation domains with comparable performance to state-of-the-art activation domains used for genome engineering. Our approach enables the genome-wide discovery and annotation of activation domains that can function across diverse eukaryotes.
Collapse
Affiliation(s)
- Niklas F C Hummel
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Department of Biology, Technische Universität Darmstadt, 64287 Darmstadt, Germany
| | - Kasey Markel
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jordan Stefani
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
| | - Max V Staller
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, CA 94720, USA; Chan Zuckerberg Biohub-San Francisco, San Francisco, CA 9415, USA.
| | - Patrick M Shih
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, CA 94720, USA.
| |
Collapse
|
26
|
Ginell GM, Emenecker RJ, Lotthammer JM, Usher ET, Holehouse AS. Direct prediction of intermolecular interactions driven by disordered regions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.03.597104. [PMID: 38895487 PMCID: PMC11185574 DOI: 10.1101/2024.06.03.597104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for a wide variety of cellular functions, many of which involve interactions with partner proteins. Molecular recognition is typically considered through the lens of sequence-specific binding events. However, a growing body of work has shown that IDRs often interact with partners in a manner that does not depend on the precise order of the amino acid order, instead driven by complementary chemical interactions leading to disordered bound-state complexes. Despite this emerging paradigm, we lack tools to describe, quantify, predict, and interpret these types of structurally heterogeneous interactions from the underlying amino acid sequences. Here, we repurpose the chemical physics developed originally for molecular simulations to develop an approach for predicting intermolecular interactions between IDRs and partner proteins. Our approach enables the direct prediction of phase diagrams, the identification of chemically-specific interaction hotspots on IDRs, and a route to develop and test mechanistic hypotheses regarding IDR function in the context of molecular recognition. We use our approach to examine a range of systems and questions to highlight its versatility and applicability.
Collapse
Affiliation(s)
- Garrett M. Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Ryan. J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Jeffrey M. Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Emery T. Usher
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| |
Collapse
|
27
|
Gonçalves AAM, Ribeiro AJ, Resende CAA, Couto CAP, Gandra IB, Dos Santos Barcelos IC, da Silva JO, Machado JM, Silva KA, Silva LS, Dos Santos M, da Silva Lopes L, de Faria MT, Pereira SP, Xavier SR, Aragão MM, Candida-Puma MA, de Oliveira ICM, Souza AA, Nogueira LM, da Paz MC, Coelho EAF, Giunchetti RC, de Freitas SM, Chávez-Fumagalli MA, Nagem RAP, Galdino AS. Recombinant multiepitope proteins expressed in Escherichia coli cells and their potential for immunodiagnosis. Microb Cell Fact 2024; 23:145. [PMID: 38778337 PMCID: PMC11110257 DOI: 10.1186/s12934-024-02418-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 05/07/2024] [Indexed: 05/25/2024] Open
Abstract
Recombinant multiepitope proteins (RMPs) are a promising alternative for application in diagnostic tests and, given their wide application in the most diverse diseases, this review article aims to survey the use of these antigens for diagnosis, as well as discuss the main points surrounding these antigens. RMPs usually consisting of linear, immunodominant, and phylogenetically conserved epitopes, has been applied in the experimental diagnosis of various human and animal diseases, such as leishmaniasis, brucellosis, cysticercosis, Chagas disease, hepatitis, leptospirosis, leprosy, filariasis, schistosomiasis, dengue, and COVID-19. The synthetic genes for these epitopes are joined to code a single RMP, either with spacers or fused, with different biochemical properties. The epitopes' high density within the RMPs contributes to a high degree of sensitivity and specificity. The RMPs can also sidestep the need for multiple peptide synthesis or multiple recombinant proteins, reducing costs and enhancing the standardization conditions for immunoassays. Methods such as bioinformatics and circular dichroism have been widely applied in the development of new RMPs, helping to guide their construction and better understand their structure. Several RMPs have been expressed, mainly using the Escherichia coli expression system, highlighting the importance of these cells in the biotechnological field. In fact, technological advances in this area, offering a wide range of different strains to be used, make these cells the most widely used expression platform. RMPs have been experimentally used to diagnose a broad range of illnesses in the laboratory, suggesting they could also be useful for accurate diagnoses commercially. On this point, the RMP method offers a tempting substitute for the production of promising antigens used to assemble commercial diagnostic kits.
Collapse
Affiliation(s)
- Ana Alice Maia Gonçalves
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Anna Julia Ribeiro
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Carlos Ananias Aparecido Resende
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Carolina Alves Petit Couto
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Isadora Braga Gandra
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Isabelle Caroline Dos Santos Barcelos
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Jonatas Oliveira da Silva
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Juliana Martins Machado
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Kamila Alves Silva
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Líria Souza Silva
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Michelli Dos Santos
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Lucas da Silva Lopes
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Mariana Teixeira de Faria
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Sabrina Paula Pereira
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Sandra Rodrigues Xavier
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Matheus Motta Aragão
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
| | - Mayron Antonio Candida-Puma
- Computational Biology and Chemistry Research Group, Vicerrectorado de Investigación, Universidad Católica de Santa María, Arequipa, 04000, Peru
| | | | - Amanda Araujo Souza
- Biophysics Laboratory, Institute of Biological Sciences, Department of Cell Biology, University of Brasilia, Brasília, 70910-900, Brazil
| | - Lais Moreira Nogueira
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Mariana Campos da Paz
- Bioactives and Nanobiotechnology Laboratory, Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
| | - Eduardo Antônio Ferraz Coelho
- Postgraduate Program in Health Sciences, Infectious Diseases and Tropical Medicine, Faculty of Medicine, Federal University of Minas Gerais, Belo Horizonte, 30130-100, Brazil
| | - Rodolfo Cordeiro Giunchetti
- Laboratory of Biology of Cell Interactions, National Institute of Science and Technology on Tropical Diseases (INCT-DT), Department of Morphology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
| | - Sonia Maria de Freitas
- Biophysics Laboratory, Institute of Biological Sciences, Department of Cell Biology, University of Brasilia, Brasília, 70910-900, Brazil
| | - Miguel Angel Chávez-Fumagalli
- Computational Biology and Chemistry Research Group, Vicerrectorado de Investigación, Universidad Católica de Santa María, Arequipa, 04000, Peru
| | - Ronaldo Alves Pinto Nagem
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
| | - Alexsandro Sobreira Galdino
- Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil.
| |
Collapse
|
28
|
Monté D, Lens Z, Dewitte F, Villeret V, Verger A. Assessment of machine-learning predictions for the Mediator complex subunit MED25 ACID domain interactions with transactivation domains. FEBS Lett 2024; 598:758-773. [PMID: 38436147 DOI: 10.1002/1873-3468.14837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/01/2024] [Accepted: 02/10/2024] [Indexed: 03/05/2024]
Abstract
The human Mediator complex subunit MED25 binds transactivation domains (TADs) present in various cellular and viral proteins using two binding interfaces, named H1 and H2, which are found on opposite sides of its ACID domain. Here, we use and compare deep learning methods to characterize human MED25-TAD interfaces and assess the predicted models to published experimental data. For the H1 interface, AlphaFold produces predictions with high-reliability scores that agree well with experimental data, while the H2 interface predictions appear inconsistent, preventing reliable binding modes. Despite these limitations, we experimentally assess the validity of MED25 interface predictions with the viral transcriptional activators Lana-1 and IE62. AlphaFold predictions also suggest the existence of a unique hydrophobic pocket for the Arabidopsis MED25 ACID domain.
Collapse
Affiliation(s)
- Didier Monté
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Zoé Lens
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Frédérique Dewitte
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Vincent Villeret
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Alexis Verger
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| |
Collapse
|
29
|
Shepherdson JL, Granas DM, Li J, Shariff Z, Plassmeyer SP, Holehouse AS, White MA, Cohen BA. Mutational scanning of CRX classifies clinical variants and reveals biochemical properties of the transcriptional effector domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.21.585809. [PMID: 38585983 PMCID: PMC10996540 DOI: 10.1101/2024.03.21.585809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Cone-Rod Homeobox, encoded by CRX, is a transcription factor (TF) essential for the terminal differentiation and maintenance of mammalian photoreceptors. Structurally, CRX comprises an ordered DNA-binding homeodomain and an intrinsically disordered transcriptional effector domain. Although a handful of human variants in CRX have been shown to cause several different degenerative retinopathies with varying cone and rod predominance, as with most human disease genes the vast majority of observed CRX genetic variants are uncharacterized variants of uncertain significance (VUS). We performed a deep mutational scan (DMS) of nearly all possible single amino acid substitution variants in CRX, using an engineered cell-based transcriptional reporter assay. We measured the ability of each CRX missense variant to transactivate a synthetic fluorescent reporter construct in a pooled fluorescence-activated cell sorting assay and compared the activation strength of each variant to that of wild-type CRX to compute an activity score, identifying thousands of variants with altered transcriptional activity. We calculated a statistical confidence for each activity score derived from multiple independent measurements of each variant marked by unique sequence barcodes, curating a high-confidence list of nearly 2,000 variants with significantly altered transcriptional activity compared to wild-type CRX. We evaluated the performance of the DMS assay as a clinical variant classification tool using gold-standard classified human variants from ClinVar, and determined that activity scores could be used to identify pathogenic variants with high specificity. That this performance could be achieved using a synthetic reporter assay in a foreign cell type, even for a highly cell type-specific TF like CRX, suggests that this approach shows promise for DMS of other TFs that function in cell types that are not easily accessible. Per-position average activity scores closely aligned to a predicted structure of the ordered homeodomain and demonstrated position-specific residue requirements. The intrinsically disordered transcriptional effector domain, by contrast, displayed a qualitatively different pattern of substitution effects, following compositional constraints without specific residue position requirements in the peptide chain. The observed compositional constraints of the effector domain were consistent with the acidic exposure model of transcriptional activation. Together, the results of the CRX DMS identify molecular features of the CRX effector domain and demonstrate clinical utility for variant classification.
Collapse
Affiliation(s)
- James L. Shepherdson
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - David M. Granas
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Jie Li
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Zara Shariff
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Stephen P. Plassmeyer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Michael A. White
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Barak A. Cohen
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| |
Collapse
|
30
|
Mindel V, Brodsky S, Cohen A, Manadre W, Jonas F, Carmi M, Barkai N. Intrinsically disordered regions of the Msn2 transcription factor encode multiple functions using interwoven sequence grammars. Nucleic Acids Res 2024; 52:2260-2272. [PMID: 38109289 PMCID: PMC10954448 DOI: 10.1093/nar/gkad1191] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/04/2023] [Accepted: 12/11/2023] [Indexed: 12/20/2023] Open
Abstract
Intrinsically disordered regions (IDRs) are abundant in eukaryotic proteins, but their sequence-function relationship remains poorly understood. IDRs of transcription factors (TFs) can direct promoter selection and recruit coactivators, as shown for the budding yeast TF Msn2. To examine how IDRs encode both these functions, we compared genomic binding specificity, coactivator recruitment, and gene induction amongst a large set of designed Msn2-IDR mutants. We find that both functions depend on multiple regions across the > 600AA IDR. Yet, transcription activity was readily disrupted by mutations that showed no effect on the Msn2 binding specificity. Our data attribute this differential sensitivity to the integration of a relaxed, composition-based code directing binding specificity with a more stringent, motif-based code controlling the recruitment of coactivators and transcription activity. Therefore, Msn2 utilizes interwoven sequence grammars for encoding multiple functions, suggesting a new IDR design paradigm of potentially general use.
Collapse
Affiliation(s)
- Vladimir Mindel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sagie Brodsky
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Aileen Cohen
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Wajd Manadre
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Miri Carmi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
31
|
Gan P, Eppert M, De La Cruz N, Lyons H, Shah AM, Veettil RT, Chen K, Pradhan P, Bezprozvannaya S, Xu L, Liu N, Olson EN, Sabari BR. Coactivator condensation drives cardiovascular cell lineage specification. SCIENCE ADVANCES 2024; 10:eadk7160. [PMID: 38489358 PMCID: PMC10942106 DOI: 10.1126/sciadv.adk7160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 02/12/2024] [Indexed: 03/17/2024]
Abstract
During development, cells make switch-like decisions to activate new gene programs specifying cell lineage. The mechanisms underlying these decisive choices remain unclear. Here, we show that the cardiovascular transcriptional coactivator myocardin (MYOCD) activates cell identity genes by concentration-dependent and switch-like formation of transcriptional condensates. MYOCD forms such condensates and activates cell identity genes at critical concentration thresholds achieved during smooth muscle cell and cardiomyocyte differentiation. The carboxyl-terminal disordered region of MYOCD is necessary and sufficient for condensate formation. Disrupting this region's ability to form condensates disrupts gene activation and smooth muscle cell reprogramming. Rescuing condensate formation by replacing this region with disordered regions from functionally unrelated proteins rescues gene activation and smooth muscle cell reprogramming. Our findings demonstrate that MYOCD condensate formation is required for gene activation during cardiovascular differentiation. We propose that the formation of transcriptional condensates at critical concentrations of cell type-specific regulators provides a molecular switch underlying the activation of key cell identity genes during development.
Collapse
Affiliation(s)
- Peiheng Gan
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Mikayla Eppert
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Nancy De La Cruz
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Heankel Lyons
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Akansha M. Shah
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Reshma T. Veettil
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Kenian Chen
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Prashant Pradhan
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Svetlana Bezprozvannaya
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Lin Xu
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Ning Liu
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Eric N. Olson
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Benjamin R. Sabari
- Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| |
Collapse
|
32
|
Swint-Kruse L, Fenton AW. Rheostats, toggles, and neutrals, Oh my! A new framework for understanding how amino acid changes modulate protein function. J Biol Chem 2024; 300:105736. [PMID: 38336297 PMCID: PMC10914490 DOI: 10.1016/j.jbc.2024.105736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
Advances in personalized medicine and protein engineering require accurately predicting outcomes of amino acid substitutions. Many algorithms correctly predict that evolutionarily-conserved positions show "toggle" substitution phenotypes, which is defined when a few substitutions at that position retain function. In contrast, predictions often fail for substitutions at the less-studied "rheostat" positions, which are defined when different amino acid substitutions at a position sample at least half of the possible functional range. This review describes efforts to understand the impact and significance of rheostat positions: (1) They have been observed in globular soluble, integral membrane, and intrinsically disordered proteins; within single proteins, their prevalence can be up to 40%. (2) Substitutions at rheostat positions can have biological consequences and ∼10% of substitutions gain function. (3) Although both rheostat and "neutral" (defined when all substitutions exhibit wild-type function) positions are nonconserved, the two classes have different evolutionary signatures. (4) Some rheostat positions have pleiotropic effects on function, simultaneously modulating multiple parameters (e.g., altering both affinity and allosteric coupling). (5) In structural studies, substitutions at rheostat positions appear to cause only local perturbations; the overall conformations appear unchanged. (6) Measured functional changes show promising correlations with predicted changes in protein dynamics; the emergent properties of predicted, dynamically coupled amino acid networks might explain some of the complex functional outcomes observed when substituting rheostat positions. Overall, rheostat positions provide unique opportunities for using single substitutions to tune protein function. Future studies of these positions will yield important insights into the protein sequence/function relationship.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
33
|
Holehouse AS, Kragelund BB. The molecular basis for cellular function of intrinsically disordered protein regions. Nat Rev Mol Cell Biol 2024; 25:187-211. [PMID: 37957331 PMCID: PMC11459374 DOI: 10.1038/s41580-023-00673-0] [Citation(s) in RCA: 175] [Impact Index Per Article: 175.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Intrinsically disordered protein regions exist in a collection of dynamic interconverting conformations that lack a stable 3D structure. These regions are structurally heterogeneous, ubiquitous and found across all kingdoms of life. Despite the absence of a defined 3D structure, disordered regions are essential for cellular processes ranging from transcriptional control and cell signalling to subcellular organization. Through their conformational malleability and adaptability, disordered regions extend the repertoire of macromolecular interactions and are readily tunable by their structural and chemical context, making them ideal responders to regulatory cues. Recent work has led to major advances in understanding the link between protein sequence and conformational behaviour in disordered regions, yet the link between sequence and molecular function is less well defined. Here we consider the biochemical and biophysical foundations that underlie how and why disordered regions can engage in productive cellular functions, provide examples of emerging concepts and discuss how protein disorder contributes to intracellular information processing and regulation of cellular function.
Collapse
Affiliation(s)
- Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St Louis, St Louis, MO, USA.
| | - Birthe B Kragelund
- REPIN, Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
34
|
Zheng Y, Chen S. Transcriptional precision in photoreceptor development and diseases - Lessons from 25 years of CRX research. Front Cell Neurosci 2024; 18:1347436. [PMID: 38414750 PMCID: PMC10896975 DOI: 10.3389/fncel.2024.1347436] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 01/19/2024] [Indexed: 02/29/2024] Open
Abstract
The vertebrate retina is made up of six specialized neuronal cell types and one glia that are generated from a common retinal progenitor. The development of these distinct cell types is programmed by transcription factors that regulate the expression of specific genes essential for cell fate specification and differentiation. Because of the complex nature of transcriptional regulation, understanding transcription factor functions in development and disease is challenging. Research on the Cone-rod homeobox transcription factor CRX provides an excellent model to address these challenges. In this review, we reflect on 25 years of mammalian CRX research and discuss recent progress in elucidating the distinct pathogenic mechanisms of four CRX coding variant classes. We highlight how in vitro biochemical studies of CRX protein functions facilitate understanding CRX regulatory principles in animal models. We conclude with a brief discussion of the emerging systems biology approaches that could accelerate precision medicine for CRX-linked diseases and beyond.
Collapse
Affiliation(s)
- Yiqiao Zheng
- Molecular Genetics and Genomics Graduate Program, Division of Biological and Biomedical Sciences, Saint Louis, MO, United States
- Department of Ophthalmology and Visual Sciences, Saint Louis, MO, United States
| | - Shiming Chen
- Molecular Genetics and Genomics Graduate Program, Division of Biological and Biomedical Sciences, Saint Louis, MO, United States
- Department of Ophthalmology and Visual Sciences, Saint Louis, MO, United States
- Department of Developmental Biology, Washington University in St. Louis, Saint Louis, MO, United States
| |
Collapse
|
35
|
Mihalič F, Arcila D, Pettersson ME, Farkhondehkish P, Andersson E, Andersson L, Betancur-R R, Jemth P. Conservation of Affinity Rather Than Sequence Underlies a Dynamic Evolution of the Motif-Mediated p53/MDM2 Interaction in Ray-Finned Fishes. Mol Biol Evol 2024; 41:msae018. [PMID: 38301272 PMCID: PMC10901556 DOI: 10.1093/molbev/msae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 12/12/2023] [Accepted: 01/22/2024] [Indexed: 02/03/2024] Open
Abstract
The transcription factor and cell cycle regulator p53 is marked for degradation by the ubiquitin ligase MDM2. The interaction between these 2 proteins is mediated by a conserved binding motif in the disordered p53 transactivation domain (p53TAD) and the folded SWIB domain in MDM2. The conserved motif in p53TAD from zebrafish displays a 20-fold weaker interaction with MDM2, compared to the interaction in human and chicken. To investigate this apparent difference, we tracked the molecular evolution of the p53TAD/MDM2 interaction among ray-finned fishes (Actinopterygii), the largest vertebrate clade. Intriguingly, phylogenetic analyses, ancestral sequence reconstructions, and binding experiments showed that different loss-of-affinity changes in the canonical binding motif within p53TAD have occurred repeatedly and convergently in different fish lineages, resulting in relatively low extant affinities (KD = 0.5 to 5 μM). However, for 11 different fish p53TAD/MDM2 interactions, nonconserved regions flanking the canonical motif increased the affinity 4- to 73-fold to be on par with the human interaction. Our findings suggest that compensating changes at conserved and nonconserved positions within the motif, as well as in flanking regions of low conservation, underlie a stabilizing selection of "functional affinity" in the p53TAD/MDM2 interaction. Such interplay complicates bioinformatic prediction of binding and calls for experimental validation. Motif-mediated protein-protein interactions involving short binding motifs and folded interaction domains are very common across multicellular life. It is likely that the evolution of affinity in motif-mediated interactions often involves an interplay between specific interactions made by conserved motif residues and nonspecific interactions by nonconserved disordered regions.
Collapse
Affiliation(s)
- Filip Mihalič
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Dahiana Arcila
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Mats E Pettersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Pouria Farkhondehkish
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77483, USA
| | - Ricardo Betancur-R
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| |
Collapse
|
36
|
Udupa A, Kotha SR, Staller MV. Commonly asked questions about transcriptional activation domains. Curr Opin Struct Biol 2024; 84:102732. [PMID: 38056064 PMCID: PMC11193542 DOI: 10.1016/j.sbi.2023.102732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 10/23/2023] [Accepted: 10/27/2023] [Indexed: 12/08/2023]
Abstract
Eukaryotic transcription factors activate gene expression with their DNA-binding domains and activation domains. DNA-binding domains bind the genome by recognizing structurally related DNA sequences; they are structured, conserved, and predictable from protein sequences. Activation domains recruit chromatin modifiers, coactivator complexes, or basal transcriptional machinery via structurally diverse protein-protein interactions. Activation domains and DNA-binding domains have been called independent, modular units, but there are many departures from modularity, including interactions between these regions and overlap in function. Compared to DNA-binding domains, activation domains are poorly understood because they are poorly conserved, intrinsically disordered, and difficult to predict from protein sequences. This review, organized around commonly asked questions, describes recent progress that the field has made in understanding the sequence features that control activation domains and predicting them from sequence.
Collapse
Affiliation(s)
- Aditya Udupa
- Department of Molecular and Cell Biology, University of California, Berkeley, 94720, USA
| | - Sanjana R Kotha
- Department of Molecular and Cell Biology, University of California, Berkeley, 94720, USA; Center for Computational Biology, University of California, Berkeley, 94720, USA
| | - Max V Staller
- Department of Molecular and Cell Biology, University of California, Berkeley, 94720, USA; Center for Computational Biology, University of California, Berkeley, 94720, USA; Chan Zuckerberg Biohub-San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
37
|
Sreenivasan S, Heffren P, Suh K, Rodnin MV, Kosa E, Fenton AW, Ladokhin AS, Smith PE, Fontes JD, Swint‐Kruse L. The intrinsically disordered transcriptional activation domain of CIITA is functionally tuneable by single substitutions: An exception or a new paradigm? Protein Sci 2024; 33:e4863. [PMID: 38073129 PMCID: PMC10806935 DOI: 10.1002/pro.4863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 12/04/2023] [Accepted: 12/07/2023] [Indexed: 01/27/2024]
Abstract
During protein evolution, some amino acid substitutions modulate protein function ("tuneability"). In most proteins, the tuneable range is wide and can be sampled by a set of protein variants that each contains multiple amino acid substitutions. In other proteins, the full tuneable range can be accessed by a set of variants that each contains a single substitution. Indeed, in some globular proteins, the full tuneable range can be accessed by the set of site-saturating substitutions at an individual "rheostat" position. However, in proteins with intrinsically disordered regions (IDRs), most functional studies-which would also detect tuneability-used multiple substitutions or small deletions. In disordered transcriptional activation domains (ADs), studies with multiple substitutions led to the "acidic exposure" model, which does not anticipate the existence of rheostat positions. In the few studies that did assess effects of single substitutions on AD function, results were mixed: the ADs of two full-length transcription factors did not show tuneability, whereas a fragment of a third AD was tuneable by single substitutions. In this study, we tested tuneability in the AD of full-length human class II transactivator (CIITA). Sequence analyses and experiments showed that CIITA's AD is an IDR. Functional assays of singly-substituted AD variants showed that CIITA's function was highly tuneable, with outcomes not predicted by the acidic exposure model. Four tested positions showed rheostat behavior for transcriptional activation. Thus, tuneability of different IDRs can vary widely. Future studies are needed to illuminate the biophysical features that govern whether an IDR is tuneable by single substitutions.
Collapse
Affiliation(s)
- Shwetha Sreenivasan
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Paul Heffren
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
- Present address:
Department of BiosciencesKansas City UniversityKansas CityMissouriUSA
| | - Kyung‐Shin Suh
- Department of ChemistryKansas State UniversityManhattanKansasUSA
| | - Mykola V. Rodnin
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Edina Kosa
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Alexey S. Ladokhin
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Paul E. Smith
- Department of ChemistryKansas State UniversityManhattanKansasUSA
| | - Joseph D. Fontes
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| |
Collapse
|
38
|
Theisen FF, Prestel A, Elkjær S, Leurs YHA, Morffy N, Strader LC, O'Shea C, Teilum K, Kragelund BB, Skriver K. Molecular switching in transcription through splicing and proline-isomerization regulates stress responses in plants. Nat Commun 2024; 15:592. [PMID: 38238333 PMCID: PMC10796322 DOI: 10.1038/s41467-024-44859-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 01/09/2024] [Indexed: 01/22/2024] Open
Abstract
The Arabidopsis thaliana DREB2A transcription factor interacts with the negative regulator RCD1 and the ACID domain of subunit 25 of the transcriptional co-regulator mediator (Med25) to integrate stress signals for gene expression, with elusive molecular interplay. Using biophysical and structural analyses together with high-throughput screening, we reveal a bivalent binding switch in DREB2A containing an ACID-binding motif (ABS) and the known RCD1-binding motif (RIM). The RIM is lacking in a stress-induced DREB2A splice variant with retained transcriptional activity. ABS and RIM bind to separate sites on Med25-ACID, and NMR analyses show a structurally heterogeneous complex deriving from a DREB2A-ABS proline residue populating cis- and trans-isomers with remote impact on the RIM. The cis-isomer stabilizes an α-helix, while the trans-isomer may introduce energetic frustration facilitating rapid exchange between activators and repressors. Thus, DREB2A uses a post-transcriptionally and post-translationally modulated switch for transcriptional regulation.
Collapse
Affiliation(s)
- Frederik Friis Theisen
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Andreas Prestel
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Steffie Elkjær
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Yannick H A Leurs
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Charlotte O'Shea
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kaare Teilum
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Birthe B Kragelund
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Karen Skriver
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
39
|
DelRosso N, Bintu L. Using High-Throughput Measurements to Identify Principles of Transcriptional and Epigenetic Regulators. Methods Mol Biol 2024; 2842:79-101. [PMID: 39012591 DOI: 10.1007/978-1-0716-4051-7_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
To achieve exquisite control over the epigenome, we need a better predictive understanding of how transcription factors, chromatin regulators, and their individual domain's function, both as modular parts and as full proteins. Transcriptional effector domains are one class of protein domains that regulate transcription and chromatin. These effector domains either repress or activate gene expression by interacting with chromatin-modifying enzymes, transcriptional cofactors, and/or general transcriptional machinery. Here, we discuss important design considerations for high-throughput investigations of effector domains, recent advances in discovering new domains in human cells and testing how domain function depends on amino acid sequence. For every effector domain, we would like to know the following: What role does the cell type, signaling state, and targeted context have on activation, silencing, and epigenetic memory? Large-scale measurements of transcriptional activities can help systematically answer these questions and identify general rules for how all these parameters affect effector domain activities. Last, we discuss what steps need to be taken to turn a newly discovered effector domain into a robust, precise epigenome editor. With more carefully considered high-throughput investigations, soon we will have better predictive control over the epigenome.
Collapse
|
40
|
Moses D, Ginell GM, Holehouse AS, Sukenik S. Intrinsically disordered regions are poised to act as sensors of cellular chemistry. Trends Biochem Sci 2023; 48:1019-1034. [PMID: 37657994 PMCID: PMC10840941 DOI: 10.1016/j.tibs.2023.08.001] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Revised: 07/31/2023] [Accepted: 08/01/2023] [Indexed: 09/03/2023]
Abstract
Intrinsically disordered proteins and protein regions (IDRs) are abundant in eukaryotic proteomes and play a wide variety of essential roles. Instead of folding into a stable structure, IDRs exist in an ensemble of interconverting conformations whose structure is biased by sequence-dependent interactions. The absence of a stable 3D structure, combined with high solvent accessibility, means that IDR conformational biases are inherently sensitive to changes in their environment. Here, we argue that IDRs are ideally poised to act as sensors and actuators of cellular physicochemistry. We review the physical principles that underlie IDR sensitivity, the molecular mechanisms that translate this sensitivity to function, and recent studies where environmental sensing by IDRs may play a key role in their downstream function.
Collapse
Affiliation(s)
- David Moses
- Department of Chemistry and Biochemistry, University of California, Merced, CA, USA
| | - Garrett M Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA; Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA; Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO, USA.
| | - Shahar Sukenik
- Department of Chemistry and Biochemistry, University of California, Merced, CA, USA; Quantitative Systems Biology Program, University of California, Merced, CA, USA.
| |
Collapse
|
41
|
Maes S, Deploey N, Peelman F, Eyckerman S. Deep mutational scanning of proteins in mammalian cells. CELL REPORTS METHODS 2023; 3:100641. [PMID: 37963462 PMCID: PMC10694495 DOI: 10.1016/j.crmeth.2023.100641] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 07/06/2023] [Accepted: 10/20/2023] [Indexed: 11/16/2023]
Abstract
Protein mutagenesis is essential for unveiling the molecular mechanisms underlying protein function in health, disease, and evolution. In the past decade, deep mutational scanning methods have evolved to support the functional analysis of nearly all possible single-amino acid changes in a protein of interest. While historically these methods were developed in lower organisms such as E. coli and yeast, recent technological advancements have resulted in the increased use of mammalian cells, particularly for studying proteins involved in human disease. These advancements will aid significantly in the classification and interpretation of variants of unknown significance, which are being discovered at large scale due to the current surge in the use of whole-genome sequencing in clinical contexts. Here, we explore the experimental aspects of deep mutational scanning studies in mammalian cells and report the different methods used in each step of the workflow, ultimately providing a useful guide toward the design of such studies.
Collapse
Affiliation(s)
- Stefanie Maes
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biochemistry and Microbiology, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Nick Deploey
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Frank Peelman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Sven Eyckerman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium.
| |
Collapse
|
42
|
Patil A, Strom AR, Paulo JA, Collings CK, Ruff KM, Shinn MK, Sankar A, Cervantes KS, Wauer T, St Laurent JD, Xu G, Becker LA, Gygi SP, Pappu RV, Brangwynne CP, Kadoch C. A disordered region controls cBAF activity via condensation and partner recruitment. Cell 2023; 186:4936-4955.e26. [PMID: 37788668 PMCID: PMC10792396 DOI: 10.1016/j.cell.2023.08.032] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 07/16/2023] [Accepted: 08/24/2023] [Indexed: 10/05/2023]
Abstract
Intrinsically disordered regions (IDRs) represent a large percentage of overall nuclear protein content. The prevailing dogma is that IDRs engage in non-specific interactions because they are poorly constrained by evolutionary selection. Here, we demonstrate that condensate formation and heterotypic interactions are distinct and separable features of an IDR within the ARID1A/B subunits of the mSWI/SNF chromatin remodeler, cBAF, and establish distinct "sequence grammars" underlying each contribution. Condensation is driven by uniformly distributed tyrosine residues, and partner interactions are mediated by non-random blocks rich in alanine, glycine, and glutamine residues. These features concentrate a specific cBAF protein-protein interaction network and are essential for chromatin localization and activity. Importantly, human disease-associated perturbations in ARID1B IDR sequence grammars disrupt cBAF function in cells. Together, these data identify IDR contributions to chromatin remodeling and explain how phase separation provides a mechanism through which both genomic localization and functional partner recruitment are achieved.
Collapse
Affiliation(s)
- Ajinkya Patil
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Program in Virology, Harvard Medical School, Boston, MA 02115, USA
| | - Amy R Strom
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA
| | - Joao A Paulo
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Clayton K Collings
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kiersten M Ruff
- Department of Biomedical Engineering and Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Min Kyung Shinn
- Department of Biomedical Engineering and Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Akshay Sankar
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kasey S Cervantes
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Tobias Wauer
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA
| | - Jessica D St Laurent
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Department of Obstetrics and Gynecology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Grace Xu
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Lindsay A Becker
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA
| | - Steven P Gygi
- Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Clifford P Brangwynne
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA; Howard Hughes Medical Institute, Chevy Chase, MD 21044, USA; Omenn-Darling Bioengineering Institute, Princeton University, Princeton, NJ 08544, USA.
| | - Cigall Kadoch
- Department of Pediatric Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Chevy Chase, MD 21044, USA.
| |
Collapse
|
43
|
Kind L, Driver M, Raasakka A, Onck PR, Njølstad PR, Arnesen T, Kursula P. Structural properties of the HNF-1A transactivation domain. Front Mol Biosci 2023; 10:1249939. [PMID: 37908230 PMCID: PMC10613711 DOI: 10.3389/fmolb.2023.1249939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 09/26/2023] [Indexed: 11/02/2023] Open
Abstract
Hepatocyte nuclear factor 1α (HNF-1A) is a transcription factor with important gene regulatory roles in pancreatic β-cells. HNF1A gene variants are associated with a monogenic form of diabetes (HNF1A-MODY) or an increased risk for type 2 diabetes. While several pancreatic target genes of HNF-1A have been described, a lack of knowledge regarding the structure-function relationships in HNF-1A prohibits a detailed understanding of HNF-1A-mediated gene transcription, which is important for precision medicine and improved patient care. Therefore, we aimed to characterize the understudied transactivation domain (TAD) of HNF-1A in vitro. We present a bioinformatic approach to dissect the TAD sequence, analyzing protein structure, sequence composition, sequence conservation, and the existence of protein interaction motifs. Moreover, we developed the first protocol for the recombinant expression and purification of the HNF-1A TAD. Small-angle X-ray scattering and synchrotron radiation circular dichroism suggested a disordered conformation for the TAD. Furthermore, we present functional data on HNF-1A undergoing liquid-liquid phase separation, which is in line with in silico predictions and may be of biological relevance for gene transcriptional processes in pancreatic β-cells.
Collapse
Affiliation(s)
- Laura Kind
- Department of Biomedicine, University of Bergen, Bergen, Norway
| | - Mark Driver
- Zernike Institute for Advanced Materials, University of Groningen, Groningen, Netherlands
| | - Arne Raasakka
- Department of Biomedicine, University of Bergen, Bergen, Norway
| | - Patrick R. Onck
- Zernike Institute for Advanced Materials, University of Groningen, Groningen, Netherlands
| | - Pål Rasmus Njølstad
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen, Norway
- Section of Endocrinology and Metabolism, Children and Youth Clinic, Haukeland University Hospital, Bergen, Norway
| | - Thomas Arnesen
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Department of Surgery, Haukeland University Hospital, Bergen, Norway
| | - Petri Kursula
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| |
Collapse
|
44
|
Kotha SR, Staller MV. Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence. Genetics 2023; 225:iyad131. [PMID: 37462277 PMCID: PMC10550315 DOI: 10.1093/genetics/iyad131] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 07/03/2023] [Indexed: 10/06/2023] Open
Abstract
Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of this human predictor arises from the clustering of aromatic, leucine, and acidic residues, which together are necessary for acidic activation domain function. When we combine our predictor with the predictions of convolutional neural network (CNN) models trained in yeast, the intersection is more accurate than individual models, emphasizing that each approach carries orthogonal information. We synthesize these findings into a new set of activation domain predictions on human transcription factors.
Collapse
Affiliation(s)
- Sanjana R Kotha
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA 94720, USA
| | - Max Valentín Staller
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA 94720, USA
- Chan Zuckerberg Biohub—San Francisco, San Francisco, CA 94158, USA
| |
Collapse
|
45
|
Gitschlag BL, Cano AV, Payne JL, McCandlish DM, Stoltzfus A. Mutation and Selection Induce Correlations between Selection Coefficients and Mutation Rates. Am Nat 2023; 202:534-557. [PMID: 37792926 DOI: 10.1086/726014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]
Abstract
AbstractThe joint distribution of selection coefficients and mutation rates is a key determinant of the genetic architecture of molecular adaptation. Three different distributions are of immediate interest: (1) the "nominal" distribution of possible changes, prior to mutation or selection; (2) the "de novo" distribution of realized mutations; and (3) the "fixed" distribution of selectively established mutations. Here, we formally characterize the relationships between these joint distributions under the strong-selection/weak-mutation (SSWM) regime. The de novo distribution is enriched relative to the nominal distribution for the highest rate mutations, and the fixed distribution is further enriched for the most highly beneficial mutations. Whereas mutation rates and selection coefficients are often assumed to be uncorrelated, we show that even with no correlation in the nominal distribution, the resulting de novo and fixed distributions can have correlations with any combination of signs. Nonetheless, we suggest that natural systems with a finite number of beneficial mutations will frequently have the kind of nominal distribution that induces negative correlations in the fixed distribution. We apply our mathematical framework, along with population simulations, to explore joint distributions of selection coefficients and mutation rates from deep mutational scanning and cancer informatics. Finally, we consider the evolutionary implications of these joint distributions together with two additional joint distributions relevant to parallelism and the rate of adaptation.
Collapse
|
46
|
Mukund AX, Tycko J, Allen SJ, Robinson SA, Andrews C, Sinha J, Ludwig CH, Spees K, Bassik MC, Bintu L. High-throughput functional characterization of combinations of transcriptional activators and repressors. Cell Syst 2023; 14:746-763.e5. [PMID: 37543039 PMCID: PMC10642976 DOI: 10.1016/j.cels.2023.07.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 06/26/2023] [Accepted: 07/06/2023] [Indexed: 08/07/2023]
Abstract
Despite growing knowledge of the functions of individual human transcriptional effector domains, much less is understood about how multiple effector domains within the same protein combine to regulate gene expression. Here, we measure transcriptional activity for 8,400 effector domain combinations by recruiting them to reporter genes in human cells. In our assay, weak and moderate activation domains synergize to drive strong gene expression, whereas combining strong activators often results in weaker activation. In contrast, repressors combine linearly and produce full gene silencing, and repressor domains often overpower activation domains. We use this information to build a synthetic transcription factor whose function can be tuned between repression and activation independent of recruitment to target genes by using a small-molecule drug. Altogether, we outline the basic principles of how effector domains combine to regulate gene expression and demonstrate their value in building precise and flexible synthetic biology tools. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Adi X Mukund
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Josh Tycko
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Sage J Allen
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | | | - Cecelia Andrews
- Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
| | - Joydeb Sinha
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA
| | - Connor H Ludwig
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Kaitlyn Spees
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Michael C Bassik
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Lacramioara Bintu
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
47
|
Hummel NFC, Markel K, Stefani J, Staller MV, Shih PM. Systematic identification of transcriptional activator domains from non-transcription factor proteins in plants and yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.12.557247. [PMID: 37745555 PMCID: PMC10515812 DOI: 10.1101/2023.09.12.557247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Transcription factors promote gene expression via trans-regulatory activation domains. Although whole genome scale screens in model organisms (e.g. human, yeast, fly) have helped identify activation domains from transcription factors, such screens have been less extensively used to explore the occurrence of activation domains in non-transcription factor proteins, such as transcriptional coactivators, chromatin regulators and some cytosolic proteins, leaving a blind spot on what role activation domains in these proteins could play in regulating transcription. We utilized the activation domain predictor PADDLE to mine the entire proteomes of two model eukaryotes, Arabidopsis thaliana and Saccharomyces cerevisiae ( 1 ). We characterized 18,000 fragments covering predicted activation domains from >800 non-transcription factor genes in both species, and experimentally validated that 89% of proteins contained fragments capable of activating transcription in yeast. Peptides with similar sequence composition show a broad range of activities, which is explained by the arrangement of key amino acids. We also annotated hundreds of nuclear proteins with activation domains as putative coactivators; many of which have never been ascribed any function in plants. Furthermore, our library contains >250 non-nuclear proteins containing peptides with activation domain function across both eukaryotic lineages, suggesting that there are unknown biological roles of these peptides beyond transcription. Finally, we identify and validate short, 'universal' eukaryotic activation domains that activate transcription in both yeast and plants with comparable or stronger performance to state-of-the-art activation domains. Overall, our dual host screen provides a blueprint on how to systematically discover novel genetic parts for synthetic biology that function across a wide diversity of eukaryotes. Significance Statement Activation domains promote transcription and play a critical role in regulating gene expression. Although the mapping of activation domains from transcription factors has been carried out in previous genome-wide screens, their occurrence in non-transcription factors has been less explored. We utilize an activation domain predictor to mine the entire proteomes of Arabidopsis thaliana and Saccharomyces cerevisiae for new activation domains on non-transcription factor proteins. We validate peptides derived from >750 non-transcription factor proteins capable of activating transcription, discovering many potentially new coactivators in plants. Importantly, we identify novel genetic parts that can function across both species, representing unique synthetic biology tools.
Collapse
|
48
|
Christou-Kent M, Cuartero S, Garcia-Cabau C, Ruehle J, Naderi J, Erber J, Neguembor MV, Plana-Carmona M, Alcoverro-Bertran M, De Andres-Aguayo L, Klonizakis A, Julià-Vilella E, Lynch C, Serrano M, Hnisz D, Salvatella X, Graf T, Stik G. CEBPA phase separation links transcriptional activity and 3D chromatin hubs. Cell Rep 2023; 42:112897. [PMID: 37516962 DOI: 10.1016/j.celrep.2023.112897] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 06/02/2023] [Accepted: 07/14/2023] [Indexed: 08/01/2023] Open
Abstract
Cell identity is orchestrated through an interplay between transcription factor (TF) action and genome architecture. The mechanisms used by TFs to shape three-dimensional (3D) genome organization remain incompletely understood. Here we present evidence that the lineage-instructive TF CEBPA drives extensive chromatin compartment switching and promotes the formation of long-range chromatin hubs during induced B cell-to-macrophage transdifferentiation. Mechanistically, we find that the intrinsically disordered region (IDR) of CEBPA undergoes in vitro phase separation (PS) dependent on aromatic residues. Both overexpressing B cells and native CEBPA-expressing cell types such as primary granulocyte-macrophage progenitors, liver cells, and trophectoderm cells reveal nuclear CEBPA foci and long-range 3D chromatin hubs at CEBPA-bound regions. In short, we show that CEBPA can undergo PS through its IDR, which may underlie in vivo foci formation and suggest a potential role of PS in regulating CEBPA function.
Collapse
Affiliation(s)
- Marie Christou-Kent
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Sergi Cuartero
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Spain; Germans Trias I Pujol Research Institute (IGTP), Badalona, Spain
| | - Carla Garcia-Cabau
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
| | - Julia Ruehle
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Julian Naderi
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| | - Julia Erber
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Maria Victoria Neguembor
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Marcos Plana-Carmona
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | - Luisa De Andres-Aguayo
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Antonios Klonizakis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | - Cian Lynch
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; Altos Labs, Cambridge Institute of Science, Cambridge CB21 6GP, UK
| | - Manuel Serrano
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; Altos Labs, Cambridge Institute of Science, Cambridge CB21 6GP, UK
| | - Denes Hnisz
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| | - Xavier Salvatella
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
| | - Thomas Graf
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain.
| | - Grégoire Stik
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Spain.
| |
Collapse
|
49
|
Abstract
Multivalent proteins and nucleic acids, collectively referred to as multivalent associative biomacromolecules, provide the driving forces for the formation and compositional regulation of biomolecular condensates. Here, we review the key concepts of phase transitions of aqueous solutions of associative biomacromolecules, specifically proteins that include folded domains and intrinsically disordered regions. The phase transitions of these systems come under the rubric of coupled associative and segregative transitions. The concepts underlying these processes are presented, and their relevance to biomolecular condensates is discussed.
Collapse
Affiliation(s)
- Rohit V. Pappu
- Department of Biomedical Engineering, Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Samuel R. Cohen
- Department of Biomedical Engineering, Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO 63130, USA
- Center of Regenerative Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Furqan Dar
- Department of Biomedical Engineering, Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Mina Farag
- Department of Biomedical Engineering, Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Mrityunjoy Kar
- Max Planck Institute of Cell Biology and Genetics, 01307 Dresden, Germany
| |
Collapse
|
50
|
Hummel NFC, Zhou A, Li B, Markel K, Ornelas IJ, Shih PM. The trans-regulatory landscape of gene networks in plants. Cell Syst 2023; 14:501-511.e4. [PMID: 37348464 DOI: 10.1016/j.cels.2023.05.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 03/21/2023] [Accepted: 05/11/2023] [Indexed: 06/24/2023]
Abstract
The transcriptional effector domains of transcription factors play a key role in controlling gene expression; however, their functional nature is poorly understood, hampering our ability to explore this fundamental dimension of gene regulatory networks. To map the trans-regulatory landscape in a complex eukaryote, we systematically characterized the putative transcriptional effector domains of over 400 Arabidopsis thaliana transcription factors for their capacity to modulate transcription. We demonstrate that transcriptional effector activity can be integrated into gene regulatory networks capable of elucidating the functional dynamics underlying gene expression patterns. We further show how our characterized domains can enhance genome engineering efforts and reveal how plant transcriptional activators share regulatory features conserved across distantly related eukaryotes. Our results provide a framework to systematically characterize the regulatory role of transcription factors at a genome-scale in order to understand the transcriptional wiring of biological systems.
Collapse
Affiliation(s)
- Niklas F C Hummel
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA; Department of Biology, Technische Universität Darmstadt, Darmstadt 64287, Germany
| | - Andy Zhou
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA
| | - Baohua Li
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA
| | - Kasey Markel
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA
| | - Izaiah J Ornelas
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA
| | - Patrick M Shih
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94705, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|