1
|
Perez MF. CelEst: a unified gene regulatory network for estimating transcription factor activities in C. elegans. Genetics 2025; 229:iyae189. [PMID: 39705007 PMCID: PMC11912867 DOI: 10.1093/genetics/iyae189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 11/02/2024] [Indexed: 12/21/2024] Open
Abstract
Transcription factors (TFs) play a pivotal role in orchestrating critical intricate patterns of gene regulation. Although gene expression is complex, differential expression of hundreds of genes is often due to regulation by just a handful of TFs. Despite extensive efforts to elucidate TF-target regulatory relationships in Caenorhabditis elegans, existing experimental datasets cover distinct subsets of TFs and leave data integration challenging. Here, I introduce CelEst, a unified gene regulatory network designed to estimate the activity of 487 distinct C. elegans TFs-∼58% of the total-from gene expression data. To integrate data from ChIP-seq, DNA-binding motifs, and eY1H screens, optimal processing of each data type was benchmarked against a set of TF perturbation RNA-seq experiments. Moreover, I showcase how leveraging TF motif conservation in target promoters across genomes of related species can distinguish highly informative interactions, a strategy which can be applied to many model organisms. Integrated analyses of data from commonly studied conditions including heat shock, bacterial infection, and sex differences validates CelEst's performance and highlights overlooked TFs that likely play major roles in coordinating the transcriptional response to these conditions. CelEst can infer TF activity on a standard laptop computer within minutes. Furthermore, an R Shiny app with a step-by-step guide is provided for the community to perform rapid analysis with minimal coding required. I anticipate that widespread adoption of CelEsT will significantly enhance the interpretive power of transcriptomic experiments, both present and retrospective, thereby advancing our understanding of gene regulation in C. elegans and beyond.
Collapse
Affiliation(s)
- Marcos Francisco Perez
- Instituto de Biología Molecular de Barcelona (IBMB), CSIC, Parc Científic de Barcelona, C. Baldiri Reixac, 4-8, 08028 Barcelona, Spain
| |
Collapse
|
2
|
Bell CC, Faulkner GJ, Gilan O. Chromatin-based memory as a self-stabilizing influence on cell identity. Genome Biol 2024; 25:320. [PMID: 39736786 DOI: 10.1186/s13059-024-03461-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Accepted: 12/16/2024] [Indexed: 01/01/2025] Open
Abstract
Cell types are traditionally thought to be specified and stabilized by gene regulatory networks. Here, we explore how chromatin memory contributes to the specification and stabilization of cell states. Through pervasive, local, feedback loops, chromatin memory enables cell states that were initially unstable to become stable. Deeper appreciation of this self-stabilizing role for chromatin broadens our perspective of Waddington's epigenetic landscape from a static surface with islands of stability shaped by evolution, to a plasticine surface molded by experience. With implications for the evolution of cell types, stabilization of resistant states in cancer, and the widespread plasticity of complex life.
Collapse
Affiliation(s)
- Charles C Bell
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD, 4102, Australia.
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD, 4102, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4169, Australia
| | - Omer Gilan
- Australian Centre for Blood Diseases, Monash University, Melbourne, VIC, 3004, Australia
| |
Collapse
|
3
|
McColgan Á, DiFrisco J. Understanding developmental system drift. Development 2024; 151:dev203054. [PMID: 39417684 PMCID: PMC11529278 DOI: 10.1242/dev.203054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Developmental system drift (DSD) occurs when the genetic basis for homologous traits diverges over time despite conservation of the phenotype. In this Review, we examine the key ideas, evidence and open problems arising from studies of DSD. Recent work suggests that DSD may be pervasive, having been detected across a range of different organisms and developmental processes. Although developmental research remains heavily reliant on model organisms, extrapolation of findings to non-model organisms can be error-prone if the lineages have undergone DSD. We suggest how existing data and modelling approaches may be used to detect DSD and estimate its frequency. More direct study of DSD, we propose, can inform null hypotheses for how much genetic divergence to expect on the basis of phylogenetic distance, while also contributing to principles of gene regulatory evolution.
Collapse
Affiliation(s)
- Áine McColgan
- Theoretical Biology Lab, The Francis Crick Institute, London NW1 1AT, UK
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - James DiFrisco
- Theoretical Biology Lab, The Francis Crick Institute, London NW1 1AT, UK
| |
Collapse
|
4
|
Tsuru S, Hatanaka N, Furusawa C. Promoters Constrain Evolution of Expression Levels of Essential Genes in Escherichia coli. Mol Biol Evol 2024; 41:msae185. [PMID: 39219319 PMCID: PMC11406756 DOI: 10.1093/molbev/msae185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 07/31/2024] [Accepted: 08/28/2024] [Indexed: 09/04/2024] Open
Abstract
Variability in expression levels in response to random genomic mutations varies among genes, influencing both the facilitation and constraint of phenotypic evolution in organisms. Despite its importance, both the underlying mechanisms and evolutionary origins of this variability remain largely unknown due to the mixed contributions of cis- and trans-acting elements. To address this issue, we focused on the mutational variability of cis-acting elements, that is, promoter regions, in Escherichia coli. Random mutations were introduced into the natural and synthetic promoters to generate mutant promoter libraries. By comparing the variance in promoter activity of these mutant libraries, we found no significant difference in mutational variability in promoter activity between promoter groups, suggesting the absence of a signature of natural selection for mutational robustness. In contrast, the promoters controlling essential genes exhibited a remarkable bias in mutational variability, with mutants displaying higher activities than the wild types being relatively rare compared to those with lower activities. Our evolutionary simulation on a rugged fitness landscape provided a rationale for this vulnerability. These findings suggest that past selection created nonuniform mutational variability in promoters biased toward lower activities of random mutants, which now constrains the future evolution of downstream essential genes toward higher expression levels.
Collapse
Affiliation(s)
- Saburo Tsuru
- Universal Biology Institute, Graduate School of Science, The University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Naoki Hatanaka
- Universal Biology Institute, Graduate School of Science, The University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Chikara Furusawa
- Universal Biology Institute, Graduate School of Science, The University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
- Department of Physics, Graduate School of Science, The University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
- Center for Biosystems Dynamics Research (BDR), RIKEN, Suita, Osaka 565-0874, Japan
| |
Collapse
|
5
|
Balogun EJ, Ness RW. The Effects of De Novo Mutation on Gene Expression and the Consequences for Fitness in Chlamydomonas reinhardtii. Mol Biol Evol 2024; 41:msae035. [PMID: 38366781 PMCID: PMC10910851 DOI: 10.1093/molbev/msae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 02/01/2024] [Accepted: 02/13/2024] [Indexed: 02/18/2024] Open
Abstract
Mutation is the ultimate source of genetic variation, the bedrock of evolution. Yet, predicting the consequences of new mutations remains a challenge in biology. Gene expression provides a potential link between a genotype and its phenotype. But the variation in gene expression created by de novo mutation and the fitness consequences of mutational changes to expression remain relatively unexplored. Here, we investigate the effects of >2,600 de novo mutations on gene expression across the transcriptome of 28 mutation accumulation lines derived from 2 independent wild-type genotypes of the green algae Chlamydomonas reinhardtii. We observed that the amount of genetic variance in gene expression created by mutation (Vm) was similar to the variance that mutation generates in typical polygenic phenotypic traits and approximately 15-fold the variance seen in the limited species where Vm in gene expression has been estimated. Despite the clear effect of mutation on expression, we did not observe a simple additive effect of mutation on expression change, with no linear correlation between the total expression change and mutation count of individual MA lines. We therefore inferred the distribution of expression effects of new mutations to connect the number of mutations to the number of differentially expressed genes (DEGs). Our inferred DEE is highly L-shaped with 95% of mutations causing 0-1 DEG while the remaining 5% are spread over a long tail of large effect mutations that cause multiple genes to change expression. The distribution is consistent with many cis-acting mutation targets that affect the expression of only 1 gene and a large target of trans-acting targets that have the potential to affect tens or hundreds of genes. Further evidence for cis-acting mutations can be seen in the overabundance of mutations in or near differentially expressed genes. Supporting evidence for trans-acting mutations comes from a 15:1 ratio of DEGs to mutations and the clusters of DEGs in the co-expression network, indicative of shared regulatory architecture. Lastly, we show that there is a negative correlation with the extent of expression divergence from the ancestor and fitness, providing direct evidence of the deleterious effects of perturbing gene expression.
Collapse
Affiliation(s)
- Eniolaye J Balogun
- Department of Biology, William G. Davis Building, University of Toronto, Mississauga L5L-1C6, Canada
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S-3B2, Canada
| | - Rob W Ness
- Department of Biology, William G. Davis Building, University of Toronto, Mississauga L5L-1C6, Canada
| |
Collapse
|
6
|
Xu J, Gao J, Ni P, Gerstein M. Less-is-more: selecting transcription factor binding regions informative for motif inference. Nucleic Acids Res 2024; 52:e20. [PMID: 38214231 PMCID: PMC10899791 DOI: 10.1093/nar/gkad1240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 12/06/2023] [Accepted: 12/17/2023] [Indexed: 01/13/2024] Open
Abstract
Numerous statistical methods have emerged for inferring DNA motifs for transcription factors (TFs) from genomic regions. However, the process of selecting informative regions for motif inference remains understudied. Current approaches select regions with strong ChIP-seq signal for a given TF, assuming that such strong signal primarily results from specific interactions between the TF and its motif. Additionally, these selection approaches do not account for non-target motifs, i.e. motifs of other TFs; they presume the occurrence of these non-target motifs infrequent compared to that of the target motif, and thus assume these have minimal interference with the identification of the target. Leveraging extensive ChIP-seq datasets, we introduced the concept of TF signal 'crowdedness', referred to as C-score, for each genomic region. The C-score helps in highlighting TF signals arising from non-specific interactions. Moreover, by considering the C-score (and adjusting for the length of genomic regions), we can effectively mitigate interference of non-target motifs. Using these tools, we find that in many instances, strong ChIP-seq signal stems mainly from non-specific interactions, and the occurrence of non-target motifs significantly impacts the accurate inference of the target motif. Prioritizing genomic regions with reduced crowdedness and short length markedly improves motif inference. This 'less-is-more' effect suggests that ChIP-seq region selection warrants more attention.
Collapse
Affiliation(s)
- Jinrui Xu
- Department of Biology, Howard University, Washington, DC 20059, USA
- Center for Applied Data Science and Analytics, Howard University, Washington, DC 20059, USA
| | - Jiahao Gao
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Pengyu Ni
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Department of Computer Science, Yale University, New Haven, CT 06520, USA
- Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
7
|
Kang CK, Kim AR. Deep molecular learning of transcriptional control of a synthetic CRE enhancer and its variants. iScience 2024; 27:108747. [PMID: 38222110 PMCID: PMC10784702 DOI: 10.1016/j.isci.2023.108747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 08/29/2023] [Accepted: 12/12/2023] [Indexed: 01/16/2024] Open
Abstract
Massively parallel reporter assay measures transcriptional activities of various cis-regulatory modules (CRMs) in a single experiment. We developed a thermodynamic computational model framework that calculates quantitative levels of gene expression directly from regulatory DNA sequences. Using the framework, we investigated the molecular mechanisms of cis-regulatory mutations of a synthetic enhancer that cause abnormal gene expression. We found that, in a human cell line, competitive binding between family transcription factors (TFs) with slightly different binding preferences significantly increases the accuracy of recapitulating the transcriptional effects of thousands of single- or multi-mutations. We also discovered that even if various harmful mutations occurred in an activator binding site, CRM could stably maintain or even increase gene expression through a certain form of competitive binding between family TFs. These findings enhance understanding the effect of SNPs and indels on CRMs and would help building robust custom-designed CRMs for biologics production and gene therapy.
Collapse
Affiliation(s)
- Chan-Koo Kang
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- School of Applied Artificial Intelligence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| |
Collapse
|
8
|
Ciren D, Zebell S, Lippman ZB. Extreme restructuring of cis -regulatory regions controlling a deeply conserved plant stem cell regulator. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.20.572550. [PMID: 38187729 PMCID: PMC10769289 DOI: 10.1101/2023.12.20.572550] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
A striking paradox is that genes with conserved protein sequence, function and expression pattern over deep time often exhibit extremely divergent cis -regulatory sequences. It remains unclear how such drastic cis -regulatory evolution across species allows preservation of gene function, and to what extent these differences influence how cis- regulatory variation arising within species impacts phenotypic change. Here, we investigated these questions using a plant stem cell regulator conserved in expression pattern and function over ∼125 million years. Using in-vivo genome editing in two distantly related models, Arabidopsis thaliana (Arabidopsis) and Solanum lycopersicum (tomato), we generated over 70 deletion alleles in the upstream and downstream regions of the stem cell repressor gene CLAVATA3 ( CLV3 ) and compared their individual and combined effects on a shared phenotype, the number of carpels that make fruits. We found that sequences upstream of tomato CLV3 are highly sensitive to even small perturbations compared to its downstream region. In contrast, Arabidopsis CLV3 function is tolerant to severe disruptions both upstream and downstream of the coding sequence. Combining upstream and downstream deletions also revealed a different regulatory outcome. Whereas phenotypic enhancement from adding downstream mutations was predominantly weak and additive in tomato, mutating both regions of Arabidopsis CLV3 caused substantial and synergistic effects, demonstrating distinct distribution and redundancy of functional cis -regulatory sequences. Our results demonstrate remarkable malleability in cis -regulatory structural organization of a deeply conserved plant stem cell regulator and suggest that major reconfiguration of cis -regulatory sequence space is a common yet cryptic evolutionary force altering genotype-to-phenotype relationships from regulatory variation in conserved genes. Finally, our findings underscore the need for lineage-specific dissection of the spatial architecture of cis -regulation to effectively engineer trait variation from conserved productivity genes in crops. Author summary We investigated the evolution of cis -regulatory elements (CREs) and their interactions in the regulation of a plant stem cell regulator gene, CLAVATA3 (CLV3) , in Arabidopsis and tomato. Despite diverging ∼125 million years ago, the function and expression of CLV3 is conserved in these species; however, cis -regulatory sequences upstream and downstream have drastically diverged, preventing identification of conserved non-coding sequences between them. We used CRISPR-Cas9 to engineer dozens of mutations within the cis -regulatory regions of Arabidopsis and tomato CLV3. In tomato, our results show that tomato CLV3 function primarily relies on interactions among CREs in the 5' non-coding region, unlike Arabidopsis CLV3 , which depends on a more balanced distribution of functional CREs between the 5' and 3' regions. Therefore, despite a high degree of functional conservation, our study demonstrates divergent regulatory strategies between two distantly related CLV3 orthologs, with substantial alterations in regulatory sequences, their spatial arrangement, and their relative effects on CLV3 regulation. These results suggest that regulatory regions are not only extremely robust to mutagenesis, but also that the sequences underlying this robustness can be lineage-specific for conserved genes, due to the complex and often redundant interactions among CREs that ensure proper gene function amidst large-scale sequence turnover.
Collapse
|
9
|
Khodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L. The evolution and mutational robustness of chromatin accessibility in Drosophila. Genome Biol 2023; 24:232. [PMID: 37845780 PMCID: PMC10578003 DOI: 10.1186/s13059-023-03079-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 09/29/2023] [Indexed: 10/18/2023] Open
Abstract
BACKGROUND The evolution of genomic regulatory regions plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems complicates the understanding of the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different species and tissues of Drosophila. RESULTS We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that our models generalize well across substantially evolutionarily diverged species of insects, implying that the sequence determinants of accessibility are highly conserved. Using our model to examine species-specific gains in accessibility, we find evidence suggesting that these regions may be ancestrally poised for evolution. Using in silico mutagenesis, we show that accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that accessibility is mutationally robust. Subsequently, we show that accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. Conversely, simulations under strong selection demonstrate that accessibility can be extremely malleable despite its robustness. Finally, we identify motifs predictive of accessibility, recovering both novel and previously known motifs. CONCLUSIONS These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks to explore fundamental questions in regulatory genomics and evolution.
Collapse
Affiliation(s)
- Samuel Khodursky
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Eric B Zheng
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Sylvia M Durkin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
- Present Address: Department of Integrative Biology and Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, USA
| | - Sigi Benjamin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Alice Gadau
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Xia Wu
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA.
| |
Collapse
|
10
|
van Duin L, Krautz R, Rennie S, Andersson R. Transcription factor expression is the main determinant of variability in gene co-activity. Mol Syst Biol 2023; 19:e11392. [PMID: 37158788 PMCID: PMC10333863 DOI: 10.15252/msb.202211392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 04/13/2023] [Accepted: 04/17/2023] [Indexed: 05/10/2023] Open
Abstract
Many genes are co-expressed and form genomic domains of coordinated gene activity. However, the regulatory determinants of domain co-activity remain unclear. Here, we leverage human individual variation in gene expression to characterize the co-regulatory processes underlying domain co-activity and systematically quantify their effect sizes. We employ transcriptional decomposition to extract from RNA expression data an expression component related to co-activity revealed by genomic positioning. This strategy reveals close to 1,500 co-activity domains, covering most expressed genes, of which the large majority are invariable across individuals. Focusing specifically on domains with high variability in co-activity reveals that contained genes have a higher sharing of eQTLs, a higher variability in enhancer interactions, and an enrichment of binding by variably expressed transcription factors, compared to genes within non-variable domains. Through careful quantification of the relative contributions of regulatory processes underlying co-activity, we find transcription factor expression levels to be the main determinant of gene co-activity. Our results indicate that distal trans effects contribute more than local genetic variation to individual variation in co-activity domains.
Collapse
Affiliation(s)
- Lucas van Duin
- Section for Computational and RNA Biology, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Robert Krautz
- Section for Computational and RNA Biology, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Sarah Rennie
- Section for Computational and RNA Biology, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Robin Andersson
- Section for Computational and RNA Biology, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| |
Collapse
|
11
|
Wolf S, Melo D, Garske KM, Pallares LF, Lea AJ, Ayroles JF. Characterizing the landscape of gene expression variance in humans. PLoS Genet 2023; 19:e1010833. [PMID: 37410774 DOI: 10.1371/journal.pgen.1010833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 06/15/2023] [Indexed: 07/08/2023] Open
Abstract
Gene expression variance has been linked to organismal function and fitness but remains a commonly neglected aspect of molecular research. As a result, we lack a comprehensive understanding of the patterns of transcriptional variance across genes, and how this variance is linked to context-specific gene regulation and gene function. Here, we use 57 large publicly available RNA-seq data sets to investigate the landscape of gene expression variance. These studies cover a wide range of tissues and allowed us to assess if there are consistently more or less variable genes across tissues and data sets and what mechanisms drive these patterns. We show that gene expression variance is broadly similar across tissues and studies, indicating that the pattern of transcriptional variance is consistent. We use this similarity to create both global and within-tissue rankings of variation, which we use to show that function, sequence variation, and gene regulatory signatures contribute to gene expression variance. Low-variance genes are associated with fundamental cell processes and have lower levels of genetic polymorphisms, have higher gene-gene connectivity, and tend to be associated with chromatin states associated with transcription. In contrast, high-variance genes are enriched for genes involved in immune response, environmentally responsive genes, immediate early genes, and are associated with higher levels of polymorphisms. These results show that the pattern of transcriptional variance is not noise. Instead, it is a consistent gene trait that seems to be functionally constrained in human populations. Furthermore, this commonly neglected aspect of molecular phenotypic variation harbors important information to understand complex traits and disease.
Collapse
Affiliation(s)
- Scott Wolf
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Diogo Melo
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Kristina M Garske
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Luisa F Pallares
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
- Friedrich Miescher Laboratory of the Max Planck Society, Tübingen, Germany
| | - Amanda J Lea
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Child and Brain Development, Canadian Institute for Advanced Research, Toronto, Canada
| | - Julien F Ayroles
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
12
|
Khodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L. The evolution and mutational robustness of chromatin accessibility in Drosophila. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.26.546587. [PMID: 37425760 PMCID: PMC10327059 DOI: 10.1101/2023.06.26.546587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
The evolution of regulatory regions in the genome plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems has made it difficult to understand the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different tissues of Drosophila. We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that a model trained in one species has nearly identical performance when tested in another species, implying that the sequence determinants of accessibility are highly conserved. Indeed, model performance remains excellent even in distantly-related species. By using our model to examine species-specific gains in chromatin accessibility, we find that their orthologous inaccessible regions in other species have surprisingly similar model outputs, suggesting that these regions may be ancestrally poised for evolution. We then use in silico saturation mutagenesis to reveal evidence of selective constraint acting specifically on inaccessible chromatin regions. We further show that chromatin accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that chromatin accessibility is mutationally robust. Subsequently, we demonstrate that chromatin accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. We also perform in silico evolution experiments under the regime of strong selection and weak mutation (SSWM) and show that chromatin accessibility can be extremely malleable despite its mutational robustness. However, selection acting in different directions in a tissue-specific manner can substantially slow adaptation. Finally, we identify motifs predictive of chromatin accessibility and recover motifs corresponding to known chromatin accessibility activators and repressors. These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks as tools to answer fundamental questions in regulatory genomics and evolution.
Collapse
Affiliation(s)
- Samuel Khodursky
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- These authors contributed equally
| | - Eric B Zheng
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- These authors contributed equally
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Sylvia M Durkin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- Current Address: Department of Integrative Biology and Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, USA
| | - Sigi Benjamin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Alice Gadau
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Xia Wu
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| |
Collapse
|
13
|
Baier F, Gauye F, Perez-Carrasco R, Payne JL, Schaerli Y. Environment-dependent epistasis increases phenotypic diversity in gene regulatory networks. SCIENCE ADVANCES 2023; 9:eadf1773. [PMID: 37224262 DOI: 10.1126/sciadv.adf1773] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 04/17/2023] [Indexed: 05/26/2023]
Abstract
Mutations to gene regulatory networks can be maladaptive or a source of evolutionary novelty. Epistasis confounds our understanding of how mutations affect the expression patterns of gene regulatory networks, a challenge exacerbated by the dependence of epistasis on the environment. We used the toolkit of synthetic biology to systematically assay the effects of pairwise and triplet combinations of mutant genotypes on the expression pattern of a gene regulatory network expressed in Escherichia coli that interprets an inducer gradient across a spatial domain. We uncovered a preponderance of epistasis that can switch in magnitude and sign across the inducer gradient to produce a greater diversity of expression pattern phenotypes than would be possible in the absence of such environment-dependent epistasis. We discuss our findings in the context of the evolution of hybrid incompatibilities and evolutionary novelties.
Collapse
Affiliation(s)
- Florian Baier
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015 Lausanne, Switzerland
| | - Florence Gauye
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015 Lausanne, Switzerland
| | | | - Joshua L Payne
- Institute of Integrative Biology, ETH Zurich, 8092 Zurich, Switzerland
| | - Yolanda Schaerli
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015 Lausanne, Switzerland
| |
Collapse
|
14
|
Li XC, Fuqua T, van Breugel ME, Crocker J. Mutational scans reveal differential evolvability of Drosophila promoters and enhancers. Philos Trans R Soc Lond B Biol Sci 2023; 378:20220054. [PMID: 37004721 PMCID: PMC10067265 DOI: 10.1098/rstb.2022.0054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023] Open
Abstract
Rapid enhancer and slow promoter evolution have been demonstrated through comparative genomics. However, it is not clear how this information is encoded genetically and if this can be used to place evolution in a predictive context. Part of the challenge is that our understanding of the potential for regulatory evolution is biased primarily toward natural variation or limited experimental perturbations. Here, to explore the evolutionary capacity of promoter variation, we surveyed an unbiased mutation library for three promoters in Drosophila melanogaster. We found that mutations in promoters had limited to no effect on spatial patterns of gene expression. Compared to developmental enhancers, promoters are more robust to mutations and have more access to mutations that can increase gene expression, suggesting that their low activity might be a result of selection. Consistent with these observations, increasing the promoter activity at the endogenous locus of shavenbaby led to increased transcription yet limited phenotypic changes. Taken together, developmental promoters may encode robust transcriptional outputs allowing evolvability through the integration of diverse developmental enhancers. This article is part of the theme issue ‘Interdisciplinary approaches to predicting evolutionary biology’.
Collapse
Affiliation(s)
- Xueying C. Li
- European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg 69117, Germany
| | - Timothy Fuqua
- European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg 69117, Germany
| | | | - Justin Crocker
- European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg 69117, Germany
| |
Collapse
|
15
|
Santos-Moreno J, Tasiudi E, Kusumawardhani H, Stelling J, Schaerli Y. Robustness and innovation in synthetic genotype networks. Nat Commun 2023; 14:2454. [PMID: 37117168 PMCID: PMC10147661 DOI: 10.1038/s41467-023-38033-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 04/13/2023] [Indexed: 04/30/2023] Open
Abstract
Genotype networks are sets of genotypes connected by small mutational changes that share the same phenotype. They facilitate evolutionary innovation by enabling the exploration of different neighborhoods in genotype space. Genotype networks, first suggested by theoretical models, have been empirically confirmed for proteins and RNAs. Comparative studies also support their existence for gene regulatory networks (GRNs), but direct experimental evidence is lacking. Here, we report the construction of three interconnected genotype networks of synthetic GRNs producing three distinct phenotypes in Escherichia coli. Our synthetic GRNs contain three nodes regulating each other by CRISPR interference and governing the expression of fluorescent reporters. The genotype networks, composed of over twenty different synthetic GRNs, provide robustness in face of mutations while enabling transitions to innovative phenotypes. Through realistic mathematical modeling, we quantify robustness and evolvability for the complete genotype-phenotype map and link these features mechanistically to GRN motifs. Our work thereby exemplifies how GRN evolution along genotype networks might be driving evolutionary innovation.
Collapse
Affiliation(s)
- Javier Santos-Moreno
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
- Department of Medicine and Life Sciences, Pompeu Fabra University, 00803, Barcelona, Spain
| | - Eve Tasiudi
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Hadiastri Kusumawardhani
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
| | - Joerg Stelling
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Yolanda Schaerli
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland.
| |
Collapse
|
16
|
Rozowsky J, Gao J, Borsari B, Yang YT, Galeev T, Gürsoy G, Epstein CB, Xiong K, Xu J, Li T, Liu J, Yu K, Berthel A, Chen Z, Navarro F, Sun MS, Wright J, Chang J, Cameron CJF, Shoresh N, Gaskell E, Drenkow J, Adrian J, Aganezov S, Aguet F, Balderrama-Gutierrez G, Banskota S, Corona GB, Chee S, Chhetri SB, Cortez Martins GC, Danyko C, Davis CA, Farid D, Farrell NP, Gabdank I, Gofin Y, Gorkin DU, Gu M, Hecht V, Hitz BC, Issner R, Jiang Y, Kirsche M, Kong X, Lam BR, Li S, Li B, Li X, Lin KZ, Luo R, Mackiewicz M, Meng R, Moore JE, Mudge J, Nelson N, Nusbaum C, Popov I, Pratt HE, Qiu Y, Ramakrishnan S, Raymond J, Salichos L, Scavelli A, Schreiber JM, Sedlazeck FJ, See LH, Sherman RM, Shi X, Shi M, Sloan CA, Strattan JS, Tan Z, Tanaka FY, Vlasova A, Wang J, Werner J, Williams B, Xu M, Yan C, Yu L, Zaleski C, Zhang J, Ardlie K, Cherry JM, Mendenhall EM, Noble WS, Weng Z, Levine ME, Dobin A, Wold B, Mortazavi A, Ren B, Gillis J, Myers RM, Snyder MP, Choudhary J, Milosavljevic A, Schatz MC, Bernstein BE, et alRozowsky J, Gao J, Borsari B, Yang YT, Galeev T, Gürsoy G, Epstein CB, Xiong K, Xu J, Li T, Liu J, Yu K, Berthel A, Chen Z, Navarro F, Sun MS, Wright J, Chang J, Cameron CJF, Shoresh N, Gaskell E, Drenkow J, Adrian J, Aganezov S, Aguet F, Balderrama-Gutierrez G, Banskota S, Corona GB, Chee S, Chhetri SB, Cortez Martins GC, Danyko C, Davis CA, Farid D, Farrell NP, Gabdank I, Gofin Y, Gorkin DU, Gu M, Hecht V, Hitz BC, Issner R, Jiang Y, Kirsche M, Kong X, Lam BR, Li S, Li B, Li X, Lin KZ, Luo R, Mackiewicz M, Meng R, Moore JE, Mudge J, Nelson N, Nusbaum C, Popov I, Pratt HE, Qiu Y, Ramakrishnan S, Raymond J, Salichos L, Scavelli A, Schreiber JM, Sedlazeck FJ, See LH, Sherman RM, Shi X, Shi M, Sloan CA, Strattan JS, Tan Z, Tanaka FY, Vlasova A, Wang J, Werner J, Williams B, Xu M, Yan C, Yu L, Zaleski C, Zhang J, Ardlie K, Cherry JM, Mendenhall EM, Noble WS, Weng Z, Levine ME, Dobin A, Wold B, Mortazavi A, Ren B, Gillis J, Myers RM, Snyder MP, Choudhary J, Milosavljevic A, Schatz MC, Bernstein BE, Guigó R, Gingeras TR, Gerstein M. The EN-TEx resource of multi-tissue personal epigenomes & variant-impact models. Cell 2023; 186:1493-1511.e40. [PMID: 37001506 PMCID: PMC10074325 DOI: 10.1016/j.cell.2023.02.018] [Show More Authors] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 10/16/2022] [Accepted: 02/10/2023] [Indexed: 04/03/2023]
Abstract
Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × ∼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.
Collapse
Affiliation(s)
- Joel Rozowsky
- Section on Biomedical Informatics and Data Science, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jiahao Gao
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Beatrice Borsari
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Yucheng T Yang
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Timur Galeev
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Gamze Gürsoy
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Kun Xiong
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jinrui Xu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Tianxiao Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jason Liu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Keyang Yu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Ana Berthel
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Zhanlin Chen
- Department of Statistics and Data Science, Yale University, New Haven, CT, USA
| | - Fabio Navarro
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Maxwell S Sun
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Justin Chang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Christopher J F Cameron
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Noam Shoresh
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jessika Adrian
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Sergey Aganezov
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | | | | | | | | | - Sora Chee
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Surya B Chhetri
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Gabriel Conte Cortez Martins
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Cassidy Danyko
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Carrie A Davis
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Daniel Farid
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | | | - Idan Gabdank
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Yoel Gofin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - David U Gorkin
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Mengting Gu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Vivian Hecht
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin C Hitz
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Robbyn Issner
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yunzhe Jiang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Melanie Kirsche
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Xiangmeng Kong
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Bonita R Lam
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Shantao Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Bian Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Xiqi Li
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Khine Zin Lin
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Hong Kong, CHN
| | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Ran Meng
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jill E Moore
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Jonathan Mudge
- European Bioinformatics Institute, Cambridge, Cambridgeshire, GB
| | | | - Chad Nusbaum
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ioann Popov
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Henry E Pratt
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Yunjiang Qiu
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Srividya Ramakrishnan
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Joe Raymond
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Leonidas Salichos
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Department of Biological and Chemical Sciences, New York Institute of Technology, Old Westbury, NY, USA
| | - Alexandra Scavelli
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jacob M Schreiber
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Fritz J Sedlazeck
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Lei Hoon See
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Rachel M Sherman
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Xu Shi
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Minyi Shi
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Cricket Alicia Sloan
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - J Seth Strattan
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Zhen Tan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Forrest Y Tanaka
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Anna Vlasova
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain; Comparative Genomics Group, Life Science Programme, Barcelona Supercomputing Centre, Barcelona, Spain; Institute of Research in Biomedicine, Barcelona, Spain
| | - Jun Wang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Jonathan Werner
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Brian Williams
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Min Xu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Chengfei Yan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Lu Yu
- Institute of Cancer Research, London, UK
| | - Christopher Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jing Zhang
- Department of Computer Science, University of California, Irvine, Irvine, CA, USA
| | | | - J Michael Cherry
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | | | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Morgan E Levine
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Alexander Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Barbara Wold
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
| | - Jesse Gillis
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Department of Physiology, University of Toronto, Toronto, ON, Canada
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Michael P Snyder
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | | | | | - Michael C Schatz
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | - Bradley E Bernstein
- Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Roderic Guigó
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain; Universitat Pompeu Fabra, Barcelona, Catalonia, Spain.
| | - Thomas R Gingeras
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | - Mark Gerstein
- Section on Biomedical Informatics and Data Science, Yale University, New Haven, CT, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA; Department of Statistics and Data Science, Yale University, New Haven, CT, USA; Department of Computer Science, Yale University, New Haven, CT, USA.
| |
Collapse
|
17
|
Arcuschin CD, Pinkasz M, Schor IE. Mechanisms of robustness in gene regulatory networks involved in neural development. Front Mol Neurosci 2023; 16:1114015. [PMID: 36814969 PMCID: PMC9940843 DOI: 10.3389/fnmol.2023.1114015] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 01/16/2023] [Indexed: 02/08/2023] Open
Abstract
The functions of living organisms are affected by different kinds of perturbation, both internal and external, which in many cases have functional effects and phenotypic impact. The effects of these perturbations become particularly relevant for multicellular organisms with complex body patterns and cell type heterogeneity, where transcriptional programs controlled by gene regulatory networks determine, for example, the cell fate during embryonic development. Therefore, an essential aspect of development in these organisms is the ability to maintain the functionality of their genetic developmental programs even in the presence of genetic variation, changing environmental conditions and biochemical noise, a property commonly termed robustness. We discuss the implication of different molecular mechanisms of robustness involved in neurodevelopment, which is characterized by the interplay of many developmental programs at a molecular, cellular and systemic level. We specifically focus on processes affecting the function of gene regulatory networks, encompassing transcriptional regulatory elements and post-transcriptional processes such as miRNA-based regulation, but also higher order regulatory organization, such as gene network topology. We also present cases where impairment of robustness mechanisms can be associated with neurodevelopmental disorders, as well as reasons why understanding these mechanisms should represent an important part of the study of gene regulatory networks driving neural development.
Collapse
Affiliation(s)
- Camila D. Arcuschin
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Marina Pinkasz
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ignacio E. Schor
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
18
|
Schlosser G. Rebuilding ships while at sea-Character individuality, homology, and evolutionary innovation. J Morphol 2023; 284:e21522. [PMID: 36282954 PMCID: PMC10100095 DOI: 10.1002/jmor.21522] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 09/15/2022] [Accepted: 10/15/2022] [Indexed: 11/05/2022]
Abstract
How novel traits originate in evolution is still one of the most perplexing questions in Evolutionary Biology. Building on a previous account of evolutionary innovation, I here propose that evolutionary novelties are those individualized characters that are not homologous to any characters in the ancestor. To clarify this definition, I here provide a detailed analysis of the concepts of "character individuality" and "homology" first, before addressing their role for our understanding of evolutionary innovation. I will argue (1) that functional as well as structural considerations are important for character individualization; and (2) that compositional (structural) and positional homology need to be clearly distinguished to properly describe the evolutionary transformations of hierarchically structured characters. My account will therefore integrate functional and structural perspectives and put forward a new multi-level view of character identity and transformation.
Collapse
Affiliation(s)
- Gerhard Schlosser
- School of Biological and Chemical Sciences, University of Galway, Galway, Ireland
| |
Collapse
|
19
|
Long E, Yin J, Funderburk KM, Xu M, Feng J, Kane A, Zhang T, Myers T, Golden A, Thakur R, Kong H, Jessop L, Kim EY, Jones K, Chari R, Machiela MJ, Yu K, Iles MM, Landi MT, Law MH, Chanock SJ, Brown KM, Choi J. Massively parallel reporter assays and variant scoring identified functional variants and target genes for melanoma loci and highlighted cell-type specificity. Am J Hum Genet 2022; 109:2210-2229. [PMID: 36423637 PMCID: PMC9748337 DOI: 10.1016/j.ajhg.2022.11.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 11/02/2022] [Indexed: 11/24/2022] Open
Abstract
The most recent genome-wide association study (GWAS) of cutaneous melanoma identified 54 risk-associated loci, but functional variants and their target genes for most have not been established. Here, we performed massively parallel reporter assays (MPRAs) by using malignant melanoma and normal melanocyte cells and further integrated multi-layer annotation to systematically prioritize functional variants and susceptibility genes from these GWAS loci. Of 1,992 risk-associated variants tested in MPRAs, we identified 285 from 42 loci (78% of the known loci) displaying significant allelic transcriptional activities in either cell type (FDR < 1%). We further characterized MPRA-significant variants by motif prediction, epigenomic annotation, and statistical/functional fine-mapping to create integrative variant scores, which prioritized one to six plausible candidate variants per locus for the 42 loci and nominated a single variant for 43% of these loci. Overlaying the MPRA-significant variants with genome-wide significant expression or methylation quantitative trait loci (eQTLs or meQTLs, respectively) from melanocytes or melanomas identified candidate susceptibility genes for 60% of variants (172 of 285 variants). CRISPRi of top-scoring variants validated their cis-regulatory effect on the eQTL target genes, MAFF (22q13.1) and GPRC5A (12p13.1). Finally, we identified 36 melanoma-specific and 45 melanocyte-specific MPRA-significant variants, a subset of which are linked to cell-type-specific target genes. Analyses of transcription factor availability in MPRA datasets and variant-transcription-factor interaction in eQTL datasets highlighted the roles of transcription factors in cell-type-specific variant functionality. In conclusion, MPRAs along with variant scoring effectively prioritized plausible candidates for most melanoma GWAS loci and highlighted cellular contexts where the susceptibility variants are functional.
Collapse
Affiliation(s)
- Erping Long
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jinhu Yin
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Karen M Funderburk
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Mai Xu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - James Feng
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Alexander Kane
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Tongwu Zhang
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Timothy Myers
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Alyxandra Golden
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Rohit Thakur
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Hyunkyung Kong
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Lea Jessop
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Eun Young Kim
- Department of Internal Medicine, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Kristine Jones
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Raj Chari
- Genome Modification Core, Frederick National Lab for Cancer Research, National Cancer Institute, Frederick, MD, USA
| | - Mitchell J Machiela
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Kai Yu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Mark M Iles
- Leeds Institute for Data Analytics, School of Medicine, University of Leeds, Leeds LS2 9NL, UK
| | - Maria Teresa Landi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Matthew H Law
- Statistical Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia; School of Biomedical Sciences, University of Queensland, Brisbane, QLD, Australia
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Kevin M Brown
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jiyeon Choi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
| |
Collapse
|
20
|
Einarsson H, Salvatore M, Vaagensø C, Alcaraz N, Bornholdt J, Rennie S, Andersson R. Promoter sequence and architecture determine expression variability and confer robustness to genetic variants. eLife 2022; 11:e80943. [PMID: 36377861 PMCID: PMC9844987 DOI: 10.7554/elife.80943] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 11/14/2022] [Indexed: 11/16/2022] Open
Abstract
Genetic and environmental exposures cause variability in gene expression. Although most genes are affected in a population, their effect sizes vary greatly, indicating the existence of regulatory mechanisms that could amplify or attenuate expression variability. Here, we investigate the relationship between the sequence and transcription start site architectures of promoters and their expression variability across human individuals. We find that expression variability can be largely explained by a promoter's DNA sequence and its binding sites for specific transcription factors. We show that promoter expression variability reflects the biological process of a gene, demonstrating a selective trade-off between stability for metabolic genes and plasticity for responsive genes and those involved in signaling. Promoters with a rigid transcription start site architecture are more prone to have variable expression and to be associated with genetic variants with large effect sizes, while a flexible usage of transcription start sites within a promoter attenuates expression variability and limits genotypic effects. Our work provides insights into the variable nature of responsive genes and reveals a novel mechanism for supplying transcriptional and mutational robustness to essential genes through multiple transcription start site regions within a promoter.
Collapse
Affiliation(s)
| | - Marco Salvatore
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | | | - Nicolas Alcaraz
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Jette Bornholdt
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Sarah Rennie
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Robin Andersson
- Department of Biology, University of CopenhagenCopenhagenDenmark
| |
Collapse
|
21
|
Shibai A, Kotani H, Sakata N, Furusawa C, Tsuru S. Purifying selection enduringly acts on the sequence evolution of highly expressed proteins in Escherichia coli. G3 GENES|GENOMES|GENETICS 2022; 12:6694045. [PMID: 36073932 PMCID: PMC9635659 DOI: 10.1093/g3journal/jkac235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 08/27/2022] [Indexed: 11/17/2022]
Abstract
The evolutionary speed of a protein sequence is constrained by its expression level, with highly expressed proteins evolving relatively slowly. This negative correlation between expression levels and evolutionary rates (known as the E–R anticorrelation) has already been widely observed in past macroevolution between species from bacteria to animals. However, it remains unclear whether this seemingly general law also governs recent evolution, including past and de novo, within a species. However, the advent of genomic sequencing and high-throughput phenotyping, particularly for bacteria, has revealed fundamental gaps between the 2 evolutionary processes and has provided empirical data opposing the possible underlying mechanisms which are widely believed. These conflicts raise questions about the generalization of the E–R anticorrelation and the relevance of plausible mechanisms. To explore the ubiquitous impact of expression levels on molecular evolution and test the relevance of the possible underlying mechanisms, we analyzed the genome sequences of 99 strains of Escherichia coli for evolution within species in nature. We also analyzed genomic mutations accumulated under laboratory conditions as a model of de novo evolution within species. Here, we show that E–R anticorrelation is significant in both past and de novo evolution within species in E. coli. Our data also confirmed ongoing purifying selection on highly expressed genes. Ongoing selection included codon-level purifying selection, supporting the relevance of the underlying mechanisms. However, the impact of codon-level purifying selection on the constraints in evolution within species might be smaller than previously expected from evolution between species.
Collapse
Affiliation(s)
- Atsushi Shibai
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Hazuki Kotani
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Natsue Sakata
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Chikara Furusawa
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| | - Saburo Tsuru
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| |
Collapse
|
22
|
Controlling gene expression with deep generative design of regulatory DNA. Nat Commun 2022; 13:5099. [PMID: 36042233 PMCID: PMC9427793 DOI: 10.1038/s41467-022-32818-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 08/18/2022] [Indexed: 11/25/2022] Open
Abstract
Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue. Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Here the authors present EspressionGAN, a generative adversarial network that uses genomic and transcriptomic data to generate regulatory sequences.
Collapse
|
23
|
Rotrattanadumrong R, Yokobayashi Y. Experimental exploration of a ribozyme neutral network using evolutionary algorithm and deep learning. Nat Commun 2022; 13:4847. [PMID: 35977956 PMCID: PMC9385714 DOI: 10.1038/s41467-022-32538-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 08/03/2022] [Indexed: 11/18/2022] Open
Abstract
A neutral network connects all genotypes with equivalent phenotypes in a fitness landscape and plays an important role in the mutational robustness and evolvability of biomolecules. In contrast to earlier theoretical works, evidence of large neutral networks has been lacking in recent experimental studies of fitness landscapes. This suggests that evolution could be constrained globally. Here, we demonstrate that a deep learning-guided evolutionary algorithm can efficiently identify neutral genotypes within the sequence space of an RNA ligase ribozyme. Furthermore, we measure the activities of all 216 variants connecting two active ribozymes that differ by 16 mutations and analyze mutational interactions (epistasis) up to the 16th order. We discover an extensive network of neutral paths linking the two genotypes and reveal that these paths might be predicted using only information from lower-order interactions. Our experimental evaluation of over 120,000 ribozyme sequences provides important empirical evidence that neutral networks can increase the accessibility and predictability of the fitness landscape. Neutral networks, which are sets of genotypes connected via single mutations that share the same phenotype, are important for evolvability. Here, the authors provide experimental evidence of a neutral network in an RNA enzyme using a high-throughput assay and deep learning.
Collapse
Affiliation(s)
- Rachapun Rotrattanadumrong
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 9040495, Japan
| | - Yohei Yokobayashi
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 9040495, Japan.
| |
Collapse
|
24
|
Kingdom R, Wright CF. Incomplete Penetrance and Variable Expressivity: From Clinical Studies to Population Cohorts. Front Genet 2022; 13:920390. [PMID: 35983412 PMCID: PMC9380816 DOI: 10.3389/fgene.2022.920390] [Citation(s) in RCA: 107] [Impact Index Per Article: 35.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 06/09/2022] [Indexed: 12/20/2022] Open
Abstract
The same genetic variant found in different individuals can cause a range of diverse phenotypes, from no discernible clinical phenotype to severe disease, even among related individuals. Such variants can be said to display incomplete penetrance, a binary phenomenon where the genotype either causes the expected clinical phenotype or it does not, or they can be said to display variable expressivity, in which the same genotype can cause a wide range of clinical symptoms across a spectrum. Both incomplete penetrance and variable expressivity are thought to be caused by a range of factors, including common variants, variants in regulatory regions, epigenetics, environmental factors, and lifestyle. Many thousands of genetic variants have been identified as the cause of monogenic disorders, mostly determined through small clinical studies, and thus, the penetrance and expressivity of these variants may be overestimated when compared to their effect on the general population. With the wealth of population cohort data currently available, the penetrance and expressivity of such genetic variants can be investigated across a much wider contingent, potentially helping to reclassify variants that were previously thought to be completely penetrant. Research into the penetrance and expressivity of such genetic variants is important for clinical classification, both for determining causative mechanisms of disease in the affected population and for providing accurate risk information through genetic counseling. A genotype-based definition of the causes of rare diseases incorporating information from population cohorts and clinical studies is critical for our understanding of incomplete penetrance and variable expressivity. This review examines our current knowledge of the penetrance and expressivity of genetic variants in rare disease and across populations, as well as looking into the potential causes of the variation seen, including genetic modifiers, mosaicism, and polygenic factors, among others. We also considered the challenges that come with investigating penetrance and expressivity.
Collapse
Affiliation(s)
| | - Caroline F. Wright
- Institute of Biomedical and Clinical Science, Royal Devon & Exeter Hospital, University of Exeter Medical School, Exeter, United Kingdom
| |
Collapse
|
25
|
Romani F, Flores JR, Tolopka JI, Suárez G, He X, Moreno JE. Liverwort oil bodies: diversity, biochemistry, and molecular cell biology of the earliest secretory structure of land plants. JOURNAL OF EXPERIMENTAL BOTANY 2022; 73:4427-4439. [PMID: 35394035 DOI: 10.1093/jxb/erac134] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 04/01/2022] [Indexed: 05/27/2023]
Abstract
Liverworts are known for their large chemical diversity. Much of this diversity is synthesized and enclosed within oil bodies (OBs), a synapomorphy of the lineage. OBs contain the enzymes to biosynthesize and store large quantities of sesquiterpenoids and other compounds while limiting their cytotoxicity. Recent important biochemical and molecular discoveries related to OB formation, diversity, and biochemistry allow comparison with other secretory structures of land plants from an evo-devo perspective. This review addresses and discusses the most recent advances in OB origin, development, and function towards understanding the importance of these organelles in liverwort physiology and adaptation to changing environments. Our mapping of OB types and chemical compounds to the current liverwort phylogeny suggests that OBs were present in the most recent common ancestor of liverworts, supporting that OBs evolved as the first secretory structures in land plants. Yet, we require better sampling to define the macroevolutionary pattern governing the ancestral type of OB. We conclude that current efforts to find molecular mechanisms responsible for the morphological and chemical diversity of secretory structures will help understand the evolution of each major group of land plants, and open new avenues in biochemical research on bioactive compounds in bryophytes and vascular plants.
Collapse
Affiliation(s)
- Facundo Romani
- Department of Plant Sciences, University of Cambridge, Cambridge, UK
| | - Jorge R Flores
- Botany Unit, Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
| | - Juan Ignacio Tolopka
- Instituto de Agrobiotecnología del Litoral, Universidad Nacional del Litoral - CONICET, Facultad de Bioquímica y Ciencias Biológicas, Centro Científico Tecnológico CONICET Santa Fe, Colectora Ruta Nacional No. 168 km. 0, Paraje El Pozo, Santa Fe 3000, Argentina
| | - Guillermo Suárez
- Unidad Ejecutora Lillo (CONICET - Fundación Miguel Lillo), Miguel Lillo 251, San Miguel de Tucumán, Tucumán, 4000, Argentina
- Facultad de Ciencias Naturales, Instituto Miguel Lillo, Universidad Nacional de Tucumán, Miguel Lillo 205, San Miguel de Tucumán, Tucumán, 4000, Argentina
| | - Xiaolan He
- Botany Unit, Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
| | - Javier E Moreno
- Instituto de Agrobiotecnología del Litoral, Universidad Nacional del Litoral - CONICET, Facultad de Bioquímica y Ciencias Biológicas, Centro Científico Tecnológico CONICET Santa Fe, Colectora Ruta Nacional No. 168 km. 0, Paraje El Pozo, Santa Fe 3000, Argentina
| |
Collapse
|
26
|
Gera T, Jonas F, More R, Barkai N. Evolution of binding preferences among whole-genome duplicated transcription factors. eLife 2022; 11:73225. [PMID: 35404235 PMCID: PMC9000951 DOI: 10.7554/elife.73225] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Accepted: 01/20/2022] [Indexed: 01/10/2023] Open
Abstract
Throughout evolution, new transcription factors (TFs) emerge by gene duplication, promoting growth and rewiring of transcriptional networks. How TF duplicates diverge was studied in a few cases only. To provide a genome-scale view, we considered the set of budding yeast TFs classified as whole-genome duplication (WGD)-retained paralogs (~35% of all specific TFs). Using high-resolution profiling, we find that ~60% of paralogs evolved differential binding preferences. We show that this divergence results primarily from variations outside the DNA-binding domains (DBDs), while DBD preferences remain largely conserved. Analysis of non-WGD orthologs revealed uneven splitting of ancestral preferences between duplicates, and the preferential acquiring of new targets by the least conserved paralog (biased neo/sub-functionalization). Interactions between paralogs were rare, and, when present, occurred through weak competition for DNA-binding or dependency between dimer-forming paralogs. We discuss the implications of our findings for the evolutionary design of transcriptional networks.
Collapse
Affiliation(s)
- Tamar Gera
- Department of Molecular Genetics, Weizmann Institute of Science
| | - Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science
| | - Roye More
- Department of Molecular Genetics, Weizmann Institute of Science
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science
| |
Collapse
|
27
|
Lynch TR, Xue M, Czerniak CW, Lee C, Kimble J. Notch-dependent DNA cis-regulatory elements and their dose-dependent control of C. elegans stem cell self-renewal. Development 2022; 149:dev200332. [PMID: 35394007 PMCID: PMC9058496 DOI: 10.1242/dev.200332] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 02/16/2022] [Indexed: 11/20/2022]
Abstract
A long-standing biological question is how DNA cis-regulatory elements shape transcriptional patterns during metazoan development. Reporter constructs, cell culture assays and computational modeling have made major contributions to answering this question, but analysis of elements in their natural context is an important complement. Here, we mutate Notch-dependent LAG-1 binding sites (LBSs) in the endogenous Caenorhabditis elegans sygl-1 gene, which encodes a key stem cell regulator, and analyze the consequences on sygl-1 expression (nascent transcripts, mRNA, protein) and stem cell maintenance. Mutation of one LBS in a three-element cluster approximately halved both expression and stem cell pool size, whereas mutation of two LBSs essentially abolished them. Heterozygous LBS mutant clusters provided intermediate values. Our results lead to two major conclusions. First, both LBS number and configuration impact cluster activity: LBSs act additively in trans and synergistically in cis. Second, the SYGL-1 gradient promotes self-renewal above its functional threshold and triggers differentiation below the threshold. Our approach of coupling CRISPR/Cas9 LBS mutations with effects on both molecular and biological readouts establishes a powerful model for in vivo analyses of DNA cis-regulatory elements.
Collapse
Affiliation(s)
- Tina R. Lynch
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Integrated Program in Biochemistry, Madison, WI 53706, USA
| | - Mingyu Xue
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Department of Life Sciences, Imperial College London, South Kensington, London, SW7 2AZ, UK
| | - Cazza W. Czerniak
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Joint Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - ChangHwan Lee
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Department of Biological Sciences, University at Albany, State University of New York, Albany, NY 12222, USA
| | - Judith Kimble
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Integrated Program in Biochemistry, Madison, WI 53706, USA
| |
Collapse
|
28
|
The evolution, evolvability and engineering of gene regulatory DNA. Nature 2022; 603:455-463. [PMID: 35264797 DOI: 10.1038/s41586-022-04506-6] [Citation(s) in RCA: 119] [Impact Index Per Article: 39.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 02/02/2022] [Indexed: 11/08/2022]
Abstract
Mutations in non-coding regulatory DNA sequences can alter gene expression, organismal phenotype and fitness1-3. Constructing complete fitness landscapes, in which DNA sequences are mapped to fitness, is a long-standing goal in biology, but has remained elusive because it is challenging to generalize reliably to vast sequence spaces4-6. Here we build sequence-to-expression models that capture fitness landscapes and use them to decipher principles of regulatory evolution. Using millions of randomly sampled promoter DNA sequences and their measured expression levels in the yeast Saccharomyces cerevisiae, we learn deep neural network models that generalize with excellent prediction performance, and enable sequence design for expression engineering. Using our models, we study expression divergence under genetic drift and strong-selection weak-mutation regimes to find that regulatory evolution is rapid and subject to diminishing returns epistasis; that conflicting expression objectives in different environments constrain expression adaptation; and that stabilizing selection on gene expression leads to the moderation of regulatory complexity. We present an approach for using such models to detect signatures of selection on expression from natural variation in regulatory sequences and use it to discover an instance of convergent regulatory evolution. We assess mutational robustness, finding that regulatory mutation effect sizes follow a power law, characterize regulatory evolvability, visualize promoter fitness landscapes, discover evolvability archetypes and illustrate the mutational robustness of natural regulatory sequence populations. Our work provides a general framework for designing regulatory sequences and addressing fundamental questions in regulatory evolution.
Collapse
|
29
|
Abstract
Tumour formation involves random mutagenic events and positive evolutionary selection acting on a subset of such events, referred to as driver mutations. A decade of careful surveying of tumour DNA using exome-based analyses has revealed a multitude of protein-coding somatic driver mutations, some of which are clinically actionable. Today, a transition towards whole-genome analysis is well under way, technically enabling the discovery of potential driver mutations occurring outside protein-coding sequences. Mutations are abundant in this vast non-coding space, which is more than 50 times larger than the coding exome, but reliable identification of selection signals in non-coding DNA remains a challenge. In this Review, we discuss recent findings in the field, where the emerging landscape is one in which non-coding driver mutations appear to be relatively infrequent. Nevertheless, we highlight several notable discoveries. We consider possible reasons for the relative absence of non-coding driver events, as well as the difficulties associated with detecting signals of positive selection in non-coding DNA.
Collapse
Affiliation(s)
- Kerryn Elliott
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden
| | - Erik Larsson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden.
| |
Collapse
|
30
|
EBF1 and PAX5 control pro-B cell expansion via opposing regulation of the Myc gene. Blood 2021; 137:3037-3049. [PMID: 33619557 DOI: 10.1182/blood.2020009564] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 02/06/2021] [Indexed: 12/12/2022] Open
Abstract
Genes encoding B lineage-restricted transcription factors are frequently mutated in B-lymphoid leukemias, suggesting a close link between normal and malignant B-cell development. One of these transcription factors is early B-cell factor 1 (EBF1), a protein of critical importance for lineage specification and survival of B-lymphoid progenitors. Here, we report that impaired EBF1 function in mouse B-cell progenitors results in reduced expression of Myc. Ectopic expression of MYC partially rescued B-cell expansion in the absence of EBF1 both in vivo and in vitro. Using chromosome conformation analysis in combination with ATAC-sequencing, chromatin immunoprecipitation-sequencing, and reporter gene assays, six EBF1-responsive enhancer elements were identified within the Myc locus. CRISPR-Cas9-mediated targeting of EBF1-binding sites identified one element of key importance for Myc expression and pro-B cell expansion. These data provide evidence that Myc is a direct target of EBF1. Furthermore, chromatin immunoprecipitation-sequencing analysis revealed that several regulatory elements in the Myc locus are targets of PAX5. However, ectopic expression of PAX5 in EBF1-deficient cells inhibits the cell cycle and reduces Myc expression, suggesting that EBF1 and PAX5 act in an opposing manner to regulate Myc levels. This hypothesis is further substantiated by the finding that Pax5 inactivation reduces requirements for EBF1 in pro-B-cell expansion. The binding of EBF1 and PAX5 to regulatory elements in the human MYC gene in a B-cell acute lymphoblastic leukemia cell line indicates that the EBF1:PAX5:MYC regulatory loop is conserved and may control both normal and malignant B-cell development.
Collapse
|
31
|
Strid T, Okuyama K, Tingvall-Gustafsson J, Kuruvilla J, Jensen CT, Lang S, Prasad M, Somasundaram R, Åhsberg J, Cristobal S, Soneji S, Ungerbäck J, Sigvardsson M. B Lymphocyte Specification Is Preceded by Extensive Epigenetic Priming in Multipotent Progenitors. THE JOURNAL OF IMMUNOLOGY 2021; 206:2700-2713. [PMID: 34021049 DOI: 10.4049/jimmunol.2100048] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 03/27/2021] [Indexed: 11/19/2022]
Abstract
B lymphocyte development is dependent on the interplay between the chromatin landscape and lineage-specific transcription factors. It has been suggested that B lineage commitment is associated with major changes in the nuclear chromatin environment, proposing a critical role for lineage-specific transcription factors in the formation of the epigenetic landscape. In this report, we have used chromosome conformation capture in combination with assay for transposase-accessible chromatin sequencing analysis to enable highly efficient annotation of both proximal and distal transcriptional control elements to genes activated in B lineage specification in mice. A large majority of these genes were annotated to at least one regulatory element with an accessible chromatin configuration in multipotent progenitors. Furthermore, the majority of binding sites for the key regulators of B lineage specification, EBF1 and PAX5, occurred in already accessible regions. EBF1 did, however, cause a dynamic change in assay for transposase-accessible chromatin accessibility and was critical for an increase in distal promoter-enhancer interactions. Our data unravel an extensive epigenetic priming at regulatory elements annotated to lineage-restricted genes and provide insight into the interplay between the epigenetic landscape and transcription factors in cell specification.
Collapse
Affiliation(s)
- Tobias Strid
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden.,Division of Molecular Hematology, Lund University, Lund, Sweden; and.,Department of Clinical Pathology, Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Kazuki Okuyama
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | | | - Jacob Kuruvilla
- Division of Molecular Hematology, Lund University, Lund, Sweden; and
| | | | - Stefan Lang
- Division of Molecular Hematology, Lund University, Lund, Sweden; and
| | - Mahadesh Prasad
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Rajesh Somasundaram
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Josefine Åhsberg
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Susana Cristobal
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Shamit Soneji
- Division of Molecular Hematology, Lund University, Lund, Sweden; and
| | - Jonas Ungerbäck
- Division of Molecular Hematology, Lund University, Lund, Sweden; and
| | - Mikael Sigvardsson
- Department of Biological and Clinical Sciences, Linköping University, Linköping, Sweden; .,Division of Molecular Hematology, Lund University, Lund, Sweden; and
| |
Collapse
|
32
|
Whitworth DE, Zwarycz A. A Genomic Survey of Signalling in the Myxococcaceae. Microorganisms 2020; 8:microorganisms8111739. [PMID: 33171896 PMCID: PMC7694542 DOI: 10.3390/microorganisms8111739] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 11/02/2020] [Accepted: 11/03/2020] [Indexed: 12/27/2022] Open
Abstract
As prokaryotes diverge by evolution, essential 'core' genes required for conserved phenotypes are preferentially retained, while inessential 'accessory' genes are lost or diversify. We used the recently expanded number of myxobacterial genome sequences to investigate the conservation of their signalling proteins, focusing on two sister genera (Myxococcus and Corallococcus), and on a species within each genus (Myxococcus xanthus and Corallococcus exiguus). Four new C. exiguus genome sequences are also described here. Despite accessory genes accounting for substantial proportions of each myxobacterial genome, signalling proteins were found to be enriched in the core genome, with two-component system genes almost exclusively so. We also investigated the conservation of signalling proteins in three myxobacterial behaviours. The linear carotenogenesis pathway was entirely conserved, with no gene gain/loss observed. However, the modular fruiting body formation network was found to be evolutionarily plastic, with dispensable components in all modules (including components required for fruiting in the model myxobacterium M. xanthus DK1622). Quorum signalling (QS) is thought to be absent from most myxobacteria, however, they generally appear to be able to produce CAI-I (cholerae autoinducer-1), to sense other QS molecules, and to disrupt the QS of other organisms, potentially important abilities during predation of other prokaryotes.
Collapse
|
33
|
Vihinen M. Functional effects of protein variants. Biochimie 2020; 180:104-120. [PMID: 33164889 DOI: 10.1016/j.biochi.2020.10.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 10/15/2020] [Accepted: 10/19/2020] [Indexed: 12/11/2022]
Abstract
Genetic and other variations frequently affect protein functions. Scientific articles can contain confusing descriptions about which function or property is affected, and in many cases the statements are pure speculation without any experimental evidence. To clarify functional effects of protein variations of genetic or non-genetic origin, a systematic conceptualisation and framework are introduced. This framework describes protein functional effects on abundance, activity, specificity and affinity, along with countermeasures, which allow cells, tissues and organisms to tolerate, avoid, repair, attenuate or resist (TARAR) the effects. Effects on abundance discussed include gene dosage, restricted expression, mis-localisation and degradation. Enzymopathies, effects on kinetics, allostery and regulation of protein activity are subtopics for the effects of variants on activity. Variation outcomes on specificity and affinity comprise promiscuity, specificity, affinity and moonlighting. TARAR mechanisms redress variations with active and passive processes including chaperones, redundancy, robustness, canalisation and metabolic and signalling rewiring. A framework for pragmatic protein function analysis and presentation is introduced. All of the mechanisms and effects are described along with representative examples, most often in relation to diseases. In addition, protein function is discussed from evolutionary point of view. Application of the presented framework facilitates unambiguous, detailed and specific description of functional effects and their systematic study.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184, Lund, Sweden.
| |
Collapse
|
34
|
Abstract
Canalization refers to the evolution of populations such that the number of individuals who deviate from the optimum trait, or experience disease, is minimized. In the presence of rapid cultural, environmental, or genetic change, the reverse process of decanalization may contribute to observed increases in disease prevalence. This review starts by defining relevant concepts, drawing distinctions between the canalization of populations and robustness of individuals. It then considers evidence pertaining to three continuous traits and six domains of disease. In each case, existing genetic evidence for genotype-by-environment interactions is insufficient to support a strong inference of decanalization, but we argue that the advent of genome-wide polygenic risk assessment now makes an empirical evaluation of the role of canalization in preventing disease possible. Finally, the contributions of both rare and common variants to congenital abnormality and adult onset disease are considered in light of a new kerplunk model of genetic effects.
Collapse
Affiliation(s)
- Greg Gibson
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA;
| | - Kristine A Lacek
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA;
| |
Collapse
|
35
|
Hartmann J, Krueger D, De Renzis S. Using optogenetics to tackle systems-level questions of multicellular morphogenesis. Curr Opin Cell Biol 2020; 66:19-27. [PMID: 32408249 DOI: 10.1016/j.ceb.2020.04.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Revised: 04/03/2020] [Accepted: 04/06/2020] [Indexed: 01/13/2023]
Abstract
Morphogenesis of multicellular systems is governed by precise spatiotemporal regulation of biochemical reactions and mechanical forces which together with environmental conditions determine the development of complex organisms. Current efforts in the field aim at decoding the system-level principles underlying the regulation of developmental processes. Toward this goal, optogenetics, the science of regulation of protein function with light, is emerging as a powerful new tool to quantitatively perturb protein function in vivo with unprecedented precision in space and time. In this review, we provide an overview of how optogenetics is helping to address system-level questions of multicellular morphogenesis and discuss future directions.
Collapse
Affiliation(s)
- Jonas Hartmann
- European Molecular Biology Laboratory (EMBL), Developmental Biology Unit, Meyerhofstrasse 1, 69117, Heidelberg, Germany.
| | - Daniel Krueger
- European Molecular Biology Laboratory (EMBL), Developmental Biology Unit, Meyerhofstrasse 1, 69117, Heidelberg, Germany
| | - Stefano De Renzis
- European Molecular Biology Laboratory (EMBL), Developmental Biology Unit, Meyerhofstrasse 1, 69117, Heidelberg, Germany.
| |
Collapse
|
36
|
Miller SW, Posakony JW. Disparate expression specificities coded by a shared Hox-C enhancer. eLife 2020; 9:39876. [PMID: 32342858 PMCID: PMC7188484 DOI: 10.7554/elife.39876] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Accepted: 04/09/2020] [Indexed: 12/13/2022] Open
Abstract
Can a single regulatory sequence be shared by two genes undergoing functional divergence? Here we describe a single promiscuous enhancer within the Drosophila Antennapedia Complex, EO053, that directs aspects of the expression of two adjacent genes, pb (a Hox2 ortholog) and zen2 (a divergent Hox3 paralog), with disparate spatial and temporal expression patterns. We were unable to separate the pb-like and zen2-like specificities within EO053, and we identify sequences affecting both expression patterns. Importantly, genomic deletion experiments demonstrate that EO053 cooperates with additional pb- and zen2-specific enhancers to regulate the mRNA expression of both genes. We examine sequence conservation of EO053 within the Schizophora, and show that patterns of synteny between the Hox2 and Hox3 orthologs in Arthropods are consistent with a shared regulatory relationship extending prior to the Hox3/zen divergence. Thus, EO053 represents an example of two genes having evolved disparate outputs while utilizing this shared regulatory region. Editorial note: This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that all the issues have been addressed (see decision letter).
Collapse
Affiliation(s)
- Steve W Miller
- Division of Biological Sciences, Section of Cell & Developmental Biology, University of California San Diego, La Jolla, United States
| | - James W Posakony
- Division of Biological Sciences, Section of Cell & Developmental Biology, University of California San Diego, La Jolla, United States
| |
Collapse
|
37
|
Belver L, Yang AY, Albero R, Herranz D, Brundu FG, Quinn SA, Pérez-Durán P, Álvarez S, Gianni F, Rashkovan M, Gurung D, Rocha PP, Raviram R, Reglero C, Cortés JR, Cooke AJ, Wendorff AA, Cordó V, Meijerink JP, Rabadan R, Ferrando AA. GATA3-Controlled Nucleosome Eviction Drives MYC Enhancer Activity in T-cell Development and Leukemia. Cancer Discov 2019; 9:1774-1791. [PMID: 31519704 DOI: 10.1158/2159-8290.cd-19-0471] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 08/15/2019] [Accepted: 09/10/2019] [Indexed: 12/28/2022]
Abstract
Long-range enhancers govern the temporal and spatial control of gene expression; however, the mechanisms that regulate enhancer activity during normal and malignant development remain poorly understood. Here, we demonstrate a role for aberrant chromatin accessibility in the regulation of MYC expression in T-cell lymphoblastic leukemia (T-ALL). Central to this process, the NOTCH1-MYC enhancer (N-Me), a long-range T cell-specific MYC enhancer, shows dynamic changes in chromatin accessibility during T-cell specification and maturation and an aberrant high degree of chromatin accessibility in mouse and human T-ALL cells. Mechanistically, we demonstrate that GATA3-driven nucleosome eviction dynamically modulates N-Me enhancer activity and is strictly required for NOTCH1-induced T-ALL initiation and maintenance. These results directly implicate aberrant regulation of chromatin accessibility at oncogenic enhancers as a mechanism of leukemic transformation. SIGNIFICANCE: MYC is a major effector of NOTCH1 oncogenic programs in T-ALL. Here, we show a major role for GATA3-mediated enhancer nucleosome eviction as a driver of MYC expression and leukemic transformation. These results support the role of aberrant chromatin accessibility and consequent oncogenic MYC enhancer activation in NOTCH1-induced T-ALL.This article is highlighted in the In This Issue feature, p. 1631.
Collapse
Affiliation(s)
- Laura Belver
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Alexander Y Yang
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Robert Albero
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Daniel Herranz
- Rutgers Cancer Institute of New Jersey, Rutgers University, New Brunswick, New Jersey.,Department of Pharmacology, Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey
| | | | - S Aidan Quinn
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Pablo Pérez-Durán
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Silvia Álvarez
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Francesca Gianni
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Marissa Rashkovan
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Devya Gurung
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Pedro P Rocha
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, Maryland
| | - Ramya Raviram
- Ludwig Institute for Cancer Research, La Jolla, California.,Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Clara Reglero
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Jose R Cortés
- Institute for Cancer Genetics, Columbia University, New York, New York
| | - Anisha J Cooke
- Institute for Cancer Genetics, Columbia University, New York, New York
| | | | - Valentina Cordó
- Department of Pediatric Oncology/Hematology, Princess Maxima Center for Pediatric Oncology, Utrecht, the Netherlands
| | - Jules P Meijerink
- Department of Pediatric Oncology/Hematology, Princess Maxima Center for Pediatric Oncology, Utrecht, the Netherlands
| | - Raúl Rabadan
- Department of Pharmacology, Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey.,Department of Biomedical Informatics, Columbia University, New York, New York
| | - Adolfo A Ferrando
- Institute for Cancer Genetics, Columbia University, New York, New York. .,Department of Pharmacology, Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey.,Department of Pediatrics, Columbia University Medical Center, New York, New York.,Department of Pathology, Columbia University Medical Center, New York, New York
| |
Collapse
|
38
|
Lamrabet O, Plumbridge J, Martin M, Lenski RE, Schneider D, Hindré T. Plasticity of Promoter-Core Sequences Allows Bacteria to Compensate for the Loss of a Key Global Regulatory Gene. Mol Biol Evol 2019; 36:1121-1133. [PMID: 30825312 DOI: 10.1093/molbev/msz042] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Transcription regulatory networks (TRNs) are of central importance for both short-term phenotypic adaptation in response to environmental fluctuations and long-term evolutionary adaptation, with global regulatory genes often being targets of natural selection in laboratory experiments. Here, we combined evolution experiments, whole-genome resequencing, and molecular genetics to investigate the driving forces, genetic constraints, and molecular mechanisms that dictate how bacteria can cope with a drastic perturbation of their TRNs. The crp gene, encoding a major global regulator in Escherichia coli, was deleted in four different genetic backgrounds, all derived from the Long-Term Evolution Experiment (LTEE) but with different TRN architectures. We confirmed that crp deletion had a more deleterious effect on growth rate in the LTEE-adapted genotypes; and we showed that the ptsG gene, which encodes the major glucose-PTS transporter, gained CRP (cyclic AMP receptor protein) dependence over time in the LTEE. We then further evolved the four crp-deleted genotypes in glucose minimal medium, and we found that they all quickly recovered from their growth defects by increasing glucose uptake. We showed that this recovery was specific to the selective environment and consistently relied on mutations in the cis-regulatory region of ptsG, regardless of the initial genotype. These mutations affected the interplay of transcription factors acting at the promoters, changed the intrinsic properties of the existing promoters, or produced new transcription initiation sites. Therefore, the plasticity of even a single promoter region can compensate by three different mechanisms for the loss of a key regulatory hub in the E. coli TRN.
Collapse
Affiliation(s)
- Otmane Lamrabet
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| | - Jacqueline Plumbridge
- CNRS UMR8261, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-chimique, Paris, France
| | - Mikaël Martin
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| | - Richard E Lenski
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI.,BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI
| | | | - Thomas Hindré
- Université Grenoble Alpes, CNRS, Grenoble INP, TIMC-IMAG, Grenoble, France
| |
Collapse
|
39
|
Rao MC. Physiology of Electrolyte Transport in the Gut: Implications for Disease. Compr Physiol 2019; 9:947-1023. [PMID: 31187895 DOI: 10.1002/cphy.c180011] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We now have an increased understanding of the genetics, cell biology, and physiology of electrolyte transport processes in the mammalian intestine, due to the availability of sophisticated methodologies ranging from genome wide association studies to CRISPR-CAS technology, stem cell-derived organoids, 3D microscopy, electron cryomicroscopy, single cell RNA sequencing, transgenic methodologies, and tools to manipulate cellular processes at a molecular level. This knowledge has simultaneously underscored the complexity of biological systems and the interdependence of multiple regulatory systems. In addition to the plethora of mammalian neurohumoral factors and their cross talk, advances in pyrosequencing and metagenomic analyses have highlighted the relevance of the microbiome to intestinal regulation. This article provides an overview of our current understanding of electrolyte transport processes in the small and large intestine, their regulation in health and how dysregulation at multiple levels can result in disease. Intestinal electrolyte transport is a balance of ion secretory and ion absorptive processes, all exquisitely dependent on the basolateral Na+ /K+ ATPase; when this balance goes awry, it can result in diarrhea or in constipation. The key transporters involved in secretion are the apical membrane Cl- channels and the basolateral Na+ -K+ -2Cl- cotransporter, NKCC1 and K+ channels. Absorption chiefly involves apical membrane Na+ /H+ exchangers and Cl- /HCO3 - exchangers in the small intestine and proximal colon and Na+ channels in the distal colon. Key examples of our current understanding of infectious, inflammatory, and genetic diarrheal diseases and of constipation are provided. © 2019 American Physiological Society. Compr Physiol 9:947-1023, 2019.
Collapse
Affiliation(s)
- Mrinalini C Rao
- Department of Physiology and Biophysics, University of Illinois at Chicago, Chicago, Illinois, USA
| |
Collapse
|
40
|
Alexandre CM, Urton JR, Jean-Baptiste K, Huddleston J, Dorrity MW, Cuperus JT, Sullivan AM, Bemm F, Jolic D, Arsovski AA, Thompson A, Nemhauser JL, Fields S, Weigel D, Bubb KL, Queitsch C. Complex Relationships between Chromatin Accessibility, Sequence Divergence, and Gene Expression in Arabidopsis thaliana. Mol Biol Evol 2019; 35:837-854. [PMID: 29272536 DOI: 10.1093/molbev/msx326] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Variation in regulatory DNA is thought to drive phenotypic variation, evolution, and disease. Prior studies of regulatory DNA and transcription factors across animal species highlighted a fundamental conundrum: Transcription factor binding domains and cognate binding sites are conserved, while regulatory DNA sequences are not. It remains unclear how conserved transcription factors and dynamic regulatory sites produce conserved expression patterns across species. Here, we explore regulatory DNA variation and its functional consequences within Arabidopsis thaliana, using chromatin accessibility to delineate regulatory DNA genome-wide. Unlike in previous cross-species comparisons, the positional homology of regulatory DNA is maintained among A. thaliana ecotypes and less nucleotide divergence has occurred. Of the ∼50,000 regulatory sites in A. thaliana, we found that 15% varied in accessibility among ecotypes. Some of these accessibility differences were associated with extensive, previously unannotated sequence variation, encompassing many deletions and ancient hypervariable alleles. Unexpectedly, for the majority of such regulatory sites, nearby gene expression was unaffected. Nevertheless, regulatory sites with high levels of sequence variation and differential chromatin accessibility were the most likely to be associated with differential gene expression. Finally, and most surprising, we found that the vast majority of differentially accessible sites show no underlying sequence variation. We argue that these surprising results highlight the necessity to consider higher-order regulatory context in evaluating regulatory variation and predicting its phenotypic consequences.
Collapse
Affiliation(s)
| | - James R Urton
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Ken Jean-Baptiste
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - John Huddleston
- Department of Genome Sciences, University of Washington, Seattle, WA.,Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, WA
| | - Michael W Dorrity
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Josh T Cuperus
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Felix Bemm
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Dino Jolic
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | | | | | - Stan Fields
- Department of Genome Sciences, University of Washington, Seattle, WA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Kerry L Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Christin Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA
| |
Collapse
|
41
|
Maheepala DC, Emerling CA, Rajewski A, Macon J, Strahl M, Pabón-Mora N, Litt A. Evolution and Diversification of FRUITFULL Genes in Solanaceae. FRONTIERS IN PLANT SCIENCE 2019; 10:43. [PMID: 30846991 PMCID: PMC6394111 DOI: 10.3389/fpls.2019.00043] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2018] [Accepted: 01/11/2019] [Indexed: 05/12/2023]
Abstract
Ecologically and economically important fleshy edible fruits have evolved from dry fruit numerous times during angiosperm diversification. However, the molecular mechanisms that underlie these shifts are unknown. In the Solanaceae there has been a major shift to fleshy fruits in the subfamily Solanoideae. Evidence suggests that an ortholog of FRUITFULL (FUL), a transcription factor that regulates cell proliferation and limits the dehiscence zone in the silique of Arabidopsis, plays a similar role in dry-fruited Solanaceae. However, studies have shown that FUL orthologs have taken on new functions in fleshy fruit development, including regulating elements of tomato ripening such as pigment accumulation. FUL belongs to the core eudicot euFUL clade of the angiosperm AP1/FUL gene lineage. The euFUL genes fall into two paralogous clades, euFULI and euFULII. While most core eudicots have one gene in each clade, Solanaceae have two: FUL1 and FUL2 in the former, and MBP10 and MBP20 in the latter. We characterized the evolution of the euFUL genes to identify changes that might be correlated with the origin of fleshy fruit in Solanaceae. Our analyses revealed that the Solanaceae FUL1 and FUL2 clades probably originated through an early whole genome multiplication event. By contrast, the data suggest that the MBP10 and MBP20 clades are the result of a later tandem duplication event. MBP10 is expressed at weak to moderate levels, and its atypical short first intron lacks putative transcription factor binding sites, indicating possible pseudogenization. Consistent with this, our analyses show that MBP10 is evolving at a faster rate compared to MBP20. Our analyses found that Solanaceae euFUL gene duplications, evolutionary rates, and changes in protein residues and expression patterns are not correlated with the shift in fruit type. This suggests deeper analyses are needed to identify the mechanism underlying the change in FUL ortholog function.
Collapse
Affiliation(s)
- Dinusha C. Maheepala
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, United States
| | - Christopher A. Emerling
- Institut des Sciences de l’Évolution de Montpellier, Université de Montpellier, Centre National de la Recherche Scientifique, Institut de Recherche pour le Développement, École Pratique des Hautes Études, Montpellier, France
| | - Alex Rajewski
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, United States
| | - Jenna Macon
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, United States
| | - Maya Strahl
- The New York Botanical Garden, Bronx, NY, United States
| | | | - Amy Litt
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, United States
- *Correspondence: Amy Litt,
| |
Collapse
|
42
|
Capriotti E, Ozturk K, Carter H. Integrating molecular networks with genetic variant interpretation for precision medicine. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2018; 11:e1443. [PMID: 30548534 PMCID: PMC6450710 DOI: 10.1002/wsbm.1443] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/23/2018] [Accepted: 10/30/2018] [Indexed: 02/01/2023]
Abstract
More reliable and cheaper sequencing technologies have revealed the vast mutational landscapes characteristic of many phenotypes. The analysis of such genetic variants has led to successful identification of altered proteins underlying many Mendelian disorders. Nevertheless the simple one‐variant one‐phenotype model valid for many monogenic diseases does not capture the complexity of polygenic traits and disorders. Although experimental and computational approaches have improved detection of functionally deleterious variants and important interactions between gene products, the development of comprehensive models relating genotype and phenotypes remains a challenge in the field of genomic medicine. In this context, a new view of the pathologic state as significant perturbation of the network of interactions between biomolecules is crucial for the identification of biochemical pathways associated with complex phenotypes. Seminal studies in systems biology combined the analysis of genetic variation with protein–protein interaction networks to demonstrate that even as biological systems evolve to be robust to genetic variation, their topologies create disease vulnerabilities. More recent analyses model the impact of genetic variants as changes to the “wiring” of the interactome to better capture heterogeneity in genotype–phenotype relationships. These studies lay the foundation for using networks to predict variant effects at scale using machine‐learning or algorithmic approaches. A wealth of databases and resources for the annotation of genotype–phenotype relationships have been developed to support developments in this area. This overview describes how study of the molecular interactome has generated insights linking the organization of biological systems to disease mechanism, and how this information can enable precision medicine. This article is categorized under:
Translational, Genomic, and Systems Medicine > Translational Medicine Biological Mechanisms > Cell Signaling Models of Systems Properties and Processes > Mechanistic Models Analytical and Computational Methods > Computational Methods
Collapse
Affiliation(s)
- Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Kivilcim Ozturk
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California
| | - Hannah Carter
- Department of Medicine and Institute for Genomic Medicine, University of California, San Diego, La Jolla, California
| |
Collapse
|
43
|
Morata J, Marín F, Payet J, Casacuberta JM. Plant Lineage-Specific Amplification of Transcription Factor Binding Motifs by Miniature Inverted-Repeat Transposable Elements (MITEs). Genome Biol Evol 2018; 10:1210-1220. [PMID: 29659815 PMCID: PMC5950925 DOI: 10.1093/gbe/evy073] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/04/2018] [Indexed: 12/20/2022] Open
Abstract
Transposable elements are one of the main drivers of plant genome evolution. Transposon insertions can modify the gene coding capacity or the regulation of their expression, the latter being a more subtle effect, and therefore particularly useful for evolution. Transposons have been show to contain transcription factor binding sites that can be mobilized upon transposition with the potential to integrate new genes into transcriptional networks. Miniature inverted-repeat transposable elements (MITEs) are a type of noncoding DNA transposons that could be particularly suited as a vector to mobilize transcription factor binding sites and modify transcriptional networks during evolution. MITEs are small in comparison to other transposons and can be excised, which should make them less mutagenic when inserting into promoters. On the other hand, in spite of their cut-and-paste mechanisms of transposition, they can reach very high copy numbers in genomes. We have previously shown that MITEs have amplified and redistributed the binding motif of the E2F transcription factor in different Brassicas. Here, we show that MITEs have amplified and mobilized the binding motifs of the bZIP60 and PIF3 transcription factors in peach and Prunus mume, and the TCP15/23 binding motif in tomato. Our results suggest that MITEs could have rewired new genes into transcriptional regulatory networks that are responsible for important adaptive responses and breeding traits in plants, such as stress responses, flowering time, or fruit ripening. The results presented here therefore suggest a general impact of MITEs in the evolution of transcriptional regulatory networks in plants.
Collapse
Affiliation(s)
- Jordi Morata
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| | - Fatima Marín
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| | | | - Josep M Casacuberta
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| |
Collapse
|
44
|
Kirschner S, Woodfield H, Prusko K, Koczor M, Gowik U, Hibberd JM, Westhoff P. Expression of SULTR2;2, encoding a low-affinity sulphur transporter, in the Arabidopsis bundle sheath and vein cells is mediated by a positive regulator. JOURNAL OF EXPERIMENTAL BOTANY 2018; 69:4897-4906. [PMID: 30032291 PMCID: PMC6137973 DOI: 10.1093/jxb/ery263] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 07/10/2018] [Indexed: 05/03/2023]
Abstract
The bundle sheath provides a conduit linking veins and mesophyll cells. In the C3 plant Arabidopsis thaliana, it also plays important roles in oxidative stress and sulphur metabolism. However, the mechanisms responsible for the patterns of gene expression that underpin these metabolic specializations are poorly understood. Here, we used the Arabidopsis SULTR2;2 gene as a model to better understand mechanisms that restrict expression to the bundle sheath. Deletion analysis indicated that the SULTR2;2 promoter contains a short region necessary for expression in the bundle sheath and veins. This sequence acts as a positive regulator and is tolerant to multiple consecutive deletions indicating considerable redundancy in the cis-elements involved. It is highly conserved in SULTR2;2 genes of the Brassicaceae and is functional in the distantly related C4 species Flaveria bidentis that belongs to the Asteraceae. We conclude that expression of SULTR2;2 in the bundle sheath and veins is underpinned by a highly redundant sequence that likely represents an ancient and conserved mechanism found in families as diverse as the Asteraceae and Brassicaceae.
Collapse
Affiliation(s)
- Sandra Kirschner
- Institute for Plant Molecular and Developmental Biology, Heinrich-Heine-Universität Düsseldorf, Universitätsstraße, Düsseldorf, Germany
| | - Helen Woodfield
- Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge, UK
| | - Katharina Prusko
- Institute for Plant Molecular and Developmental Biology, Heinrich-Heine-Universität Düsseldorf, Universitätsstraße, Düsseldorf, Germany
| | - Maria Koczor
- Institute for Plant Molecular and Developmental Biology, Heinrich-Heine-Universität Düsseldorf, Universitätsstraße, Düsseldorf, Germany
| | - Udo Gowik
- Institute for Plant Molecular and Developmental Biology, Heinrich-Heine-Universität Düsseldorf, Universitätsstraße, Düsseldorf, Germany
| | - Julian M Hibberd
- Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge, UK
| | - Peter Westhoff
- Institute for Plant Molecular and Developmental Biology, Heinrich-Heine-Universität Düsseldorf, Universitätsstraße, Düsseldorf, Germany
| |
Collapse
|
45
|
Epigenetic and Cellular Diversity in the Brain through Allele-Specific Effects. Trends Neurosci 2018; 41:925-937. [PMID: 30098802 DOI: 10.1016/j.tins.2018.07.005] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 06/29/2018] [Accepted: 07/10/2018] [Indexed: 01/18/2023]
Abstract
The benefits of diploidy are considered to involve masking partially recessive mutations and increasing genetic diversity. Here, we review new studies showing evidence for diverse allele-specific expression and epigenetic states in mammalian brain cells, which suggest that diploidy expands the landscape of gene regulatory and expression programs in cells. Allele-specific expression has been thought to be restricted to a few specific classes of genes. However, new studies show novel genomic imprinting effects that are brain-region-, cell-type- and age-dependent. In addition, novel forms of random monoallelic expression that impact many autosomal genes have been described in vitro and in vivo. We discuss the implications for understanding the benefits of diploidy, and the mechanisms shaping brain development, function, and disease.
Collapse
|
46
|
Bao R, Dia SE, Issa HA, Alhusein D, Friedrich M. Comparative Evidence of an Exceptional Impact of Gene Duplication on the Developmental Evolution of Drosophila and the Higher Diptera. Front Ecol Evol 2018. [DOI: 10.3389/fevo.2018.00063] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
|
47
|
Abstract
Cells regulate the activity of genes in a variety of ways. For example, they regulate transcription through DNA binding proteins called transcription factors, and they regulate mRNA stability and processing through RNA binding proteins. Based on current knowledge, transcriptional regulation is more widespread and is involved in many more evolutionary adaptations than posttranscriptional regulation. The reason could be that transcriptional regulation is studied more intensely. We suggest instead that transcriptional regulation harbors an intrinsic evolutionary advantage: when mutations change transcriptional regulation, they are more likely to bring forth novel patterns of such regulation. That is, transcriptional regulation is more evolvable. Our analysis suggests a reason why a specific kind of gene regulation is especially abundant in the living world. Much of gene regulation is carried out by proteins that bind DNA or RNA molecules at specific sequences. One class of such proteins is transcription factors, which bind short DNA sequences to regulate transcription. Another class is RNA binding proteins, which bind short RNA sequences to regulate RNA maturation, transport, and stability. Here, we study the robustness and evolvability of these regulatory mechanisms. To this end, we use experimental binding data from 172 human and fruit fly transcription factors and RNA binding proteins as well as human polymorphism data to study the evolution of binding sites in vivo. We find little difference between the robustness of regulatory protein–RNA interactions and transcription factor–DNA interactions to DNA mutations. In contrast, we find that RNA-mediated regulation is less evolvable than transcriptional regulation, because mutations are less likely to create interactions of an RNA molecule with a new RNA binding protein than they are to create interactions of a gene regulatory region with a new transcription factor. Our observations are consistent with the high level of conservation observed for interactions between RNA binding proteins and their target molecules as well as the evolutionary plasticity of regulatory regions bound by transcription factors. They may help explain why transcriptional regulation is implicated in many more evolutionary adaptations and innovations than RNA-mediated gene regulation.
Collapse
|
48
|
Rikkerink EHA. Pathogens and Disease Play Havoc on the Host Epiproteome-The "First Line of Response" Role for Proteomic Changes Influenced by Disorder. Int J Mol Sci 2018. [PMID: 29518008 PMCID: PMC5877633 DOI: 10.3390/ijms19030772] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Organisms face stress from multiple sources simultaneously and require mechanisms to respond to these scenarios if they are to survive in the long term. This overview focuses on a series of key points that illustrate how disorder and post-translational changes can combine to play a critical role in orchestrating the response of organisms to the stress of a changing environment. Increasingly, protein complexes are thought of as dynamic multi-component molecular machines able to adapt through compositional, conformational and/or post-translational modifications to control their largely metabolic outputs. These metabolites then feed into cellular physiological homeostasis or the production of secondary metabolites with novel anti-microbial properties. The control of adaptations to stress operates at multiple levels including the proteome and the dynamic nature of proteomic changes suggests a parallel with the equally dynamic epigenetic changes at the level of nucleic acids. Given their properties, I propose that some disordered protein platforms specifically enable organisms to sense and react rapidly as the first line of response to change. Using examples from the highly dynamic host-pathogen and host-stress response, I illustrate by example how disordered proteins are key to fulfilling the need for multiple levels of integration of response at different time scales to create robust control points.
Collapse
Affiliation(s)
- Erik H A Rikkerink
- The New Zealand Institute for Plant & Food Research Ltd., 120 Mt. Albert Rd., Private Bag 92169, Auckland 1025, New Zealand.
| |
Collapse
|
49
|
Julio-Kalajzić F, Villanueva S, Burgos J, Ojeda M, Cid LP, Jentsch TJ, Sepúlveda FV. K 2P TASK-2 and KCNQ1-KCNE3 K + channels are major players contributing to intestinal anion and fluid secretion. J Physiol 2017; 596:393-407. [PMID: 29143340 DOI: 10.1113/jp275178] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 11/08/2017] [Indexed: 12/11/2022] Open
Abstract
KEY POINTS K+ channels are important in intestinal epithelium as they ensure the ionic homeostasis and electrical potential of epithelial cells during anion and fluid secretion. Intestinal epithelium cAMP-activated anion secretion depends on the activity of the (also cAMP dependent) KCNQ1-KCNE3 K+ channel, but the secretory process survives after genetic inactivation of the K+ channel in the mouse. Here we use double mutant mice to investigate which alternative K+ channels come into action to compensate for the absence of KCNQ1-KCNE3 K+ channels. Our data establish that whilst Ca2+ -activated KCa 3.1 channels are not involved, K2P two-pore domain TASK-2 K+ channels are major players providing an alternative conductance to sustain the intestinal secretory process. Work with double mutant mice lacking both TASK-2 and KCNQ1-KCNE3 channels nevertheless points to yet-unidentified K+ channels that contribute to the robustness of the cAMP-activated anion secretion process. ABSTRACT Anion and fluid secretion across the intestinal epithelium, a process altered in cystic fibrosis and secretory diarrhoea, is mediated by cAMP-activated CFTR Cl- channels and requires the simultaneous activity of basolateral K+ channels to maintain cellular ionic homeostasis and membrane potential. This function is fulfilled by the cAMP-activated K+ channel formed by the association of pore-forming KCNQ1 with its obligatory KCNE3 β-subunit. Studies using mice show sizeable cAMP-activated intestinal anion secretion in the absence of either KCNQ1 or KCNE3 suggesting that an alternative K+ conductance must compensate for the loss of KCNQ1-KCNE3 activity. We used double mutant mouse and pharmacological approaches to identify such a conductance. Ca2+ -dependent anion secretion can also be supported by Ca2+ -dependent KCa 3.1 channels after independent CFTR activation, but cAMP-dependent anion secretion is not further decreased in the combined absence of KCa 3.1 and KCNQ1-KCNE3 K+ channel activity. We show that the K2P K+ channel TASK-2 is expressed in the epithelium of the small and large intestine. Tetrapentylammonium, a TASK-2 inhibitor, abolishes anion secretory current remaining in the absence of KCNQ1-KCNE3 activity. A double mutant mouse lacking both KCNQ1-KCNE3 and TASK-2 showed a much reduced cAMP-mediated anion secretion compared to that observed in the single KCNQ1-KCNE3 deficient mouse. We conclude that KCNQ1-KCNE3 and TASK-2 play major roles in the intestinal anion and fluid secretory phenotype. The persistence of an, admittedly reduced, secretory activity in the absence of these two conductances suggests that further additional K+ channel(s) as yet unidentified contribute to the robustness of the intestinal anion secretory process.
Collapse
Affiliation(s)
| | - Sandra Villanueva
- Centro de Estudios Científicos (CECs), Avenida Arturo Prat 514, Valdivia, Chile.,Universidad Austral de Chile, Valdivia, Chile
| | - Johanna Burgos
- Centro de Estudios Científicos (CECs), Avenida Arturo Prat 514, Valdivia, Chile.,Universidad Austral de Chile, Valdivia, Chile
| | - Margarita Ojeda
- Centro de Estudios Científicos (CECs), Avenida Arturo Prat 514, Valdivia, Chile
| | - L Pablo Cid
- Centro de Estudios Científicos (CECs), Avenida Arturo Prat 514, Valdivia, Chile
| | - Thomas J Jentsch
- Leibniz-Institut für Molekulare Pharmakologie (FMP) and Max-Delbrück-Centrum für Molekulare Medizin (MDC), Berlin, Germany
| | | |
Collapse
|
50
|
Chanderbali AS, Berger BA, Howarth DG, Soltis DE, Soltis PS. Evolution of floral diversity: genomics, genes and gamma. Philos Trans R Soc Lond B Biol Sci 2017; 372:rstb.2015.0509. [PMID: 27994132 DOI: 10.1098/rstb.2015.0509] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2016] [Indexed: 11/12/2022] Open
Abstract
A salient feature of flowering plant diversification is the emergence of a novel suite of floral features coinciding with the origin of the most species-rich lineage, Pentapetalae. Advances in phylogenetics, developmental genetics and genomics, including new analyses presented here, are helping to reconstruct the specific evolutionary steps involved in the evolution of this clade. The enormous floral diversity among Pentapetalae appears to be built on a highly conserved ground plan of five-parted (pentamerous) flowers with whorled phyllotaxis. By contrast, lability in the number and arrangement of component parts of the flower characterize the early-diverging eudicot lineages subtending Pentapetalae. The diversification of Pentapetalae also coincides closely with ancient hexaploidy, referred to as the gamma whole-genome triplication, for which the phylogenetic timing, mechanistic details and molecular evolutionary consequences are as yet not fully resolved. Transcription factors regulating floral development often persist in duplicate or triplicate in gamma-derived genomes, and both individual genes and whole transcriptional programmes exhibit a shift from broadly overlapping to tightly defined expression domains in Pentapetalae flowers. Investigations of these changes associated with the origin of Pentapetalae can lead to a more comprehensive understanding of what is arguably one of the most important evolutionary diversification events within terrestrial plants.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'.
Collapse
Affiliation(s)
- Andre S Chanderbali
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA.,Department of Biology, University of Florida, Gainesville, FL 32611, USA
| | - Brent A Berger
- Department of Biological Sciences, St John's University, Queens, NY 11439, USA
| | - Dianella G Howarth
- Department of Biological Sciences, St John's University, Queens, NY 11439, USA
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA.,Department of Biology, University of Florida, Gainesville, FL 32611, USA.,Genetics Institute, University of Florida, Gainesville, FL 32610, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA .,Genetics Institute, University of Florida, Gainesville, FL 32610, USA
| |
Collapse
|