1
|
Reimão-Pinto MM, Castillo-Hair SM, Seelig G, Schier AF. The regulatory landscape of 5' UTRs in translational control during zebrafish embryogenesis. Dev Cell 2025; 60:1498-1515.e8. [PMID: 39818206 DOI: 10.1016/j.devcel.2024.12.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 07/22/2024] [Accepted: 12/19/2024] [Indexed: 01/18/2025]
Abstract
The 5' UTRs of mRNAs are critical for translation regulation during development, but their in vivo regulatory features are poorly characterized. Here, we report the regulatory landscape of 5' UTRs during early zebrafish embryogenesis using a massively parallel reporter assay of 18,154 sequences coupled to polysome profiling. We found that the 5' UTR suffices to confer temporal dynamics to translation initiation and identified 86 motifs enriched in 5' UTRs with distinct ribosome recruitment capabilities. A quantitative deep learning model, Danio Optimus 5-Prime (DaniO5P), identified a combined role for 5' UTR length, translation initiation site context, upstream AUGs, and sequence motifs on ribosome recruitment. DaniO5P predicts the activities of maternal and zygotic 5' UTR isoforms and indicates that modulating 5' UTR length and motif grammar contributes to translation initiation dynamics. This study provides a first quantitative model of 5' UTR-based translation regulation in development and lays the foundation for identifying the underlying molecular effectors.
Collapse
Affiliation(s)
| | - Sebastian M Castillo-Hair
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA; eScience Institute, University of Washington, Seattle, WA 98195, USA
| | - Georg Seelig
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA; Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA 98195, USA
| | - Alexander F Schier
- Biozentrum, University of Basel, 4056 Basel, Switzerland; Allen Discovery Center for Cell Lineage Tracing, Seattle, WA 98195, USA.
| |
Collapse
|
2
|
Lobel JH, Ingolia NT. Deciphering disordered regions controlling mRNA decay in high-throughput. Nature 2025:10.1038/s41586-025-08919-x. [PMID: 40269159 DOI: 10.1038/s41586-025-08919-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Accepted: 03/19/2025] [Indexed: 04/25/2025]
Abstract
Intrinsically disordered regions within proteins drive specific molecular functions despite lacking a defined structure1,2. Although disordered regions are integral to controlling mRNA stability and translation, the mechanisms underlying these regulatory effects remain unclear3. Here we reveal the molecular determinants of this activity using high-throughput functional profiling. Systematic mutagenesis across hundreds of regulatory disordered elements, combined with machine learning, reveals a complex pattern of molecular features important for their activity. The presence and arrangement of aromatic residues strongly predicts the ability of seemingly diverse protein sequences to influence mRNA stability and translation. We further show how many of these regulatory elements exert their effects by engaging core mRNA decay machinery. Our results define molecular features and biochemical pathways that explain how disordered regions control mRNA expression and shed light on broader principles within functional, unstructured proteins.
Collapse
Affiliation(s)
- Joseph H Lobel
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Nicholas T Ingolia
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
- Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
3
|
Akirtava C, May G, McManus CJ. Deciphering the landscape of cis-acting sequences in natural yeast transcript leaders. Nucleic Acids Res 2025; 53:gkaf165. [PMID: 40071932 PMCID: PMC11897887 DOI: 10.1093/nar/gkaf165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 02/16/2025] [Accepted: 02/20/2025] [Indexed: 03/15/2025] Open
Abstract
Protein synthesis is a vital process that is highly regulated at the initiation step of translation. Eukaryotic 5' transcript leaders (TLs) contain a variety of cis-acting features that influence translation and messenger RNA stability. However, the relative influences of these features in natural TLs are poorly characterized. To address this, we used massively parallel reporter assays (MPRAs) to quantify RNA levels, ribosome loading, and protein levels from 11,027 natural yeast TLs in vivo and systematically compared the relative impacts of their sequence features on gene expression. We found that yeast TLs influence gene expression over two orders of magnitude. While a leaky scanning model using Kozak contexts (-4 to +1 around the AUG start) and upstream AUGs (uAUGs) explained half of the variance in expression across TLs, the addition of other features explained ∼80% of gene expression variation. Our analyses detected key cis-acting sequence features, quantified their effects in vivo, and compared their roles to motifs reported from an in vitro study of ribosome recruitment. In addition, our work quantitated the effects of alternative transcription start site usage on gene expression in yeast. Thus, our study provides new quantitative insights into the roles of TL cis-acting sequences in regulating gene expression.
Collapse
Affiliation(s)
- Christina Akirtava
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, United States
- RNA Bioscience Initiative, University of Colorado – Anschutz, Aurora, CO 80045, United States
| | - Gemma E May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, United States
| | - C Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, United States
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, United States
| |
Collapse
|
4
|
Strayer EC, Krishna S, Lee H, Vejnar C, Neuenkirchen N, Gupta A, Beaudoin JD, Giraldez AJ. NaP-TRAP reveals the regulatory grammar in 5'UTR-mediated translation regulation during zebrafish development. Nat Commun 2024; 15:10898. [PMID: 39738051 PMCID: PMC11685710 DOI: 10.1038/s41467-024-55274-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 12/06/2024] [Indexed: 01/01/2025] Open
Abstract
The cis-regulatory elements encoded in an mRNA determine its stability and translational output. While there has been a considerable effort to understand the factors driving mRNA stability, the regulatory frameworks governing translational control remain more elusive. We have developed a novel massively parallel reporter assay (MPRA) to measure mRNA translation, named Nascent Peptide Translating Ribosome Affinity Purification (NaP-TRAP). NaP-TRAP measures translation in a frame-specific manner through the immunocapture of epitope tagged nascent peptides of reporter mRNAs. We benchmark NaP-TRAP to polysome profiling and use it to quantify Kozak strength and the regulatory landscapes of 5' UTRs in the developing zebrafish embryo and in human cells. Through this approach we identified general and developmentally dynamic cis-regulatory elements, as well as potential trans-acting proteins. We find that U-rich motifs are general enhancers, and upstream ORFs and GC-rich motifs are global repressors of translation. We also observe a translational switch during the maternal-to-zygotic transition, where C-rich motifs shift from repressors to prominent activators of translation. Conversely, we show that microRNA sites in the 5' UTR repress translation following the zygotic expression of miR-430. Together these results demonstrate that NaP-TRAP is a versatile, accessible, and powerful method to decode the regulatory functions of UTRs across different systems.
Collapse
Affiliation(s)
- Ethan C Strayer
- Department of Genetics, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA
| | - Srikar Krishna
- Department of Genetics, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA
| | - Haejeong Lee
- Department of Genetics, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA
| | - Charles Vejnar
- Department of Genetics, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA
| | - Nils Neuenkirchen
- Department of Cell Biology, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA
| | - Amit Gupta
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, USA
| | - Jean-Denis Beaudoin
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, USA.
- Yale Center for RNA Science and Medicine, Yale University, New Haven, 06510, CT, USA.
| | - Antonio J Giraldez
- Department of Genetics, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA.
- Yale Center for RNA Science and Medicine, Yale University, New Haven, 06510, CT, USA.
- Yale Stem Cell Center, Yale University, Yale School of Medicine, New Haven, 06510, CT, USA.
| |
Collapse
|
5
|
Tominaga M, Shima Y, Nozaki K, Ito Y, Someda M, Shoya Y, Hashii N, Obata C, Matsumoto-Kitano M, Suematsu K, Matsukawa T, Hosoya K, Hashiba N, Kondo A, Ishii J. Designing strong inducible synthetic promoters in yeasts. Nat Commun 2024; 15:10653. [PMID: 39702268 PMCID: PMC11659477 DOI: 10.1038/s41467-024-54865-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 11/21/2024] [Indexed: 12/21/2024] Open
Abstract
Inducible promoters are essential for precise control of target gene expression in synthetic biological systems. However, engineering eukaryotic promoters is often more challenging than engineering prokaryotic promoters due to their greater mechanistic complexity. In this study, we describe a simple and reliable approach for constructing strongly inducible synthetic promoters with minimum leakiness in yeasts. The results indicate that the leakiness of yeast-inducible synthetic promoters is primarily the result of cryptic transcriptional activation of heterologous sequences that may be avoided by appropriate insulation and operator mutagenesis. Our promoter design approach has successfully generated robust, inducible promoters that achieve a > 103-fold induction in reporter gene expression. The utility of these promoters is demonstrated by using them to produce various biologics with titers up to 2 g/L, including antigens designed to raise specific antibodies against a SARS-CoV-2 omicron variant through chicken immunization.
Collapse
Affiliation(s)
- Masahiro Tominaga
- Engineering Biology Research Center, Kobe University, Kobe, Japan
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan
| | - Yoko Shima
- Engineering Biology Research Center, Kobe University, Kobe, Japan
| | - Kenta Nozaki
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan
| | - Yoichiro Ito
- Engineering Biology Research Center, Kobe University, Kobe, Japan
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan
| | | | - Yuji Shoya
- Pharma Foods International Co. Ltd., Kyoto, Japan
| | - Noritaka Hashii
- Division of Biological Chemistry and Biologicals, National Institute of Health Sciences, Kawasaki, Kanagawa, Japan
| | - Chihiro Obata
- Division of Biological Chemistry and Biologicals, National Institute of Health Sciences, Kawasaki, Kanagawa, Japan
| | | | - Kohei Suematsu
- Engineering Biology Research Center, Kobe University, Kobe, Japan
| | | | - Keita Hosoya
- Engineering Biology Research Center, Kobe University, Kobe, Japan
| | - Noriko Hashiba
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan
| | - Akihiko Kondo
- Engineering Biology Research Center, Kobe University, Kobe, Japan
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan
- Department of Chemical Science and Engineering, Faculty of Engineering, Kobe University, Kobe, Japan
- Center for Sustainable Resource Science, RIKEN, Yokohama, Japan
| | - Jun Ishii
- Engineering Biology Research Center, Kobe University, Kobe, Japan.
- Graduate School of Science, Technology and Innovation, Kobe University, Kobe, Japan.
| |
Collapse
|
6
|
Wäneskog M, Rasmussen TB, Jensen ED. A strategy for successful dual-species protein expression of genes with non-optimal codon usage destined for bacterial and yeast cell factories. Biotechnol Prog 2024; 40:e3482. [PMID: 38757558 PMCID: PMC11659799 DOI: 10.1002/btpr.3482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 04/24/2024] [Accepted: 05/08/2024] [Indexed: 05/18/2024]
Abstract
Recombinant protein expression on an industrial scale traditionally utilizes one of two microbial workhorses: Escherichia coli or Saccharomyces cerevisiae. Additionally, random protein engineering of enzymes and proteins aimed for expression in S. cerevisiae are often mutagenized and pre-screened in E. coli before expression in yeast. This introduces artificial bottlenecks as the bacterial expression vector needs to be substituted for a yeast expression vector via sub-cloning, and the new library re-evaluated before a final screening in yeast. Here, we put forward a protein expression and engineering strategy that involves the use of a dual-host shuttle vector (pYB-Dual) designed with both a strong inducible yeast promoter (pGAL1), and a strong inducible bacterial promoter (pT7-RNAP), which allows for inducible protein expression in both species. Additionally, we demonstrate that by transforming the pYB-Dual vector into the E. coli strain Rosetta 2, which has elevated levels of 7 rare tRNAs, we can achieve high-level protein expression in both yeast and bacteria, even when using a mNeonGreen gene codon optimized for yeast. This dual expression vector is expected to remove bottlenecks during protein engineering of commercially important enzymes destined for high-titer expression in yeast.
Collapse
Affiliation(s)
- Marcus Wäneskog
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkKgs. LyngbyDenmark
| | - Trine Bertram Rasmussen
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkKgs. LyngbyDenmark
| | - Emil D. Jensen
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkKgs. LyngbyDenmark
| |
Collapse
|
7
|
La Fleur A, Shi Y, Seelig G. Decoding biology with massively parallel reporter assays and machine learning. Genes Dev 2024; 38:843-865. [PMID: 39362779 PMCID: PMC11535156 DOI: 10.1101/gad.351800.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2024]
Abstract
Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of cis-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on cis-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.
Collapse
Affiliation(s)
- Alyssa La Fleur
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA
| | - Yongsheng Shi
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, California 92697, USA;
| | - Georg Seelig
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA;
- Department of Electrical & Computer Engineering, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
8
|
Li ZM, Lin Y, Luo CH, Sun QL, Mi CL, Wang XY, Wang TY. Optimization of extended Kozak elements enhances recombinant proteins expression in CHO cells. J Biotechnol 2024; 392:96-102. [PMID: 38960098 DOI: 10.1016/j.jbiotec.2024.06.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 06/08/2024] [Accepted: 06/27/2024] [Indexed: 07/05/2024]
Abstract
In eukaryotes, the localization of small ribosomal subunits to mRNA transcripts requires the translation of Kozak elements at the starting site. The sequence of Kozak elements affects the translation efficiency of protein synthesis. However, whether the upstream nucleotide of Kozak sequence affects the expression of recombinant proteins in Chinese hamster ovary (CHO) cells remains unclear. In order to find the optimal sequence to enhance recombinant proteins expression in CHO cells, -10 to +4 sequences around ATG in 100 CHO genes were compared, and the extended Kozak elements with different translation intensities were constructed. Using the classic Kozak element as control, the effects of optimized extended Kozak elements on the secreted alkaline phosphatase (SEAP) and human serum albumin (HSA) gene were studied. The results showed that the optimized extended Kozak sequence can enhance the stable expression level of recombinant proteins in CHO cells. Furthermore, it was found that the increased expression level of the recombinant protein was not related with higher transcription level. In summary, optimizing extended Kozak elements can enhance the expression of recombinant proteins in CHO cells, which contributes to the construction of an efficient expression system for CHO cells.
Collapse
Affiliation(s)
- Zheng-Mei Li
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; College of Science and Technology, Nanchang University, Jiujiang 332020, China
| | - Yan Lin
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; School of Nursing, Xinxiang Medical University, Xinxiang 453003, China
| | - Cong-Hui Luo
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; School of Life Science and Technology,Xinxiang Medical University, Xinxiang 453003, China
| | - Qiu-Li Sun
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; School of Life Science and Technology,Xinxiang Medical University, Xinxiang 453003, China
| | - Chun-Liu Mi
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China
| | - Xiao-Yin Wang
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; Department of Biochemistry and Molecular Biology, School of Basic Medicine, Xinxiang Medical University, Xinxiang 453003, China.
| | - Tian-Yun Wang
- International Joint Research Laboratory for Recombinant Pharmaceutical Protein Expression System of Henan, Xinxiang Medical University, Xinxiang 453003, China; Department of Biochemistry and Molecular Biology, School of Basic Medicine, Xinxiang Medical University, Xinxiang 453003, China.
| |
Collapse
|
9
|
Wang Q, Kim T, Martínez-Bonet M, Aguiar VRC, Sim S, Cui J, Sparks JA, Chen X, Todd M, Wauford B, Marion MC, Langefeld CD, Weirauch MT, Gutierrez-Arcelus M, Nigrovic PA. High-throughput identification of functional regulatory SNPs in systemic lupus erythematosus. Nat Commun 2024; 15:6804. [PMID: 39122710 PMCID: PMC11315931 DOI: 10.1038/s41467-024-50710-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 07/17/2024] [Indexed: 08/12/2024] Open
Abstract
Genome-wide association studies implicate multiple loci in risk for systemic lupus erythematosus (SLE), but few contain exonic variants, rendering systematic identification of non-coding variants essential to decoding SLE genetics. We utilized SNP-seq and bioinformatic enrichment to interrogate 2180 single-nucleotide polymorphisms (SNPs) from 87 SLE risk loci for potential binding of transcription factors and related proteins from B cells. 52 SNPs that passed initial screening were tested by electrophoretic mobility shift and luciferase reporter assays. To validate the approach, we studied rs2297550 in detail, finding that the risk allele enhanced binding to the transcription factor Ikaros (encoded by IKZF1), thereby modulating expression of IKBKE. Correspondingly, primary cells from genotyped healthy donors bearing the risk allele expressed higher levels of the interferon / NF-κB regulator IKKε. Together, these findings define a set of likely functional non-coding lupus risk variants and identify a regulatory pathway involving rs2297550, Ikaros, and IKKε implicated by human genetics in risk for SLE.
Collapse
Affiliation(s)
- Qiang Wang
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Taehyeung Kim
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Marta Martínez-Bonet
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Laboratory of Immune-regulation, Instituto de Investigación Sanitaria Gregorio Marañón, Madrid, Spain
| | - Vitor R C Aguiar
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sangwan Sim
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Jing Cui
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Jeffrey A Sparks
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Xiaoting Chen
- Center of Autoimmune Genomics and Etiology, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Marc Todd
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Brian Wauford
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Miranda C Marion
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Carl D Langefeld
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Matthew T Weirauch
- Center of Autoimmune Genomics and Etiology, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
- Divisions of Human Genetics, Biomedical Informatics, and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Maria Gutierrez-Arcelus
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Peter A Nigrovic
- Division of Immunology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
10
|
Akirtava C, May G, McManus CJ. Deciphering the cis-regulatory landscape of natural yeast Transcript Leaders. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.03.601937. [PMID: 39005336 PMCID: PMC11245039 DOI: 10.1101/2024.07.03.601937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Protein synthesis is a vital process that is highly regulated at the initiation step of translation. Eukaryotic 5' transcript leaders (TLs) contain a variety of cis-regulatory features that influence translation and mRNA stability. However, the relative influences of these features in natural TLs are poorly characterized. To address this, we used massively parallel reporter assays (MPRAs) to quantify RNA levels, ribosome loading, and protein levels from 11,027 natural yeast TLs in vivo and systematically compared the relative impacts of their sequence features on gene expression. We found that yeast TLs influence gene expression over two orders of magnitude. While a leaky scanning model using Kozak contexts and uAUGs explained half of the variance in expression across transcript leaders, the addition of other features explained ~70% of gene expression variation. Our analyses detected key cis-acting sequence features, quantified their effects in vivo, and compared their roles to motifs reported from an in vitro study of ribosome recruitment. In addition, our work quantitated the effects of alternative transcription start site usage on gene expression in yeast. Thus, our study provides new quantitative insights into the roles of TL cis-acting sequences in regulating gene expression.
Collapse
Affiliation(s)
- Christina Akirtava
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
- RNA Bioscience Initiative, University of Colorado - Anshutz, Aurora, CO, 80045, USA
| | - Gemma May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - C Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| |
Collapse
|
11
|
Kim KQ, Nanjaraj Urs AN, Lasehinde V, Greenlaw AC, Hudson BH, Zaher HS. eIF4F complex dynamics are important for the activation of the integrated stress response. Mol Cell 2024; 84:2135-2151.e7. [PMID: 38848692 PMCID: PMC11189614 DOI: 10.1016/j.molcel.2024.04.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 12/08/2023] [Accepted: 04/19/2024] [Indexed: 06/09/2024]
Abstract
In response to stress, eukaryotes activate the integrated stress response (ISR) via phosphorylation of eIF2α to promote the translation of pro-survival effector genes, such as GCN4 in yeast. Complementing the ISR is the target of rapamycin (TOR) pathway, which regulates eIF4E function. Here, we probe translational control in the absence of eIF4E in Saccharomyces cerevisiae. Intriguingly, we find that loss of eIF4E leads to de-repression of GCN4 translation. In addition, we find that de-repression of GCN4 translation is accompanied by neither eIF2α phosphorylation nor reduction in initiator ternary complex (TC). Our data suggest that when eIF4E levels are depleted, GCN4 translation is de-repressed via a unique mechanism that may involve faster scanning by the small ribosome subunit due to increased local concentration of eIF4A. Overall, our findings suggest that relative levels of eIF4F components are key to ribosome dynamics and may play important roles in translational control of gene expression.
Collapse
Affiliation(s)
- Kyusik Q Kim
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Victor Lasehinde
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Alison C Greenlaw
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Benjamin H Hudson
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Hani S Zaher
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA.
| |
Collapse
|
12
|
Nigrovic PA, Wang Q, Kim T, Martinez-Bonet M, Aguiar VRC, Sim S, Cui J, Sparks JA, Chen X, Todd M, Wauford B, Marion MC, Langefeld CD, Weirauch MT, Gutierrez-Arcelus M. High-throughput identification of functional regulatory SNPs in systemic lupus erythematosus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.16.553538. [PMID: 37645953 PMCID: PMC10462027 DOI: 10.1101/2023.08.16.553538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Genome-wide association studies implicate multiple loci in risk for systemic lupus erythematosus (SLE), but few contain exonic variants, rendering systematic identification of non-coding variants essential to decoding SLE genetics. We utilized SNP-seq and bioinformatic enrichment to interrogate 2180 single-nucleotide polymorphisms (SNPs) from 87 SLE risk loci for potential binding of transcription factors and related proteins from B cells. 52 SNPs that passed initial screening were tested by electrophoretic mobility shift and luciferase reporter assays. To validate the approach, we studied rs2297550 in detail, finding that the risk allele enhanced binding to the transcription factor Ikaros (IKZF1), thereby modulating expression of IKBKE. Correspondingly, primary cells from genotyped healthy donors bearing the risk allele expressed higher levels of the interferon / NF-κB regulator IKKϵ. Together, these findings define a set of likely functional non-coding lupus risk variants and identify a new regulatory pathway involving rs2297550, Ikaros, and IKKϵ implicated by human genetics in risk for SLE.
Collapse
|
13
|
Wang H, Chen M, Zhang D, Meng X, Yan J, Chu J, Li J, Yu H. Shaping rice Green Revolution traits by engineering ATG immediate upstream 5'-UTR sequences of OsSBI and OsHTD1. PLANT BIOTECHNOLOGY JOURNAL 2024; 22:532-534. [PMID: 37996983 PMCID: PMC10893934 DOI: 10.1111/pbi.14235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 09/29/2023] [Accepted: 11/04/2023] [Indexed: 11/25/2023]
Affiliation(s)
- Hongwen Wang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
| | - Mingjiang Chen
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
| | - Dahan Zhang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
- University of Chinese Academy of SciencesBeijingChina
| | - Xiangbing Meng
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
| | - Jijun Yan
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
| | - Jinfang Chu
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
- University of Chinese Academy of SciencesBeijingChina
| | - Jiayang Li
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
- University of Chinese Academy of SciencesBeijingChina
- Yazhouwan National LaboratorySanyaChina
| | - Hong Yu
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental BiologyChinese Academy of SciencesBeijingChina
- University of Chinese Academy of SciencesBeijingChina
| |
Collapse
|
14
|
Hernández G, García A, Weingarten-Gabbay S, Mishra R, Hussain T, Amiri M, Moreno-Hagelsieb G, Montiel-Dávalos A, Lasko P, Sonenberg N. Functional analysis of the AUG initiator codon context reveals novel conserved sequences that disfavor mRNA translation in eukaryotes. Nucleic Acids Res 2024; 52:1064-1079. [PMID: 38038264 PMCID: PMC10853783 DOI: 10.1093/nar/gkad1152] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 11/09/2023] [Accepted: 11/15/2023] [Indexed: 12/02/2023] Open
Abstract
mRNA translation is a fundamental process for life. Selection of the translation initiation site (TIS) is crucial, as it establishes the correct open reading frame for mRNA decoding. Studies in vertebrate mRNAs discovered that a purine at -3 and a G at +4 (where A of the AUG initiator codon is numbered + 1), promote TIS recognition. However, the TIS context in other eukaryotes has been poorly experimentally analyzed. We analyzed in vitro the influence of the -3, -2, -1 and + 4 positions of the TIS context in rabbit, Drosophila, wheat, and yeast. We observed that -3A conferred the best translational efficiency across these species. However, we found variability at the + 4 position for optimal translation. In addition, the Kozak motif that was defined from mammalian cells was only weakly predictive for wheat and essentially non-predictive for yeast. We discovered eight conserved sequences that significantly disfavored translation. Due to the big differences in translational efficiency observed among weak TIS context sequences, we define a novel category that we termed 'barren AUG context sequences (BACS)', which represent sequences disfavoring translation. Analysis of mRNA-ribosomal complexes structures provided insights into the function of BACS. The gene ontology of the BACS-containing mRNAs is presented.
Collapse
Affiliation(s)
- Greco Hernández
- mRNA and Cancer Laboratory, Unit of Biomedical Research on Cancer, National Institute of Cancer (INCan), Mexico City 14080, Mexico
| | - Alejandra García
- mRNA and Cancer Laboratory, Unit of Biomedical Research on Cancer, National Institute of Cancer (INCan), Mexico City 14080, Mexico
| | - Shira Weingarten-Gabbay
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Laboratory of Virology and Infectious Disease, The Rockefeller University, New York, NY, USA
| | - Rishi Kumar Mishra
- Department of Developmental Biology and Genetics, Indian Institute of Science, Bengaluru-560012, India
| | - Tanweer Hussain
- Department of Developmental Biology and Genetics, Indian Institute of Science, Bengaluru-560012, India
| | - Mehdi Amiri
- Department of Biochemistry and Goodman Cancer Institute. McGill University., Montreal, QC H3A 1A3, Canada
| | - Gabriel Moreno-Hagelsieb
- Department of Biology, Wilfrid Laurier University. 75 University Ave. W, Waterloo, ON N2L 3C5, Canada
| | - Angélica Montiel-Dávalos
- mRNA and Cancer Laboratory, Unit of Biomedical Research on Cancer, National Institute of Cancer (INCan), Mexico City 14080, Mexico
| | - Paul Lasko
- Department of Biology, McGill University. Montreal, QC H3G 0B1, Canada
| | - Nahum Sonenberg
- Department of Biochemistry and Goodman Cancer Institute. McGill University., Montreal, QC H3A 1A3, Canada
| |
Collapse
|
15
|
Truong NH, T. Vo N, Nguyen BT, Huynh ST, Nguyen HD. Analyzing the correlation between protein expression and sequence-related features of mRNA and protein in Escherichia coli K-12 MG1655 model. PLoS One 2024; 19:e0288526. [PMID: 38324614 PMCID: PMC10849221 DOI: 10.1371/journal.pone.0288526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/27/2023] [Indexed: 02/09/2024] Open
Abstract
It was necessary to have a tool that could predict the amount of protein and optimize the gene sequences to produce recombinant proteins efficiently. The Transim model published by Tuller et al. in 2018 can calculate the translation rate in E. coli using features on the mRNA sequence, achieving a Spearman correlation with the amount of protein per mRNA of 0.36 when tested on the dataset of operons' first genes in E. coli K-12 MG1655 genome. However, this Spearman correlation was not high, and the model did not fully consider the features of mRNA and protein sequences. Therefore, to enhance the prediction capability, our study firstly tried expanding the testing dataset, adding genes inside the operon, and using the microarray of the mRNA expression data set, thereby helping to improve the correlation of translation rate with the amount of protein with more than 0.42. Next, the applicability of 6 traditional machine learning models to calculate a "new translation rate" was examined using initiation rate and elongation rate as inputs. The result showed that the SVR algorithm had the most correlated new translation rates, with Spearman correlation improving to R = 0.6699 with protein level output and to R = 0.6536 with protein level per mRNA. Finally, the study investigated the degree of improvement when combining more features with the new translation rates. The results showed that the model's predictive ability to produce a protein per mRNA reached R = 0.6660 when using six features, while the correlation of this model's final translation rate to protein level was up to R = 0.6729. This demonstrated the model's capability to predict protein expression of a gene, rather than being limited to predicting expression by an mRNA and showed the model's potential for development into gene expression predicting tools.
Collapse
Affiliation(s)
- Nhat H.M. Truong
- Center for Bioscience and Biotechnology, University of Science, Ho Chi Minh City, Vietnam
- Vietnam National University, Ho Chi Minh City, Vietnam
| | - Nam T. Vo
- Center for Bioscience and Biotechnology, University of Science, Ho Chi Minh City, Vietnam
- Vietnam National University, Ho Chi Minh City, Vietnam
- Laboratory of Molecular Biotechnology, University of Science, Ho Chi Minh City, Vietnam
| | - Binh T. Nguyen
- Vietnam National University, Ho Chi Minh City, Vietnam
- Department of Computer Science, University of Science, Ho Chi Minh City, Vietnam
| | - Son T. Huynh
- Vietnam National University, Ho Chi Minh City, Vietnam
- Department of Computer Science, University of Science, Ho Chi Minh City, Vietnam
| | - Hoang D. Nguyen
- Center for Bioscience and Biotechnology, University of Science, Ho Chi Minh City, Vietnam
- Vietnam National University, Ho Chi Minh City, Vietnam
| |
Collapse
|
16
|
Cautereels C, Smets J, Bircham P, De Ruysscher D, Zimmermann A, De Rijk P, Steensels J, Gorkovskiy A, Masschelein J, Verstrepen KJ. Combinatorial optimization of gene expression through recombinase-mediated promoter and terminator shuffling in yeast. Nat Commun 2024; 15:1112. [PMID: 38326309 PMCID: PMC10850122 DOI: 10.1038/s41467-024-44997-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/12/2024] [Indexed: 02/09/2024] Open
Abstract
Microbes are increasingly employed as cell factories to produce biomolecules. This often involves the expression of complex heterologous biosynthesis pathways in host strains. Achieving maximal product yields and avoiding build-up of (toxic) intermediates requires balanced expression of every pathway gene. However, despite progress in metabolic modeling, the optimization of gene expression still heavily relies on trial-and-error. Here, we report an approach for in vivo, multiplexed Gene Expression Modification by LoxPsym-Cre Recombination (GEMbLeR). GEMbLeR exploits orthogonal LoxPsym sites to independently shuffle promoter and terminator modules at distinct genomic loci. This approach facilitates creation of large strain libraries, in which expression of every pathway gene ranges over 120-fold and each strain harbors a unique expression profile. When applied to the biosynthetic pathway of astaxanthin, an industrially relevant antioxidant, a single round of GEMbLeR improved pathway flux and doubled production titers. Together, this shows that GEMbLeR allows rapid and efficient gene expression optimization in heterologous biosynthetic pathways, offering possibilities for enhancing the performance of microbial cell factories.
Collapse
Affiliation(s)
- Charlotte Cautereels
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Jolien Smets
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Peter Bircham
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Dries De Ruysscher
- Molecular Biotechnology of Plants and Micro-organisms, Department of Biology, KU Leuven, Kasteelpark Arenberg 31, box 2438, Leuven, 3001, Belgium
- Laboratory for Biomolecular Discovery & Engineering, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
| | - Anna Zimmermann
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Peter De Rijk
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Antwerp, 2610, Belgium
- Neuromics Support Facility, Department of Biomedical Sciences, University of Antwerp, Antwerp, 2610, Belgium
| | - Jan Steensels
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Anton Gorkovskiy
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium
| | - Joleen Masschelein
- Molecular Biotechnology of Plants and Micro-organisms, Department of Biology, KU Leuven, Kasteelpark Arenberg 31, box 2438, Leuven, 3001, Belgium
- Laboratory for Biomolecular Discovery & Engineering, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium
| | - Kevin J Verstrepen
- VIB Laboratory for Systems Biology, VIB-KU Leuven Center for Microbiology, Leuven, 3001, Belgium.
- Laboratory of Genetics and Genomics, Center of Microbial and Plant Genetics, Department M2S, KU Leuven, Gaston Geenslaan 1, Leuven, 3001, Belgium.
| |
Collapse
|
17
|
Lobel JH, Ingolia NT. Defining the mechanisms and properties of post-transcriptional regulatory disordered regions by high-throughput functional profiling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.01.578453. [PMID: 38370681 PMCID: PMC10871298 DOI: 10.1101/2024.02.01.578453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Disordered regions within RNA binding proteins are required to control mRNA decay and protein synthesis. To understand how these disordered regions modulate gene expression, we surveyed regulatory activity across the entire disordered proteome using a high-throughput functional assay. We identified hundreds of regulatory sequences within intrinsically disordered regions and demonstrate how these elements cooperate with core mRNA decay machinery to promote transcript turnover. Coupling high-throughput functional profiling with mutational scanning revealed diverse molecular features, ranging from defined motifs to overall sequence composition, underlying the regulatory effects of disordered peptides. Machine learning analysis implicated aromatic residues in particular contexts as critical determinants of repressor activity, consistent with their roles in forming protein-protein interactions with downstream effectors. Our results define the molecular principles and biochemical mechanisms that govern post-transcriptional gene regulation by disordered regions and exemplify the encoding of diverse yet specific functions in the absence of well-defined structure.
Collapse
Affiliation(s)
- Joseph H Lobel
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Nicholas T Ingolia
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
- Lead contact
| |
Collapse
|
18
|
Zeng J, Song K, Wang J, Wen H, Zhou J, Ni T, Lu H, Yu Y. Characterization and optimization of 5´ untranslated region containing poly-adenine tracts in Kluyveromyces marxianus using machine-learning model. Microb Cell Fact 2024; 23:7. [PMID: 38172836 PMCID: PMC10763412 DOI: 10.1186/s12934-023-02271-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 12/12/2023] [Indexed: 01/05/2024] Open
Abstract
BACKGROUND The 5´ untranslated region (5´ UTR) plays a key role in regulating translation efficiency and mRNA stability, making it a favored target in genetic engineering and synthetic biology. A common feature found in the 5´ UTR is the poly-adenine (poly(A)) tract. However, the effect of 5´ UTR poly(A) on protein production remains controversial. Machine-learning models are powerful tools for explaining the complex contributions of features, but models incorporating features of 5´ UTR poly(A) are currently lacking. Thus, our goal is to construct such a model, using natural 5´ UTRs from Kluyveromyces marxianus, a promising cell factory for producing heterologous proteins. RESULTS We constructed a mini-library consisting of 207 5´ UTRs harboring poly(A) and 34 5´ UTRs without poly(A) from K. marxianus. The effects of each 5´ UTR on the production of a GFP reporter were evaluated individually in vivo, and the resulting protein abundance spanned an approximately 450-fold range throughout. The data were used to train a multi-layer perceptron neural network (MLP-NN) model that incorporated the length and position of poly(A) as features. The model exhibited good performance in predicting protein abundance (average R2 = 0.7290). The model suggests that the length of poly(A) is negatively correlated with protein production, whereas poly(A) located between 10 and 30 nt upstream of the start codon (AUG) exhibits a weak positive effect on protein abundance. Using the model as guidance, the deletion or reduction of poly(A) upstream of 30 nt preceding AUG tended to improve the production of GFP and a feruloyl esterase. Deletions of poly(A) showed inconsistent effects on mRNA levels, suggesting that poly(A) represses protein production either with or without reducing mRNA levels. CONCLUSION The effects of poly(A) on protein production depend on its length and position. Integrating poly(A) features into machine-learning models improves simulation accuracy. Deleting or reducing poly(A) upstream of 30 nt preceding AUG tends to enhance protein production. This optimization strategy can be applied to enhance the yield of K. marxianus and other microbial cell factories.
Collapse
Affiliation(s)
- Junyuan Zeng
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Kunfeng Song
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Jingqi Wang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Haimei Wen
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Jungang Zhou
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Ting Ni
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Hong Lu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China
| | - Yao Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.
- Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, 200438, China.
| |
Collapse
|
19
|
Zheng W, Fong JHC, Wan YK, Chu AHY, Huang Y, Wong ASL, Ho JWK. Discovery of regulatory motifs in 5' untranslated regions using interpretable multi-task learning models. Cell Syst 2023; 14:1103-1112.e6. [PMID: 38016465 DOI: 10.1016/j.cels.2023.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 09/18/2023] [Accepted: 10/31/2023] [Indexed: 11/30/2023]
Abstract
The sequence in the 5' untranslated regions (UTRs) is known to affect mRNA translation rates. However, the underlying regulatory grammar remains elusive. Here, we propose MTtrans, a multi-task translation rate predictor capable of learning common sequence patterns from datasets across various experimental techniques. The core premise is that common motifs are more likely to be genuinely involved in translation control. MTtrans outperforms existing methods in both accuracy and the ability to capture transferable motifs across species, highlighting its strength in identifying evolutionarily conserved sequence motifs. Our independent fluorescence-activated cell sorting coupled with deep sequencing (FACS-seq) experiment validates the impact of most motifs identified by MTtrans. Additionally, we introduce "GRU-rewiring," a technique to interpret the hidden states of the recurrent units. Gated recurrent unit (GRU)-rewiring allows us to identify regulatory element-enriched positions and examine the local effects of 5' UTR mutations. MTtrans is a powerful tool for deciphering the translation regulatory motifs.
Collapse
Affiliation(s)
- Weizhong Zheng
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - John H C Fong
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Yuk Kei Wan
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Athena H Y Chu
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for Oncology and Immunology, Hong Kong Science Park, Hong Kong SAR, China
| | - Yuanhua Huang
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China; Center for Translational Stem Cell Biology, Hong Kong Science and Technology Park, Hong Kong SAR, China
| | - Alan S L Wong
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for Oncology and Immunology, Hong Kong Science Park, Hong Kong SAR, China; Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong SAR, China
| | - Joshua W K Ho
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Laboratory of Data Discovery for Health (D24H) Limited, Hong Kong Science Park, Hong Kong SAR, China.
| |
Collapse
|
20
|
Irshad IU, Sharma AK. Decoding stoichiometric protein synthesis in E. coli through translation rate parameters. BIOPHYSICAL REPORTS 2023; 3:100131. [PMID: 37789867 PMCID: PMC10542608 DOI: 10.1016/j.bpr.2023.100131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 09/11/2023] [Indexed: 10/05/2023]
Abstract
E. coli is one of the most widely used organisms for understanding the principles of cellular and molecular genetics. However, we are yet to understand the origin of several experimental observations related to the regulation of gene expression in E. coli. One of the prominent examples in this context is the proportional synthesis in multiprotein complexes where all of their obligate subunits are produced in proportion to their stoichiometry. In this work, by combining the next-generation sequencing data with the stochastic simulations of protein synthesis, we explain the origin of proportional protein synthesis in multicomponent complexes. We find that the estimated initiation rates for the translation of all subunits in those complexes are proportional to their stoichiometry. This constraint on protein synthesis kinetics enforces proportional protein synthesis without requiring any feedback mechanism. We also find that the translation initiation rates in E. coli are influenced by the coding sequence length and the enrichment of A and C nucleotides near the start codon. Thus, this study rationalizes the role of conserved and nonrandom features of genes in regulating the translation kinetics and unravels a key principle of the regulation of protein synthesis.
Collapse
Affiliation(s)
| | - Ajeet K. Sharma
- Department of Physics, Indian Institute of Technology Jammu, Jammu, India
- Department of Biosciences and Bioengineering, Indian Institute of Technology Jammu, Jammu, India
| |
Collapse
|
21
|
Reimão-Pinto MM, Castillo-Hair SM, Seelig G, Schier AF. The regulatory landscape of 5' UTRs in translational control during zebrafish embryogenesis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.23.568470. [PMID: 38045294 PMCID: PMC10690280 DOI: 10.1101/2023.11.23.568470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The 5' UTRs of mRNAs are critical for translation regulation, but their in vivo regulatory features are poorly characterized. Here, we report the regulatory landscape of 5' UTRs during early zebrafish embryogenesis using a massively parallel reporter assay of 18,154 sequences coupled to polysome profiling. We found that the 5' UTR is sufficient to confer temporal dynamics to translation initiation, and identified 86 motifs enriched in 5' UTRs with distinct ribosome recruitment capabilities. A quantitative deep learning model, DaniO5P, revealed a combined role for 5' UTR length, translation initiation site context, upstream AUGs and sequence motifs on in vivo ribosome recruitment. DaniO5P predicts the activities of 5' UTR isoforms and indicates that modulating 5' UTR length and motif grammar contributes to translation initiation dynamics. This study provides a first quantitative model of 5' UTR-based translation regulation in early vertebrate development and lays the foundation for identifying the underlying molecular effectors.
Collapse
Affiliation(s)
| | - Sebastian M Castillo-Hair
- Department of Electrical & Computer Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Georg Seelig
- Department of Electrical & Computer Engineering, University of Washington, Seattle, Washington 98195, United States
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Alex F Schier
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Allen Discovery Center for Cell Lineage Tracing, Seattle, Washington 98195, United States
| |
Collapse
|
22
|
Zhou H, Li W, Bai L, Wang J, Luo Y, Li S, Hickford JGH. Ovine KRTAP36-2: A New Keratin-Associated Protein Gene Related to Variation in Wool Yield. Genes (Basel) 2023; 14:2045. [PMID: 38002988 PMCID: PMC10671549 DOI: 10.3390/genes14112045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 10/19/2023] [Accepted: 11/03/2023] [Indexed: 11/26/2023] Open
Abstract
Keratin-associated proteins (KAPs) are structural components of wool fibres. High-glycine/tyrosine (HGT)-KAPs are a subset of the KAP family, and their abundance in fibres varies. In this study, we report the discovery of an ovine HGT-KAP gene to which we assigned the name KRTAP36-2. Polymerase chain reaction and single-strand conformation polymorphism (PCR-SSCP) analyses revealed four variants of this gene in a screening population of 170 sheep from a variety of breeds. The DNA sequencing of the variants revealed four single-nucleotide polymorphisms (SNPs) and a dinucleotide deletion. Three of these SNPs were in the coding region, and one of these was non-synonymous and potentially led to the amino acid substitution p.Cys27Gly near the middle of the protein. The remaining SNP was located near the putative TATA box, and the di-nucleotide deletion was near the putative transcription initiation site. The effect of this variation in KRTAP36-2 was investigated in 274 Southdown × Merino lambs that were the progeny of five sires. Variation was only found to be associated with wool yield, that is, the proportion of the greasy fleece that remained as clean fleece upon scouring (expressed as a percentage). This may have some value in increasing wool production.
Collapse
Affiliation(s)
- Huitong Zhou
- International Wool Research Institute, Gansu Agricultural University, Lanzhou 730070, China; (H.Z.); (J.W.); (Y.L.)
- Gene-Marker Laboratory, Faculty of Agricultural and Life Sciences, Lincoln University, Lincoln 7647, New Zealand;
| | - Wenhao Li
- Plateau Livestock Genetic Resources Protection and Innovative Utilization Key Laboratory of Qinghai Province, Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Qinghai Academy of Animal Science and Veterinary Medicine, Qinghai University, Xining 810016, China;
| | - Lingrong Bai
- Gene-Marker Laboratory, Faculty of Agricultural and Life Sciences, Lincoln University, Lincoln 7647, New Zealand;
| | - Jiqing Wang
- International Wool Research Institute, Gansu Agricultural University, Lanzhou 730070, China; (H.Z.); (J.W.); (Y.L.)
- Gansu Key Laboratory of Herbivorous Animal Biotechnology, College of Animal Science and Technology, Gansu Agricultural University, Lanzhou 730070, China
| | - Yuzhu Luo
- International Wool Research Institute, Gansu Agricultural University, Lanzhou 730070, China; (H.Z.); (J.W.); (Y.L.)
- Gansu Key Laboratory of Herbivorous Animal Biotechnology, College of Animal Science and Technology, Gansu Agricultural University, Lanzhou 730070, China
| | - Shaobin Li
- International Wool Research Institute, Gansu Agricultural University, Lanzhou 730070, China; (H.Z.); (J.W.); (Y.L.)
- Gansu Key Laboratory of Herbivorous Animal Biotechnology, College of Animal Science and Technology, Gansu Agricultural University, Lanzhou 730070, China
| | - Jonathan G. H. Hickford
- International Wool Research Institute, Gansu Agricultural University, Lanzhou 730070, China; (H.Z.); (J.W.); (Y.L.)
- Gene-Marker Laboratory, Faculty of Agricultural and Life Sciences, Lincoln University, Lincoln 7647, New Zealand;
| |
Collapse
|
23
|
Drehmann P, Milanos S, Schaefer N, Kasaragod VB, Herterich S, Holzbach-Eberle U, Harvey RJ, Villmann C. Dual Role of Dysfunctional Asc-1 Transporter in Distinct Human Pathologies, Human Startle Disease, and Developmental Delay. eNeuro 2023; 10:ENEURO.0263-23.2023. [PMID: 37903619 PMCID: PMC10668224 DOI: 10.1523/eneuro.0263-23.2023] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 09/11/2023] [Indexed: 11/01/2023] Open
Abstract
Human startle disease is associated with mutations in distinct genes encoding glycine receptors, transporters or interacting proteins at glycinergic synapses in spinal cord and brainstem. However, a significant number of diagnosed patients does not carry a mutation in the common genes GLRA1, GLRB, and SLC6A5 Recently, studies on solute carrier 7 subfamily 10 (SLC7A10; Asc-1, alanine-serine-cysteine transporter) knock-out (KO) mice displaying a startle disease-like phenotype hypothesized that this transporter might represent a novel candidate for human startle disease. Here, we screened 51 patients from our patient cohort negative for the common genes and found three exonic (one missense, two synonymous), seven intronic, and single nucleotide changes in the 5' and 3' untranslated regions (UTRs) in Asc-1. The identified missense mutation Asc-1G307R from a patient with startle disease and developmental delay was investigated in functional studies. At the molecular level, the mutation Asc-1G307R did not interfere with cell-surface expression, but disrupted glycine uptake. Substitution of glycine at position 307 to other amino acids, e.g., to alanine or tryptophan did not affect trafficking or glycine transport. By contrast, G307K disrupted glycine transport similar to the G307R mutation found in the patient. Structurally, the disrupted function in variants carrying positively charged residues can be explained by local structural rearrangements because of the large positively charged side chain. Thus, our data suggest that SLC7A10 may represent a rare but novel gene associated with human startle disease and developmental delay.
Collapse
Affiliation(s)
- Paul Drehmann
- Institute for Clinical Neurobiology, Julius Maximilians University of Würzburg, 97078 Würzburg, Germany
| | - Sinem Milanos
- Institute for Clinical Neurobiology, Julius Maximilians University of Würzburg, 97078 Würzburg, Germany
| | - Natascha Schaefer
- Institute for Clinical Neurobiology, Julius Maximilians University of Würzburg, 97078 Würzburg, Germany
| | - Vikram Babu Kasaragod
- Neurobiology Division, Medical Reserach Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| | - Sarah Herterich
- Institute for Clinical Neurobiology, Julius Maximilians University of Würzburg, 97078 Würzburg, Germany
| | - Ulrike Holzbach-Eberle
- Center for Pediatrics and Adolescent Medicine, Pediatric Neurology, Social Pediatrics and Epileptology, University Hospital Gießen, 35392 Giessen, Germany
| | - Robert J Harvey
- School of Health, University of the Sunshine Coast, Sippy Downs, QLD 4558, Australia
- Sunshine Coast Health Institute, Birtinya, QLD 4575, Australia
| | - Carmen Villmann
- Institute for Clinical Neurobiology, Julius Maximilians University of Würzburg, 97078 Würzburg, Germany
| |
Collapse
|
24
|
Zhu YJ, Liao ML, Dong YW. Exploring the adaptability of the secondary structure of mRNA to temperature in intertidal snails based on SHAPE experiments. J Exp Biol 2023; 226:jeb246544. [PMID: 37767692 DOI: 10.1242/jeb.246544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 09/22/2023] [Indexed: 09/29/2023]
Abstract
RNA-based thermal regulation is an important strategy for organisms to cope with temperature changes. Inhabiting the intertidal rocky shore, a key interface of the ocean, atmosphere and terrestrial environments, intertidal species have developed variable thermal adaptation mechanisms; however, adaptions at the RNA level remain largely uninvestigated. To examine the relationship between mRNA structural stability and species distribution, in the present study, the secondary structure of cytosolic malate dehydrogenase (cMDH) mRNA of Echinolittorina malaccana, Echinolittorina radiata and Littorina brevicula was determined using selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE), and the change in folding free energy of formation (ΔGfold) was calculated. The results showed that ΔGfold increased as the temperature increased. The difference in ΔGfold (ΔΔGfold) between two specific temperatures (25 versus 0°C, 37 versus 0°C and 57 versus 0°C) differed among the three species, and the ΔΔGfold value of E. malaccana was significantly lower than those of E. radiata and L. brevicula. The number of stems of cMDH mRNA of the snails decreased with increasing temperature, and the breakpoint temperature of E. malaccana was the highest among these. The number of loops was also reduced with increasing temperature, while the length of the loop structure increased accordingly. Consequently, these structural changes can potentially affect the translational efficiency of mRNA. These results imply that there were interspecific differences in the thermal stability of RNA secondary structures in intertidal snails, and these differences may be related to snail distribution.
Collapse
Affiliation(s)
- Ya-Jie Zhu
- The Key Laboratory of Mariculture, Ministry of Education, Fisheries College, Ocean University of China, Qingdao 266003, PR China
| | - Ming-Ling Liao
- The Key Laboratory of Mariculture, Ministry of Education, Fisheries College, Ocean University of China, Qingdao 266003, PR China
| | - Yun-Wei Dong
- The Key Laboratory of Mariculture, Ministry of Education, Fisheries College, Ocean University of China, Qingdao 266003, PR China
| |
Collapse
|
25
|
Kershaw CJ, Nelson MG, Castelli LM, Jennings MD, Lui J, Talavera D, Grant CM, Pavitt GD, Hubbard SJ, Ashe MP. Translation factor and RNA binding protein mRNA interactomes support broader RNA regulons for posttranscriptional control. J Biol Chem 2023; 299:105195. [PMID: 37633333 PMCID: PMC10562868 DOI: 10.1016/j.jbc.2023.105195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/18/2023] [Accepted: 08/20/2023] [Indexed: 08/28/2023] Open
Abstract
The regulation of translation provides a rapid and direct mechanism to modulate the cellular proteome. In eukaryotes, an established model for the recruitment of ribosomes to mRNA depends upon a set of conserved translation initiation factors. Nevertheless, how cells orchestrate and define the selection of individual mRNAs for translation, as opposed to other potential cytosolic fates, is poorly understood. We have previously found significant variation in the interaction between individual mRNAs and an array of translation initiation factors. Indeed, mRNAs can be separated into different classes based upon these interactions to provide a framework for understanding different modes of translation initiation. Here, we extend this approach to include new mRNA interaction profiles for additional proteins involved in shaping the cytoplasmic fate of mRNAs. This work defines a set of seven mRNA clusters, based on their interaction profiles with 12 factors involved in translation and/or RNA binding. The mRNA clusters share both physical and functional characteristics to provide a rationale for the interaction profiles. Moreover, a comparison with mRNA interaction profiles from a host of RNA binding proteins suggests that there are defined patterns in the interactions of functionally related mRNAs. Therefore, this work defines global cytoplasmic mRNA binding modules that likely coordinate the synthesis of functionally related proteins.
Collapse
Affiliation(s)
- Christopher J Kershaw
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - Michael G Nelson
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - Lydia M Castelli
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - Martin D Jennings
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - Jennifer Lui
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - David Talavera
- Division of Cardiovascular Sciences, School of Medical Sciences, The University of Manchester, Manchester, UK
| | - Chris M Grant
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK
| | - Graham D Pavitt
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK.
| | - Simon J Hubbard
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK.
| | - Mark P Ashe
- Division of Molecular and Cellular Function, School of Biological Sciences, The University of Manchester, Manchester, UK.
| |
Collapse
|
26
|
Kleinschmidt H, Xu C, Bai L. Using Synthetic DNA Libraries to Investigate Chromatin and Gene Regulation. Chromosoma 2023; 132:167-189. [PMID: 37184694 PMCID: PMC10542970 DOI: 10.1007/s00412-023-00796-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/25/2023] [Accepted: 04/26/2023] [Indexed: 05/16/2023]
Abstract
Despite the recent explosion in genome-wide studies in chromatin and gene regulation, we are still far from extracting a set of genetic rules that can predict the function of the regulatory genome. One major reason for this deficiency is that gene regulation is a multi-layered process that involves an enormous variable space, which cannot be fully explored using native genomes. This problem can be partially solved by introducing synthetic DNA libraries into cells, a method that can test the regulatory roles of thousands to millions of sequences with limited variables. Here, we review recent applications of this method to study transcription factor (TF) binding, nucleosome positioning, and transcriptional activity. We discuss the design principles, experimental procedures, and major findings from these studies and compare the pros and cons of different approaches.
Collapse
Affiliation(s)
- Holly Kleinschmidt
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, 16802, USA
- Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Cheng Xu
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, 16802, USA
- Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Lu Bai
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, 16802, USA.
- Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA, 16802, USA.
- Department of Physics, The Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
27
|
Das S, Singh A, Shah P. Evaluating single-cell variability in proteasomal decay. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.22.554358. [PMID: 37662347 PMCID: PMC10473619 DOI: 10.1101/2023.08.22.554358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Gene expression is a stochastic process that leads to variability in mRNA and protein abundances even within an isogenic population of cells grown in the same environment. This variation, often called gene-expression noise, has typically been attributed to transcriptional and translational processes while ignoring the contributions of protein decay variability across cells. Here we estimate the single-cell protein decay rates of two degron GFPs in Saccharomyces cerevisiae using time-lapse microscopy. We find substantial cell-to-cell variability in the decay rates of the degron GFPs. We evaluate cellular features that explain the variability in the proteasomal decay and find that the amount of 20s catalytic beta subunit of the proteasome marginally explains the observed variability in the degron GFP half-lives. We propose alternate hypotheses that might explain the observed variability in the decay of the two degron GFPs. Overall, our study highlights the importance of studying the kinetics of the decay process at single-cell resolution and that decay rates vary at the single-cell level, and that the decay process is stochastic. A complex model of decay dynamics must be included when modeling stochastic gene expression to estimate gene expression noise.
Collapse
Affiliation(s)
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, Biomedical Engineering, University of Delaware
| | | |
Collapse
|
28
|
May GE, Akirtava C, Agar-Johnson M, Micic J, Woolford J, McManus J. Unraveling the influences of sequence and position on yeast uORF activity using massively parallel reporter systems and machine learning. eLife 2023; 12:e69611. [PMID: 37227054 PMCID: PMC10259493 DOI: 10.7554/elife.69611] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 05/24/2023] [Indexed: 05/26/2023] Open
Abstract
Upstream open-reading frames (uORFs) are potent cis-acting regulators of mRNA translation and nonsense-mediated decay (NMD). While both AUG- and non-AUG initiated uORFs are ubiquitous in ribosome profiling studies, few uORFs have been experimentally tested. Consequently, the relative influences of sequence, structural, and positional features on uORF activity have not been determined. We quantified thousands of yeast uORFs using massively parallel reporter assays in wildtype and ∆upf1 yeast. While nearly all AUG uORFs were robust repressors, most non-AUG uORFs had relatively weak impacts on expression. Machine learning regression modeling revealed that both uORF sequences and locations within transcript leaders predict their effect on gene expression. Indeed, alternative transcription start sites highly influenced uORF activity. These results define the scope of natural uORF activity, identify features associated with translational repression and NMD, and suggest that the locations of uORFs in transcript leaders are nearly as predictive as uORF sequences.
Collapse
Affiliation(s)
- Gemma E May
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
| | - Christina Akirtava
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
| | - Matthew Agar-Johnson
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
| | - Jelena Micic
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
| | - John Woolford
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
| | - Joel McManus
- Department of Biological Sciences, Carnegie Mellon UniversityPittsburghUnited States
- Computational Biology Department, Carnegie Mellon UniversityPittsburghUnited States
| |
Collapse
|
29
|
Li K, Kong J, Zhang S, Zhao T, Qian W. Distance-dependent inhibition of translation initiation by downstream out-of-frame AUGs is consistent with a Brownian ratchet process of ribosome scanning. Genome Biol 2022; 23:254. [PMID: 36510274 PMCID: PMC9743702 DOI: 10.1186/s13059-022-02829-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 12/01/2022] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Eukaryotic ribosomes are widely presumed to scan mRNA for the AUG codon to initiate translation in a strictly 5'-3' movement (i.e., strictly unidirectional scanning model), so that ribosomes initiate translation exclusively at the 5' proximal AUG codon (i.e., the first-AUG rule). RESULTS We generate 13,437 yeast variants, each with an ATG triplet placed downstream (dATGs) of the annotated ATG (aATG) codon of a green fluorescent protein. We find that out-of-frame dATGs can inhibit translation at the aATG, but with diminishing strength over increasing distance between aATG and dATG, undetectable beyond ~17 nt. This phenomenon is best explained by a Brownian ratchet mechanism of ribosome scanning, in which the ribosome uses small-amplitude 5'-3' and 3'-5' oscillations with a net 5'-3' movement to scan the AUG codon, thereby leading to competition for translation initiation between aAUG and a proximal dAUG. This scanning model further predicts that the inhibitory effect induced by an out-of-frame upstream AUG triplet (uAUG) will diminish as uAUG approaches aAUG, which is indeed observed among the 15,586 uATG variants generated in this study. Computational simulations suggest that each triplet is scanned back and forth approximately ten times until the ribosome eventually migrates to downstream regions. Moreover, this scanning process could constrain the evolution of sequences downstream of the aATG to minimize proximal out-of-frame dATG triplets in yeast and humans. CONCLUSIONS Collectively, our findings uncover the basic process by which eukaryotic ribosomes scan for initiation codons, and how this process could shape eukaryotic genome evolution.
Collapse
Affiliation(s)
- Ke Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
| | - Jinhui Kong
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Shuo Zhang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Tong Zhao
- Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Wenfeng Qian
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, 100101, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
30
|
Yang TH, Hsu CW, Wang YX, Yu CH, Rathod J, Tseng YY, Wu WS. YMLA: A comparative platform to carry out functional enrichment analysis for multiple gene lists in yeast. Comput Biol Med 2022; 151:106314. [PMID: 36455295 DOI: 10.1016/j.compbiomed.2022.106314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 10/23/2022] [Accepted: 11/13/2022] [Indexed: 11/16/2022]
Abstract
Comparative analysis among multiple gene lists on their functional features is now a routine task due to the advancement of high-throughput experiments. Several enrichment analysis tools were developed in the past. However, these tools mainly focus on one gene list and contain only gene ontology or interaction features. What makes it worse, comparative investigation and customized feature set reanalysis are still unavailable. Therefore, we constructed the YMLA (Yeast Multiple List Analyzer) platform in this research. YMLA includes 39 yeast features and facilitates comparative analysis among multiple gene lists via tabular views, heatmaps, and network plots. Moreover, the customized feature set reanalysis function was implemented in YMLA to help form mechanism hypotheses based on a selected enriched feature subset. We demonstrated the biological applicability of YMLA via example lists consisting of genes with top/bottom translation efficiency values. The analysis results provided by YMLA reveal novel facts consistent with previous experiments. YMLA is available at https://cosbi7.ee.ncku.edu.tw/YMLA/.
Collapse
Affiliation(s)
- Tzu-Hsien Yang
- Department of Biomedical Engineering, National Cheng Kung University, University Road, 701 Tainan, Taiwan.
| | - Chia-Wei Hsu
- Department of Electrical Engineering, National Cheng Kung University, University Road, 701 Tainan, Taiwan.
| | - Yan-Xiang Wang
- Department of Electrical Engineering, National Cheng Kung University, University Road, 701 Tainan, Taiwan.
| | - Chien-Hung Yu
- Department of Biochemistry and Molecular Biology, National Cheng Kung University, University Road, 701 Tainan, Taiwan.
| | - Jagat Rathod
- Department of Environmental Biotechnology, Gujarat Biotechnology University, Gujarat International Finance Tec (GIFT)-City, Gandhinagar 382355, Gujarat, India.
| | - Yan-Yuan Tseng
- Center for Molecular Medicine and Genetics, Wayne State University, School of Medicine, Detroit, MI 48201, USA.
| | - Wei-Sheng Wu
- Department of Electrical Engineering, National Cheng Kung University, University Road, 701 Tainan, Taiwan.
| |
Collapse
|
31
|
van den Elzen AMG, Watson MJ, Thoreen CC. mRNA 5' terminal sequences drive 200-fold differences in expression through effects on synthesis, translation and decay. PLoS Genet 2022; 18:e1010532. [PMID: 36441824 PMCID: PMC9731452 DOI: 10.1371/journal.pgen.1010532] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 12/08/2022] [Accepted: 11/15/2022] [Indexed: 11/30/2022] Open
Abstract
mRNA regulatory sequences control gene expression at multiple levels including translation initiation and mRNA decay. The 5' terminal sequences of mRNAs have unique regulatory potential because of their proximity to key post-transcriptional regulators. Here we have systematically probed the function of 5' terminal sequences in gene expression in human cells. Using a library of reporter mRNAs initiating with all possible 7-mer sequences at their 5' ends, we find an unexpected impact on transcription that underlies 200-fold differences in mRNA expression. Library sequences that promote high levels of transcription mirrored those found in native mRNAs and define two basic classes with similarities to classic Initiator (Inr) and TCT core promoter motifs. By comparing transcription, translation and decay rates, we identify sequences that are optimized for both efficient transcription and growth-regulated translation and stability, including variants of terminal oligopyrimidine (TOP) motifs. We further show that 5' sequences of endogenous mRNAs are enriched for multi-functional TCT/TOP hybrid sequences. Together, our results reveal how 5' sequences define two general classes of mRNAs with distinct growth-responsive profiles of expression across synthesis, translation and decay.
Collapse
Affiliation(s)
- Antonia M. G. van den Elzen
- Department of Cellular and Molecular Physiology, Yale School of Medicine, New Haven, Connecticut, United States of America
| | - Maegan J. Watson
- Department of Cellular and Molecular Physiology, Yale School of Medicine, New Haven, Connecticut, United States of America
| | - Carson C. Thoreen
- Department of Cellular and Molecular Physiology, Yale School of Medicine, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
32
|
5′ Untranslated mRNA Regions Allow Bypass of Host Cell Translation Inhibition by Legionella pneumophila. Infect Immun 2022; 90:e0017922. [DOI: 10.1128/iai.00179-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Legionella pneumophila
grows within membrane-bound vacuoles in alveolar macrophages during human disease. Pathogen manipulation of the host cell is driven by bacterial proteins translocated through a type IV secretion system (T4SS).
Collapse
|
33
|
Jiang S, Wang Y, Liu Q, Zhao Q, Gao L, Song X, Li X, Qu Y, Liu G. Genetic engineering and raising temperature enhance recombinant protein production with the cdna1 promoter in Trichoderma reesei. BIORESOUR BIOPROCESS 2022; 9:113. [PMID: 38647824 PMCID: PMC10991654 DOI: 10.1186/s40643-022-00607-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 10/20/2022] [Indexed: 11/10/2022] Open
Abstract
The fungus Trichoderma reesei is a powerful host for secreted production of proteins. The promoter of cdna1 gene, which encodes a small basic protein of unknown function and high expression, is commonly used for constitutive protein production in T. reesei. Nevertheless, the production level of proteins driven by this promoter still needs to be improved. Here, we identified that the region 600- to 700-bp upstream of the start codon is critical for the efficiency of the cdna1 promoter. Increasing the copy number of this region to three improved the production of a heterologous β-mannanase by 37.5%. Screening of several stressful conditions revealed that the cdna1 promoter is heat inducible. Cultivation at 37 °C significantly enhanced the production of β-mannanase as well as a polygalacturonase with the cdna1 promoter compared with those at 30 °C. Combing the strategies of promoter engineering, multi-copy gene insertion, and control of cultivation temperature, β-mannanase of 199.85 U/mL and relatively high purity was produced in shake flask, which was 6.6 times higher than that before optimization. Taken together, the results advance the understanding of the widely used cdna1 promoter and provide effective strategies for enhancing the production of recombinant proteins in T. reesei.
Collapse
Affiliation(s)
- Shanshan Jiang
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Yue Wang
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Qin Liu
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Qinqin Zhao
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Liwei Gao
- Tobacco Research Institute of Chinese Academy of Agricultural Sciences, 11 Keyuanjingsi Road, Qingdao, 266101, China.
| | - Xin Song
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Xuezhi Li
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Yinbo Qu
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China
| | - Guodong Liu
- State Key Laboratory of Microbial Technology, Shandong University, 72 Binhai Road, Qingdao, 266237, China.
| |
Collapse
|
34
|
Nicolet BP, Wolkers MC. The relationship of mRNA with protein expression in CD8+ T cells associates with gene class and gene characteristics. PLoS One 2022; 17:e0276294. [PMID: 36260607 PMCID: PMC9581405 DOI: 10.1371/journal.pone.0276294] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 10/04/2022] [Indexed: 11/19/2022] Open
Abstract
T cells are key players in our defence against infections and malignancies. When T cells differentiate or become activated, they undergo substantial alterations in gene expression. Even though RNA expression levels are now well documented throughout different stages of T cells, it is not well understood how mRNA expression translates into the protein landscape. By combining paired RNA sequencing and mass spectrometry data of primary human CD8+ T cells, we report that mRNA expression is a poor proxy for the overall protein output, irrespective of the differentiation or activation status. Yet, gene class stratification revealed a function-specific correlation of mRNA with protein expression. This gene class-specific expression pattern associated with differences in gene characteristics such as sequence conservation and untranslated region (UTR) lengths. In addition, the presence of AU-rich elements in the 3'UTR associated with alterations in mRNA and protein abundance T cell activation dependent, gene class-specific manner. In conclusion, our study highlights the role of gene characteristics as a determinant for gene expression in T cells.
Collapse
Affiliation(s)
- Benoît P. Nicolet
- Department of Hematopoiesis, Sanquin Research, Amsterdam, The Netherlands
- Landsteiner Laboratory, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
| | - Monika C. Wolkers
- Department of Hematopoiesis, Sanquin Research, Amsterdam, The Netherlands
- Landsteiner Laboratory, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
- * E-mail:
| |
Collapse
|
35
|
Bottorff TA, Park H, Geballe AP, Subramaniam AR. Translational buffering by ribosome stalling in upstream open reading frames. PLoS Genet 2022; 18:e1010460. [PMID: 36315596 PMCID: PMC9648851 DOI: 10.1371/journal.pgen.1010460] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 11/10/2022] [Accepted: 10/04/2022] [Indexed: 11/05/2022] Open
Abstract
Upstream open reading frames (uORFs) are present in over half of all human mRNAs. uORFs can potently regulate the translation of downstream open reading frames through several mechanisms: siphoning away scanning ribosomes, regulating re-initiation, and allowing interactions between scanning and elongating ribosomes. However, the consequences of these different mechanisms for the regulation of protein expression remain incompletely understood. Here, we performed systematic measurements on the uORF-containing 5' UTR of the cytomegaloviral UL4 mRNA to test alternative models of uORF-mediated regulation in human cells. We find that a terminal diproline-dependent elongating ribosome stall in the UL4 uORF prevents decreases in main ORF protein expression when ribosome loading onto the mRNA is reduced. This uORF-mediated buffering is insensitive to the location of the ribosome stall along the uORF. Computational kinetic modeling based on our measurements suggests that scanning ribosomes dissociate rather than queue when they collide with stalled elongating ribosomes within the UL4 uORF. We identify several human uORFs that repress main ORF protein expression via a similar terminal diproline motif. We propose that ribosome stalls in uORFs provide a general mechanism for buffering against reductions in main ORF translation during stress and developmental transitions.
Collapse
Affiliation(s)
- Ty A. Bottorff
- Basic Sciences Division and Computational Biology Program of the Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Biological Physics, Structure and Design Graduate Program, University of Washington, Seattle, Washington, United States of America
| | - Heungwon Park
- Basic Sciences Division and Computational Biology Program of the Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Adam P. Geballe
- Human Biology and Clinical Research Divisions, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Arvind Rasi Subramaniam
- Basic Sciences Division and Computational Biology Program of the Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Biological Physics, Structure and Design Graduate Program, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
36
|
Pfister B, Shields JM, Kockmann T, Grossmann J, Abt MR, Stadler M, Zeeman SC. Tuning heterologous glucan biosynthesis in yeast to understand and exploit plant starch diversity. BMC Biol 2022; 20:207. [PMID: 36153520 PMCID: PMC9509603 DOI: 10.1186/s12915-022-01408-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 09/13/2022] [Indexed: 11/30/2022] Open
Abstract
Background Starch, a vital plant-derived polysaccharide comprised of branched glucans, is essential in nutrition and many industrial applications. Starch is often modified post-extraction to alter its structure and enhance its functionality. Targeted metabolic engineering of crops to produce valuable and versatile starches requires knowledge of the relationships between starch biosynthesis, structure, and properties, but systematic studies to obtain this knowledge are difficult to conduct in plants. Here we used Saccharomyces cerevisiae as a testbed to dissect the functions of plant starch biosynthetic enzymes and create diverse starch-like polymers. Results We explored yeast promoters and terminators to tune the expression levels of the starch-biosynthesis machinery from Arabidopsis thaliana. We systematically modulated the expression of each starch synthase (SS) together with a branching enzyme (BE) in yeast. Protein quantification by parallel reaction monitoring (targeted proteomics) revealed unexpected effects of glucan biosynthesis on protein abundances but showed that the anticipated broad range of SS/BE enzyme ratios was maintained during the biosynthetic process. The different SS/BE ratios clearly influenced glucan structure and solubility: The higher the SS/BE ratio, the longer the glucan chains and the more glucans were partitioned into the insoluble fraction. This effect was irrespective of the SS isoform, demonstrating that the elongation/branching ratio controls glucan properties separate from enzyme specificity. Conclusions Our results provide a quantitative framework for the in silico design of improved starch biosynthetic processes in plants. Our study also exemplifies a workflow for the rational tuning of a complex pathway in yeast, starting from the selection and evaluation of expression modules to multi-gene assembly and targeted protein monitoring during the biosynthetic process. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01408-x.
Collapse
|
37
|
Controlling gene expression with deep generative design of regulatory DNA. Nat Commun 2022; 13:5099. [PMID: 36042233 PMCID: PMC9427793 DOI: 10.1038/s41467-022-32818-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 08/18/2022] [Indexed: 11/25/2022] Open
Abstract
Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue. Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Here the authors present EspressionGAN, a generative adversarial network that uses genomic and transcriptomic data to generate regulatory sequences.
Collapse
|
38
|
Ray S, Dandpat SS, Chatterjee S, Walter NG. Precise tuning of bacterial translation initiation by non-equilibrium 5'-UTR unfolding observed in single mRNAs. Nucleic Acids Res 2022; 50:8818-8833. [PMID: 35892287 PMCID: PMC9410914 DOI: 10.1093/nar/gkac635] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 06/15/2022] [Accepted: 07/14/2022] [Indexed: 11/21/2022] Open
Abstract
Noncoding, structured 5′-untranslated regions (5′-UTRs) of bacterial messenger RNAs (mRNAs) can control translation efficiency by forming structures that either recruit or repel the ribosome. Here we exploit a 5′-UTR embedded preQ1-sensing, pseudoknotted translational riboswitch to probe how binding of a small ligand controls recruitment of the bacterial ribosome to the partially overlapping Shine-Dalgarno (SD) sequence. Combining single-molecule fluorescence microscopy with mutational analyses, we find that the stability of 30S ribosomal subunit binding is inversely correlated with the free energy needed to unfold the 5′-UTR during mRNA accommodation into the mRNA binding cleft. Ligand binding to the riboswitch stabilizes the structure to both antagonize 30S recruitment and accelerate 30S dissociation. Proximity of the 5′-UTR and stability of the SD:anti-SD interaction both play important roles in modulating the initial 30S-mRNA interaction. Finally, depletion of small ribosomal subunit protein S1, known to help resolve structured 5′-UTRs, further increases the energetic penalty for mRNA accommodation. The resulting model of rapid standby site exploration followed by gated non-equilibrium unfolding of the 5′-UTR during accommodation provides a mechanistic understanding of how translation efficiency is governed by riboswitches and other dynamic structure motifs embedded upstream of the translation initiation site of bacterial mRNAs.
Collapse
Affiliation(s)
- Sujay Ray
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Shiba S Dandpat
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Surajit Chatterjee
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Nils G Walter
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
39
|
Focht CM, Strobel SA. Efficient quantitative monitoring of translational initiation by RelE cleavage. Nucleic Acids Res 2022; 50:e105. [PMID: 35871288 PMCID: PMC9561414 DOI: 10.1093/nar/gkac614] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 06/23/2022] [Accepted: 07/19/2022] [Indexed: 11/14/2022] Open
Abstract
Abstract
The sequences of the 5′ untranslated regions (5′-UTRs) of mRNA alter gene expression across domains of life. Transcriptional modulators can be easily assayed through transcription termination, but translational regulators often require indirect, laborious methods. We have leveraged RelE’s ribosome-dependent endonuclease activity to develop a quantitative assay to monitor translation initiation of cis-regulatory mRNAs. RelE cleavage accurately reports ligand-dependent changes in ribosome association for two translational riboswitches and provides quantitative information about each switch's sensitivity and range of response. RelE accurately reads out sequence-driven changes in riboswitch specificity and function and is quantitatively dependent upon ligand concentration. RelE cleavage similarly captures differences in translation initiation between yeast 5′-UTR isoforms. RelE cleavage can thus reveal a plethora of information about translation initiation in different domains of life.
Collapse
Affiliation(s)
- Caroline M Focht
- Department of Molecular Biophysics and Biochemistry, Yale University , New Haven , CT 06510 , USA
- Institute of Biomolecular Design and Discovery , West Haven , CT 06516 , USA
| | - Scott A Strobel
- Department of Molecular Biophysics and Biochemistry, Yale University , New Haven , CT 06510 , USA
- Institute of Biomolecular Design and Discovery , West Haven , CT 06516 , USA
- Department of Chemistry, Yale University , New Haven , CT 06511 , USA
| |
Collapse
|
40
|
Salinero AC, Emerson S, Cormier TC, Yin J, Morse RH, Curcio MJ. Reliance of Host-Encoded Regulators of Retromobility on Ty1 Promoter Activity or Architecture. Front Mol Biosci 2022; 9:896215. [PMID: 35847981 PMCID: PMC9283973 DOI: 10.3389/fmolb.2022.896215] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/10/2022] [Indexed: 11/13/2022] Open
Abstract
The Ty1 retrotransposon family is maintained in a functional but dormant state by its host, Saccharomyces cerevisiae. Several hundred RHF and RTT genes encoding co-factors and restrictors of Ty1 retromobility, respectively, have been identified. Well-characterized examples include MED3 and MED15, encoding subunits of the Mediator transcriptional co-activator complex; control of retromobility by Med3 and Med15 requires the Ty1 promoter in the U3 region of the long terminal repeat. To characterize the U3-dependence of other Ty1 regulators, we screened a library of 188 known rhf and rtt mutants for altered retromobility of Ty1his3AI expressed from the strong, TATA-less TEF1 promoter or the weak, TATA-containing U3 promoter. Two classes of genes, each including both RHFs and RTTs, were identified. The first class comprising 82 genes that regulated Ty1his3AI retromobility independently of U3 is enriched for RHF genes that restrict the G1 phase of the cell cycle and those involved in transcriptional elongation and mRNA catabolism. The second class of 51 genes regulated retromobility of Ty1his3AI driven only from the U3 promoter. Nineteen U3-dependent regulators (U3DRs) also controlled retromobility of Ty1his3AI driven by the weak, TATA-less PSP2 promoter, suggesting reliance on the low activity of U3. Thirty-one U3DRs failed to modulate PPSP2-Ty1his3AI retromobility, suggesting dependence on the architecture of U3. To further investigate the U3-dependency of Ty1 regulators, we developed a novel fluorescence-based assay to monitor expression of p22-Gag, a restriction factor expressed from the internal Ty1i promoter. Many U3DRs had minimal effects on levels of Ty1 RNA, Ty1i RNA or p22-Gag. These findings uncover a role for the Ty1 promoter in integrating signals from diverse host factors to modulate Ty1 RNA biogenesis or fate.
Collapse
Affiliation(s)
- Alicia C. Salinero
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
- Department of Biomedical Sciences, School of Public Health, University at Albany, Albany, NY, United States
| | - Simey Emerson
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
| | - Tayla C. Cormier
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
| | - John Yin
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
| | - Randall H. Morse
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
- Department of Biomedical Sciences, School of Public Health, University at Albany, Albany, NY, United States
| | - M. Joan Curcio
- Laboratory of Molecular Genetics, Wadsworth Center, New York State Department of Health, Albany, NY, United States
- Department of Biomedical Sciences, School of Public Health, University at Albany, Albany, NY, United States
- *Correspondence: M. Joan Curcio,
| |
Collapse
|
41
|
Georgakopoulos-Soares I, Parada GE, Hemberg M. Secondary structures in RNA synthesis, splicing and translation. Comput Struct Biotechnol J 2022; 20:2871-2884. [PMID: 35765654 PMCID: PMC9198270 DOI: 10.1016/j.csbj.2022.05.041] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/19/2022] [Accepted: 05/21/2022] [Indexed: 11/30/2022] Open
Abstract
Even though the functional role of mRNA molecules is primarily decided by the nucleotide sequence, several properties are determined by secondary structure conformations. Examples of secondary structures include long range interactions, hairpins, R-loops and G-quadruplexes and they are formed through interactions of non-adjacent nucleotides. Here, we discuss advances in our understanding of how secondary structures can impact RNA synthesis, splicing, translation and mRNA half-life. During RNA synthesis, secondary structures determine RNA polymerase II (RNAPII) speed, thereby influencing splicing. Splicing is also determined by RNA binding proteins and their binding rates are modulated by secondary structures. For the initiation of translation, secondary structures can control the choice of translation start site. Here, we highlight the mechanisms by which secondary structures modulate these processes, discuss advances in technologies to detect and study them systematically, and consider the roles of RNA secondary structures in disease.
Collapse
Affiliation(s)
- Ilias Georgakopoulos-Soares
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Guillermo E. Parada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5A 1A8, Canada
| | - Martin Hemberg
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, Boston, MA, USA
| |
Collapse
|
42
|
Toolbox for Genetic Transformation of Non-Conventional Saccharomycotina Yeasts: High Efficiency Transformation of Yeasts Belonging to the Schwanniomyces Genus. J Fungi (Basel) 2022; 8:jof8050531. [PMID: 35628786 PMCID: PMC9146037 DOI: 10.3390/jof8050531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 05/12/2022] [Accepted: 05/17/2022] [Indexed: 11/16/2022] Open
Abstract
Non-conventional yeasts are increasingly being investigated and used as producers in biotechnological processes which often offer advantages in comparison to traditional and well-established systems. Most biotechnologically interesting non-conventional yeasts belong to the Saccharomycotina subphylum, including those already in use (Pichia pastoris, Yarrowia lypolitica, etc.), as well as those that are promising but as yet insufficiently characterized. Moreover, for many of these yeasts the basic tools of genetic engineering needed for strain construction, including a procedure for efficient genetic transformation, heterologous protein expression and precise genetic modification, are lacking. The first aim of this study was to construct a set of integrative and replicative plasmids which can be used in various yeasts across the Saccharomycotina subphylum. Additionally, we demonstrate here that the electroporation procedure we developed earlier for transformation of B. bruxellensis can be applied in various yeasts which, together with the constructed plasmids, makes a solid starting point when approaching a transformation of yeasts form the Saccharomycotina subphylum. To provide a proof of principle, we successfully transformed three species from the Schwanniomyces genus (S. polymorphus var. polymorphus, S. polymorphus var. africanus and S. pseudopolymorphus) with high efficiencies (up to 8 × 103 in case of illegitimate integration of non-homologous linear DNA and up to 4.7 × 105 in case of replicative plasmid). For the latter two species this is the first reported genetic transformation. Moreover, we found that a plasmid carrying replication origin from Scheffersomyces stipitis can be used as a replicative plasmid for these three Schwanniomyces species.
Collapse
|
43
|
Niederer RO, Rojas-Duran MF, Zinshteyn B, Gilbert WV. Direct analysis of ribosome targeting illuminates thousand-fold regulation of translation initiation. Cell Syst 2022; 13:256-264.e3. [PMID: 35041803 PMCID: PMC8930539 DOI: 10.1016/j.cels.2021.12.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 07/15/2021] [Accepted: 12/09/2021] [Indexed: 12/15/2022]
Abstract
Translational control shapes the proteome in normal and pathophysiological conditions. Current high-throughput approaches reveal large differences in mRNA-specific translation activity but cannot identify the causative mRNA features. We developed direct analysis of ribosome targeting (DART) and used it to dissect regulatory elements within 5' untranslated regions that confer 1,000-fold differences in ribosome recruitment in biochemically accessible cell lysates. Using DART, we determined a functional role for most alternative 5' UTR isoforms expressed in yeast, revealed a general mode of increased translation via direct binding to a core translation factor, and identified numerous translational control elements including C-rich silencers that are sufficient to repress translation both in vitro and in vivo. DART enables systematic assessment of the translational regulatory potential of 5' UTR variants, whether native or disease-associated, and will facilitate engineering of mRNAs for optimized protein production in various systems.
Collapse
Affiliation(s)
- Rachel O Niederer
- Department of Molecular Biophysics & Biochemistry, Yale School of Medicine, New Haven, CT 06520, USA
| | - Maria F Rojas-Duran
- Department of Molecular Biophysics & Biochemistry, Yale School of Medicine, New Haven, CT 06520, USA
| | - Boris Zinshteyn
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Wendy V Gilbert
- Department of Molecular Biophysics & Biochemistry, Yale School of Medicine, New Haven, CT 06520, USA.
| |
Collapse
|
44
|
Hector RE, Mertens JA, Nichols NN. Increased expression of the fluorescent reporter protein ymNeonGreen in Saccharomyces cerevisiae by reducing RNA secondary structure near the start codon. BIOTECHNOLOGY REPORTS 2022; 33:e00697. [PMID: 35036336 PMCID: PMC8749125 DOI: 10.1016/j.btre.2021.e00697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 12/22/2021] [Accepted: 12/27/2021] [Indexed: 11/06/2022]
Abstract
Stable secondary RNA structure 3′ of the start codon inhibits ymNeonGreen expression. Removing secondary RNA structure increased expression in S. cerevisiae and E. coli. Expression was increased 2-fold in S. cerevisiae. Expression was increased 3.8-fold in E. coli. Increased expression in S. cerevisiae was promoter independent.
Expression of a new fluorescent reporter protein called mNeonGreen, that is not based on the jellyfish green fluorescent protein (GFP) sequence, shows increased brightness and folding speed compared to enhanced GFP. However, in vivo brightness of mNeonGreen and its yeast-optimized variant ymNeonGreen in S. cerevisiae is lower than expected, limiting the use of this high quantum yield, fast-folding reporter in budding yeast. This study shows that secondary RNA structure near the start codon in the ymNeonGreen ORF inhibits expression in S. cerevisiae. Removing secondary structure, without altering the ymNeonGreen protein sequence, led to a 2 and 4-fold increase in fluorescence when expressed in S. cerevisiae and E. coli, respectively. In S. cerevisiae, increased fluorescence was seen with strong and weak promoters and led to higher transcript levels suggesting greater transcript stability and improved expression in the absence of stable secondary RNA structure near the start codon.
Collapse
|
45
|
Dahale S, Ruiz-Orera J, Silhavy J, Hübner N, van Heesch S, Pravenec M, Atanur SS. Cap analysis of gene expression reveals alternative promoter usage in a rat model of hypertension. Life Sci Alliance 2022; 5:5/4/e202101234. [PMID: 34996843 PMCID: PMC8742872 DOI: 10.26508/lsa.202101234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 12/26/2021] [Accepted: 12/28/2021] [Indexed: 11/24/2022] Open
Abstract
The role of alternative promoter usage in tissue-specific gene expression has been well established; however, its role in complex diseases is poorly understood. We performed cap analysis of gene expression (CAGE) sequencing from the left ventricle of a rat model of hypertension, the spontaneously hypertensive rat (SHR), and a normotensive strain, Brown Norway to understand the role of alternative promoter usage in complex disease. We identified 26,560 CAGE-defined transcription start sites in the rat left ventricle, including 1,970 novel cardiac transcription start sites. We identified 28 genes with alternative promoter usage between SHR and Brown Norway, which could lead to protein isoforms differing at the amino terminus between two strains and 475 promoter switching events altering the length of the 5' UTR. We found that the shift in Insr promoter usage was significantly associated with insulin levels and blood pressure within a panel of HXB/BXH recombinant inbred rat strains, suggesting that hyperinsulinemia due to insulin resistance might lead to hypertension in SHR. Our study provides a preliminary evidence of alternative promoter usage in complex diseases.
Collapse
Affiliation(s)
- Sonal Dahale
- Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, UK.,Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - Jan Silhavy
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Norbert Hübner
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany.,Charité -Universitätsmedizin, Berlin, Germany.,DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, Berlin, Germany
| | | | - Michal Pravenec
- Institute of Physiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Santosh S Atanur
- Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, UK .,The National Institute for Health Research, Imperial Biomedical Research Centre, ITMAT Data Science Group, Imperial College London, London, UK
| |
Collapse
|
46
|
May GE, McManus CJ. High-Throughput Quantitation of Yeast uORF Regulatory Impacts Using FACS-uORF. Methods Mol Biol 2022; 2404:331-351. [PMID: 34694618 DOI: 10.1007/978-1-0716-1851-6_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Eukaryotic upstream Open Reading Frames (uORFs) are short translated regions found in many transcript leaders (Barbosa et al. PLoS Genet 9:e1003529, 2013; Zhang et al. Trends Biochem Sci 44:782-794, 2019). Modern transcript annotations and ribosome profiling studies have found thousands of AUG-initiated uORFs, and many more uORFs initiated by near-cognate codons (CUG, GUG, UUG, etc.). Their translation generally decreases the expression of the main encoded protein by preventing ribosomes from reaching the main ORF of each gene, and by inducing nonsense mediated decay (NMD) through premature termination. Under many cellular stresses, uORF containing transcripts are de-repressed due to decreased translation initiation (Young et al. J Biol Chem 291:16927-16935, 2016). Traditional experimental evaluation of uORFs involves comparing expression from matched uORF-containing and start-codon mutated transcript leader reporter plasmids. This tedious process has precluded analysis of large numbers of uORFs. We recently used FACS-uORF to simultaneously assay thousands of yeast uORFs in order to evaluate the impact of codon usage on their functions (Lin et al. Nucleic Acids Res 2:1-10, 2019). Here, we provide a step-by-step protocol for this assay.
Collapse
Affiliation(s)
- Gemma E May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
| | - C Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA.
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
47
|
Haslem L, Brown M, Zhang XA, Hays JM, Hays FA. Overproduction of Membrane-Associated, and Integrated, Proteins Using Saccharomyces cerevisiae. Methods Mol Biol 2022; 2507:111-141. [PMID: 35773580 PMCID: PMC9531322 DOI: 10.1007/978-1-0716-2368-8_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Structural and functional eukaryotic membrane protein research continues to grow at an increasing rate, placing greater significance on leveraging productive protein expression pipelines to feed downstream studies. Bacterial expression systems (e.g., E. coli) are often the preferred system due to their simple growth conditions, relative simplicity in experimental workflow, low overall cost per liter of cell growth, and ease of genetic manipulation. However, overproduction success of eukaryotic membrane proteins in bacterial systems is hindered by the limited native processing ability of bacterial systems for important protein folding interactions (e.g., disulfide bonds), post-translational modifications (e.g., glycosylation), and inherent disadvantages in protein trafficking and folding machinery compared to other expression systems.In contrast, Saccharomyces cerevisiae expression systems combine positive benefits of simpler bacterial systems with those of more complex eukaryotic systems (e.g., mammalian cells). Benefits include inexpensive growth, robust DNA repair and recombination machinery, amenability to high density growths in bioreactors, efficient transformation, and robust post-translational modification machinery. These characteristics make S. cerevisiae a viable first-alternative when bacterial overproduction is insufficient. Thus, this chapter provides a framework, using methods that have proven successful in prior efforts, for overproducing membrane anchored or membrane integrated proteins in S. cerevisiae. The framework is designed to improve yields for all levels of overexpression expertise, providing optimization insights for the variety of processes involved in heterologous protein expression.
Collapse
Affiliation(s)
- Landon Haslem
- Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Marina Brown
- Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Xin A Zhang
- Stephenson Cancer Center, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
- Department of Physiology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Jennifer M Hays
- Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Franklin A Hays
- Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA.
- Stephenson Cancer Center, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA.
| |
Collapse
|
48
|
May GE, McManus CJ. Multiplexed Analysis of Human uORF Regulatory Functions During the ISR Using PoLib-Seq. Methods Mol Biol 2022; 2428:41-62. [PMID: 35171472 DOI: 10.1007/978-1-0716-1975-9_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Protein synthesis is a highly regulated essential process. As such, it is subjected to substantial regulation in response to stress. One hallmark of the Integrated Stress Response (ISR) is the immediate shutdown of most translation through phosphorylation of the alpha subunit of translation initiation factor eIF2 and activation of eIF4E binding proteins. While these posttranslational modifications largely inhibit cap-dependent translation, many mRNA resist this inhibition by alternative translation mechanisms involving cis-regulatory sequences and structures in 5' transcript leaders, including upstream Open Reading Frames (uORFs), Internal Ribosome Entry Sites (IRESes), and Cap-Independent Translation Elements (CITEs). Studies of uORF and IRES activity are often performed on a gene-by-gene basis; however, high-throughput methods have recently emerged. Here, we describe a protocol for Polysome Library Sequencing (PoLib-Seq; Fig. 1), a multiplexed assay of reporter gene translation that can be used during the ISR. A designer library of reporter RNAs are transfected into tissue-culture cells, and their translation is assayed via sucrose gradient fractionation followed by high-throughput sequencing. As an example, we include PoLib-seq results simultaneously assaying translation of wildtype and uORF mutant human ATF4 reporter RNAs, recapitulating the known function of uORF1 in resisting translational inhibition during the ISR.
Collapse
Affiliation(s)
- Gemma E May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
| | - C Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA.
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
49
|
Kang YJ, Li JY, Ke L, Jiang S, Yang DC, Hou M, Gao G. Quantitative model suggests both intrinsic and contextual features contribute to the transcript coding ability determination in cells. Brief Bioinform 2021; 23:6445106. [PMID: 34849565 DOI: 10.1093/bib/bbab483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 10/18/2021] [Accepted: 10/23/2021] [Indexed: 11/13/2022] Open
Abstract
Gene transcription and protein translation are two key steps of the 'central dogma.' It is still a major challenge to quantitatively deconvolute factors contributing to the coding ability of transcripts in mammals. Here, we propose ribosome calculator (RiboCalc) for quantitatively modeling the coding ability of RNAs in human genome. In addition to effectively predicting the experimentally confirmed coding abundance via sequence and transcription features with high accuracy, RiboCalc provides interpretable parameters with biological information. Large-scale analysis further revealed a number of transcripts with a variety of coding ability for distinct types of cells (i.e. context-dependent coding transcripts), suggesting that, contrary to conventional wisdom, a transcript's coding ability should be modeled as a continuous spectrum with a context-dependent nature.
Collapse
Affiliation(s)
- Yu-Jian Kang
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - Jing-Yi Li
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - Lan Ke
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - Shuai Jiang
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - De-Chang Yang
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - Mei Hou
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ge Gao
- Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing, 100871, China
| |
Collapse
|
50
|
Liu B, Wu P, Zhou J, Yin A, Yu Y, Lu H. Characterization and optimization of the LAC4 upstream region for low-leakage expression in Kluyveromyces marxianus. Yeast 2021; 39:283-296. [PMID: 34791694 DOI: 10.1002/yea.3682] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 11/11/2021] [Accepted: 11/13/2021] [Indexed: 11/07/2022] Open
Abstract
Kluyveromyces marxianus is a promising host for the production of heterologous proteins, chemicals, and bioethanol. One superior feature of this species is its capacity to assimilate lactose, which is rendered by the LAC12-LAC4 gene pair encoding a lactose permease and a β-galactosidase enzyme. Little is known about the regulation of LAC4 in K. marxianus. In this study, we showed the presence of weak glucose repression in the regulation of LAC4 and that might contribute to the leaky expression of LAC4 in the glucose medium. In a mutagenesis screen of 1000-bp LAC4 upstream region, one mutant region, named H1, drove low-leakage expression of a URA3 reporter gene in glucose medium. Two mutations inside a polyadenosine stretch (poly(A)) of 5' UTR were major contributors to the low-leakage phenotype of H1. H1 directed low-leakage expression of GFP on a plasmid and that of LAC4 in situ in the glucose medium, which was not due to the reduction of mRNA levels. Meanwhile, H1 did not affect the induction of GFP or LAC4 by lactose. Cre recombinase expressed by H1 caused lower toxicity in the repressive condition and achieved higher yield after induction, compared with that expressed by a wild-type LAC4 upstream region or a strong INU1 promoter. Our study suggested that poly(A) inside 5' UTR played a role in regulating the expression of LAC4 in the repressive condition. Meanwhile, H1 provided a base for the development of a strict inducible system for expressing industrial proteins, especially toxic proteins.
Collapse
Affiliation(s)
- Benxin Liu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China
| | - Pingping Wu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China
| | - Jungang Zhou
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China
| | - Anqi Yin
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China
| | - Yao Yu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China.,National Technology Innovation Center of Synthetic Biology, Tianjin, China
| | - Hong Lu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Shanghai Engineering Research Center of Industrial Microorganisms, Shanghai, China.,National Technology Innovation Center of Synthetic Biology, Tianjin, China.,Shanghai Collaborative Innovation Center for Biomanufacturing (SCICB), East China University of Science and Technology, Shanghai, China
| |
Collapse
|