1
|
Balázs Z, Tombácz D, Szűcs A, Csabai Z, Megyeri K, Petrov AN, Snyder M, Boldogkői Z. Long-Read Sequencing of Human Cytomegalovirus Transcriptome Reveals RNA Isoforms Carrying Distinct Coding Potentials. Sci Rep 2017; 7:15989. [PMID: 29167532 PMCID: PMC5700075 DOI: 10.1038/s41598-017-16262-z] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 11/07/2017] [Indexed: 12/22/2022] Open
Abstract
The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.
Collapse
Affiliation(s)
- Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary.,Department of Genetics, School of Medicine, Stanford University, Stanford, California, 94305, USA
| | - Attila Szűcs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Klára Megyeri
- Department of Medical Microbiology and Immunobiology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Alexey N Petrov
- Department of Structural Biology, School of Medicine, Stanford University, Stanford, California, 94305, USA.,Department of Biological Sciences, College of Sciences and Mathematics, Auburn University, Auburn, Alabama, 36849, USA
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, Stanford, California, 94305, USA
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary.
| |
Collapse
|
2
|
Yang YF, Zhang X, Ma X, Zhao T, Sun Q, Huan Q, Wu S, Du Z, Qian W. Trans-splicing enhances translational efficiency in C. elegans. Genome Res 2017; 27:1525-1535. [PMID: 28684554 PMCID: PMC5580712 DOI: 10.1101/gr.202150.115] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 06/22/2017] [Indexed: 11/24/2022]
Abstract
Translational efficiency is subject to extensive regulation. However, the factors influencing such regulation are poorly understood. In Caenorhabditis elegans, 62% of genes are trans-spliced to a specific spliced leader (SL1), which replaces part of the native 5' untranslated region (5' UTR). Given the pivotal role the 5' UTR plays in the regulation of translational efficiency, we hypothesized that SL1 trans-splicing functions to regulate translational efficiency. With genome-wide analysis on Ribo-seq data, polysome profiling experiments, and CRISPR-Cas9-based genetic manipulation of trans-splicing sites, we found four lines of evidence in support of this hypothesis. First, SL1 trans-spliced genes have higher translational efficiencies than non-trans-spliced genes. Second, SL1 trans-spliced genes have higher translational efficiencies than non-trans-spliced orthologous genes in other nematode species. Third, an SL1 trans-spliced isoform has higher translational efficiency than the non-trans-spliced isoform of the same gene. Fourth, deletion of trans-splicing sites of endogenous genes leads to reduced translational efficiency. Importantly, we demonstrated that SL1 trans-splicing plays a key role in enhancing translational efficiencies of essential genes. We further discovered that SL1 trans-splicing likely enhances translational efficiency by shortening the native 5' UTRs, hence reducing the presence of upstream start codons (uAUG) and weakening mRNA secondary structures. Taken together, our study elucidates the global function of trans-splicing in enhancing translational efficiency in nematodes, paving the way for further understanding the genomic mechanisms of translational regulation.
Collapse
Affiliation(s)
- Yu-Fei Yang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaoqing Zhang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xuehua Ma
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Taolan Zhao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Qiushi Sun
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Qing Huan
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Shaohuan Wu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhuo Du
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Wenfeng Qian
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
3
|
Sinha A, Langnick C, Sommer RJ, Dieterich C. Genome-wide analysis of trans-splicing in the nematode Pristionchus pacificus unravels conserved gene functions for germline and dauer development in divergent operons. RNA (NEW YORK, N.Y.) 2014; 20:1386-1397. [PMID: 25015138 PMCID: PMC4138322 DOI: 10.1261/rna.041954.113] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Accepted: 05/19/2014] [Indexed: 06/03/2023]
Abstract
Discovery of trans-splicing in multiple metazoan lineages led to the identification of operon-like gene organization in diverse organisms, including trypanosomes, tunicates, and nematodes, but the functional significance of such operons is not completely understood. To see whether the content or organization of operons serves similar roles across species, we experimentally defined operons in the nematode model Pristionchus pacificus. We performed affinity capture experiments on mRNA pools to specifically enrich for transcripts that are trans-spliced to either the SL1- or SL2-spliced leader, using spliced leader-specific probes. We obtained distinct trans-splicing patterns from the analysis of three mRNA pools (total mRNA, SL1 and SL2 fraction) by RNA-seq. This information was combined with a genome-wide analysis of gene orientation and spacing. We could confirm 2219 operons by RNA-seq data out of 6709 candidate operons, which were predicted by sequence information alone. Our gene order comparison of the Caenorhabditis elegans and P. pacificus genomes shows major changes in operon organization in the two species. Notably, only 128 out of 1288 operons in C. elegans are conserved in P. pacificus. However, analysis of gene-expression profiles identified conserved functions such as an enrichment of germline-expressed genes and higher expression levels of operonic genes during recovery from dauer arrest in both species. These results provide support for the model that a necessity for increased transcriptional efficiency in the context of certain developmental processes could be a selective constraint for operon evolution in metazoans. Our method is generally applicable to other metazoans to see if similar functional constraints regulate gene organization into operons.
Collapse
Affiliation(s)
- Amit Sinha
- Max Planck Institute for Developmental Biology, Department for Evolutionary Biology, 72076 Tübingen, Germany
| | - Claudia Langnick
- Berlin Institute for Medical Systems Biology (BIMSB) at the Max Delbrück Center for Molecular Medicine Berlin, 13125 Berlin, Germany
| | - Ralf J Sommer
- Max Planck Institute for Developmental Biology, Department for Evolutionary Biology, 72076 Tübingen, Germany
| | | |
Collapse
|
4
|
Chew DS, Mah AK, Baillie DL. Characterizing the transcriptional regulation of let-721, a Caenorhabditis elegans homolog of human electron flavoprotein dehydrogenase. Mol Genet Genomics 2009; 282:555-70. [PMID: 19774399 DOI: 10.1007/s00438-009-0485-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2009] [Accepted: 09/07/2009] [Indexed: 11/28/2022]
Abstract
LET-721 is the Caenorhabditis elegans ortholog of electron-transferring flavoprotein dehydrogenase (ETFDH). We are studying this protein in C. elegans in order to establish a tractable model system for further exploration of ETFDH structure and function. ETFDH is an inner mitochondrial membrane localized enzyme that plays a key role in the beta-oxidation of fatty acids and catabolism of amino acids and choline. ETFDH accepts electrons from at least twelve mitochondrial matrix flavoprotein dehydrogenases via an intermediate dimer protein and transfers the electrons to ubiquinone. In humans, ETFDH mutations result in the autosomal recessive metabolic disorder, multiple acyl-CoA dehydrogenase deficiency. Mutants of let-721 in C. elegans are either maternal effect lethals or semi-sterile. let-721 is transcribed in the pharynx, body wall muscle, hypoderm, intestine and somatic gonad. In addition, the subcellular localization of LET-721 agrees with predictions that it is localized to mitochondria. We identified and confirmed three cis-regulatory sequences (pha-site, rep-site, and act-site). Phylogenetic footprinting of each site indicates that they are conserved between four Caenorhabditis species. The pha-site mapped roughly 1,300 bp upstream of let-721's translational start site and is necessary for expression in pharyngeal tissues. The rep-site mapped roughly 830 bp upstream of the translational start site and represses expression of LET-721 within pharyngeal tissues. The act-site mapped roughly 800 bp upstream of the translational start site and is required for expression within spermatheca, body wall muscle, pharynx, and intestine. Taken together, we find that LET-721 is a mitochondrially expressed protein that is under complex transcriptional controls.
Collapse
Affiliation(s)
- Derek S Chew
- Department Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada
| | | | | |
Collapse
|
5
|
Abstract
Operons are widespread in prokaryotes, but are uncommon in eukaryotes, except nematode worms, where approximately 15% of genes reside in over 1100 operons in the model organism Caenorhabditis elegans. It is unclear how operons have become abundant in nematode genomes. The "one-way street" hypothesis asserts that once formed by chance, operons are very difficult to break, because the breakage would leave downstream genes in an operon without a promoter, and hence, unexpressed. To test this hypothesis, we analyzed the presence and absence of C. elegans operons in Caenorhabditis briggsae, Caenorhabditis remanei, and Caenorhabditis brenneri, using Pristionchus pacificus and Brugia malayi as outgroups, and identified numerous operon gains and losses. Coupled with experimental examination of trans-splicing patterns, our comparative genomic analysis revealed diverse molecular mechanisms of operon losses, including inversion, insertion, and relocation, but the presence of internal promoters was not found to facilitate operon losses. In several cases, the data allowed inference of mechanisms by which downstream genes are expressed after operon breakage. We found that the rate of operon gain is approximately 3.3 times that of operon loss. Thus, the evolutionary dynamics of nematode operons is better described as "easy come, slow go," rather than a "one-way street." Based on a mathematic model of operon gains and losses and additional assumptions, we projected that the number of operons in C. elegans will continue to rise by 6%-18% in future evolution before reaching equilibrium between operon gains and losses.
Collapse
Affiliation(s)
- Wenfeng Qian
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | | |
Collapse
|
6
|
Hujová J, Sikora J, Dobrovolný R, Poupětová H, Ledvinová J, Kostrouchová M, Hřebíček M. Characterization of gana-1, a Caenorhabditis elegans gene encoding a single ortholog of vertebrate alpha-galactosidase and alpha-N-acetylgalactosaminidase. BMC Cell Biol 2005; 6:5. [PMID: 15676072 PMCID: PMC548690 DOI: 10.1186/1471-2121-6-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2004] [Accepted: 01/27/2005] [Indexed: 11/10/2022] Open
Abstract
Background Human α-galactosidase A (α-GAL) and α-N-acetylgalactosaminidase (α-NAGA) are presumed to share a common ancestor. Deficiencies of these enzymes cause two well-characterized human lysosomal storage disorders (LSD) – Fabry (α-GAL deficiency) and Schindler (α-NAGA deficiency) diseases. Caenorhabditis elegans was previously shown to be a relevant model organism for several late endosomal/lysosomal membrane proteins associated with LSDs. The aim of this study was to identify and characterize C. elegans orthologs to both human lysosomal luminal proteins α-GAL and α-NAGA. Results BlastP searches for orthologs of human α-GAL and α-NAGA revealed a single C. elegans gene (R07B7.11) with homology to both human genes (α-galactosidase and α-N-acetylgalactosaminidase) – gana-1. We cloned and sequenced the complete gana-1 cDNA and elucidated the gene organization. Phylogenetic analyses and homology modeling of GANA-1 based on the 3D structure of chicken α-NAGA, rice α-GAL and human α-GAL suggest a close evolutionary relationship of GANA-1 to both human α-GAL and α-NAGA. Both α-GAL and α-NAGA enzymatic activities were detected in C. elegans mixed culture homogenates. However, α-GAL activity on an artificial substrate was completely inhibited by the α-NAGA inhibitor, N-acetyl-D-galactosamine. A GANA-1::GFP fusion protein expressed from a transgene, containing the complete gana-1 coding region and 3 kb of its hypothetical promoter, was not detectable under the standard laboratory conditions. The GFP signal was observed solely in a vesicular compartment of coelomocytes of the animals treated with Concanamycin A (CON A) or NH4Cl, agents that increase the pH of the cellular acidic compartment. Immunofluorescence detection of the fusion protein using polyclonal anti-GFP antibody showed a broader and coarsely granular cytoplasmic expression pattern in body wall muscle cells, intestinal cells, and a vesicular compartment of coelomocytes. Inhibition of gana-1 by RNA interference resulted in a decrease of both α-GAL and α-NAGA activities measured in mixed stage culture homogenates but did not cause any obvious phenotype. Conclusions GANA-1 is a single C. elegans ortholog of both human α-GAL and α-NAGA proteins. Phylogenetic, homology modeling, biochemical and GFP expression analyses support the hypothesis that GANA-1 has dual enzymatic activity and is localized in an acidic cellular compartment.
Collapse
Affiliation(s)
- Jana Hujová
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Jakub Sikora
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Robert Dobrovolný
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Helena Poupětová
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Jana Ledvinová
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Marta Kostrouchová
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| | - Martin Hřebíček
- Institute of Inherited Metabolic Disorders, Charles University, 1Medical Faculty, Prague, Czech Republic
| |
Collapse
|
7
|
Affiliation(s)
- Thomas Blumenthal
- Department of Biochemistry and Molecular Genetics, Box B-121, University of Colorado School of Medicine, Denver, Colorado 80262, USA.
| | | |
Collapse
|