1
|
Exitron splicing of odor receptor genes in Drosophila. Proc Natl Acad Sci U S A 2024; 121:e2320277121. [PMID: 38507450 PMCID: PMC10990081 DOI: 10.1073/pnas.2320277121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 02/06/2024] [Indexed: 03/22/2024] Open
Abstract
Proper expression of odor receptor genes is critical for the function of olfactory systems. In this study, we identified exitrons (exonic introns) in four of the 39 Odorant receptor (Or) genes expressed in the Drosophila antenna. Exitrons are sequences that can be spliced out from within a protein-coding exon, thereby altering the encoded protein. We focused on Or88a, which encodes a pheromone receptor, and found that exitron splicing of Or88a is conserved across five Drosophila species over 20 My of evolution. The exitron was spliced out in 15% of Or88a transcripts. Removal of this exitron creates a non-coding RNA rather than an RNA that encodes a stable protein. Our results suggest the hypothesis that in the case of Or88a, exitron splicing could act in neuronal modulation by decreasing the level of functional Or transcripts. Activation of Or88a-expressing olfactory receptor neurons via either optogenetics or pheromone stimulation increased the level of exitron-spliced transcripts, with optogenetic activation leading to a 14-fold increase. A fifth Or can also undergo an alternative splicing event that eliminates most of the canonical open reading frame. Besides these cases of alternative splicing, we found alternative polyadenylation of four Ors, and exposure of Or67c to its ligand ethyl lactate in the antenna downregulated all of its 3' isoforms. Our study reveals mechanisms by which neuronal activity could be modulated via regulation of the levels of Or isoforms.
Collapse
|
2
|
Development of Moloney Murine Leukemia Virus Reverse Transcriptase Fused with Archaeal DNA-binding Protein Sis7a. Recent Pat Biotechnol 2024; 18:71-83. [PMID: 37016518 DOI: 10.2174/1872208317666230403104302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 12/21/2022] [Accepted: 02/01/2023] [Indexed: 04/06/2023]
Abstract
INTRODUCTION Moloney Murine Leukemia Virus Reverse Transcriptase (MMLV RT) is a common enzyme used to convert RNA sequences into cDNA. However, it still has its shortcomings, especially in terms of processivity and thermostability. According to a previous patent, the fusion of polymerase enzyme to an archaeal DNA-binding protein has been proven to enhance its performance. Furthermore, recent studies have also stated that the fusion of a polymerase enzyme to an archaeal DNA-binding protein is predicted to improve its thermostability and processivity. AIM As an early stage of enzyme development, this study aimed to design, express, and purify enzymatically active MMLV RT fused with archaeal DNA-binding protein. METHODS RT fusion proteins were designed and evaluated using in silico methods. The RT fusion enzyme was then expressed in Escherichia coli BL21(DE3) and purified. Its reverse transcriptional activity was proved using reverse transcription quantitative polymerase chain reaction (RT-qPCR). RESULTS This study showed that MMLV RT fusion with Sis7a protein at its C-terminal end using commercial linker (GGVDMI) produced the best in silico evaluation results. The RT fusion was successfully expressed and purified. It was also known that the optimal condition for expression of the RT fusion was using 0.5 mM IPTG with post-induction incubation at room temperature (± 26°C) for 16 hours. In addition, the activity assay proved that the RT fusion has the reverse transcriptional activity. CONCLUSION This study shows that the designed MMLV RT Sis7a fusion can be expressed and purified, is enzymatically active, and has the potential to be developed as an improved RT enzyme. Further study is still needed to prove its thermostability and processivity, and further characterize, and plan production scale-up of the MMLV RT Sis7a fusion for commercial use.
Collapse
|
3
|
A duplex droplet digital PCR assay for absolute quantification and characterization of long self-amplifying RNA. Sci Rep 2023; 13:19050. [PMID: 37923834 PMCID: PMC10624827 DOI: 10.1038/s41598-023-46314-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 10/30/2023] [Indexed: 11/06/2023] Open
Abstract
Self-amplifying messenger ribonucleic acid (saRNA) provides extended expression of genes of interest by encoding an alphavirus-derived RNA replicase and thus is 2-3 times larger than conventional messenger RNA. However, quality assessment of long RNA transcripts is challenging using standard techniques. Here, we utilized a multiplex droplet digital polymerase chain reaction (ddPCR) assay to assess the quality of saRNA produced from an in vitro transcription reaction and the replication kinetics in human cell lines. Using the one-step reverse transcription ddPCR, we show that an in vitro transcription generates 50-60% full-length saRNA transcripts. However, we note that the two-step reverse transcription ddPCR assay results in a 20% decrease from results obtained using the one-step and confirmed using capillary gel electrophoresis. Additionally, we provided three formulas that differ in the level of stringency and assumptions made to calculate the fraction of intact saRNA. Using ddPCR, we also showed that subgenomic transcripts of saRNA were 19-to-108-fold higher than genomic transcripts at different hours post-transfection of mammalian cells in copies. Therefore, we demonstrate that multiplex ddPCR is well suited for quality assessment of long RNA and replication kinetics of saRNA based on absolute quantification.
Collapse
|
4
|
Identification of high-confidence human poly(A) RNA isoform scaffolds using nanopore sequencing. RNA (NEW YORK, N.Y.) 2022; 28:162-176. [PMID: 34728536 PMCID: PMC8906549 DOI: 10.1261/rna.078703.121] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 10/13/2021] [Indexed: 06/13/2023]
Abstract
Nanopore sequencing devices read individual RNA strands directly. This facilitates identification of exon linkages and nucleotide modifications; however, using conventional direct RNA nanopore sequencing, the 5' and 3' ends of poly(A) RNA cannot be identified unambiguously. This is due in part to RNA degradation in vivo and in vitro that can obscure transcription start and end sites. In this study, we aimed to identify individual full-length human RNA isoforms among ∼4 million nanopore poly(A)-selected RNA reads. First, to identify RNA strands bearing 5' m7G caps, we exchanged the biological cap for a modified cap attached to a 45-nt oligomer. This oligomer adaptation method improved 5' end sequencing and ensured correct identification of the 5' m7G capped ends. Second, among these 5'-capped nanopore reads, we screened for features consistent with a 3' polyadenylation site. Combining these two steps, we identified 294,107 individual high-confidence full-length RNA scaffolds from human GM12878 cells, most of which (257,721) aligned to protein-coding genes. Of these, 4876 scaffolds indicated unannotated isoforms that were often internal to longer, previously identified RNA isoforms. Orthogonal data for m7G caps and open chromatin, such as CAGE and DNase-HS seq, confirmed the validity of these high-confidence RNA scaffolds.
Collapse
|
5
|
RT-qPCR Detection of SARS-CoV-2: No Need for a Dedicated Reverse Transcription Step. Int J Mol Sci 2022; 23:ijms23031303. [PMID: 35163227 PMCID: PMC8835954 DOI: 10.3390/ijms23031303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 12/27/2021] [Accepted: 01/21/2022] [Indexed: 11/16/2022] Open
Abstract
Reverse transcription of RNA coupled to amplification of the resulting cDNA by the polymerase chain reaction (RT-PCR) is one of the principal molecular technologies in use today, with applications across all areas of science and medicine. In its real-time, fluorescence-based usage (RT-qPCR), it has long been a core technology driving the accurate, rapid and sensitive laboratory diagnosis of infectious diseases. However, RT-qPCR protocols have changed little over the past 30 years, with the RT step constituting a significant percentage of the time taken to complete a typical RT-qPCR assay. When applied to research investigations, reverse transcription has been evaluated by criteria such as maximum yield, length of transcription, fidelity, and faithful representation of an RNA pool. Crucially, however, these are of less relevance in a diagnostic RT-PCR test, where speed and sensitivity are the prime RT imperatives, with specificity contributed by the PCR component. We propose a paradigm shift that omits the requirement for a separate high-temperature RT step at the beginning of an RT-qPCR assay. This is achieved by means of an innovative protocol that incorporates suitable reagents with a revised primer and amplicon design and we demonstrate a proof of principle that incorporates the RT step as part of the PCR assay setup at room temperature. Use of this modification as part of a diagnostic assay will of course require additional characterisation, validation and optimisation of the PCR step. Combining this revision with our previous development of fast qPCR protocols allows completion of a 40 cycle RT-qPCR run on a suitable commercial instrument in approximately 15 min. Even faster times, in combination with extreme PCR procedures, can be achieved.
Collapse
|
6
|
A novel role for gag as a cis-acting element regulating RNA structure, dimerization and packaging in HIV-1 lentiviral vectors. Nucleic Acids Res 2021; 50:430-448. [PMID: 34928383 PMCID: PMC8754630 DOI: 10.1093/nar/gkab1206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/17/2021] [Accepted: 11/30/2021] [Indexed: 11/22/2022] Open
Abstract
Clinical usage of lentiviral vectors is now established and increasing but remains constrained by vector titer with RNA packaging being a limiting factor. Lentiviral vector RNA is packaged through specific recognition of the packaging signal on the RNA by the viral structural protein Gag. We investigated structurally informed modifications of the 5′ leader and gag RNA sequences in which the extended packaging signal lies, to attempt to enhance the packaging process by facilitating vector RNA dimerization, a process closely linked to packaging. We used in-gel SHAPE to study the structures of these mutants in an attempt to derive structure-function correlations that could inform optimized vector RNA design. In-gel SHAPE of both dimeric and monomeric species of RNA revealed a previously unreported direct interaction between the U5 region of the HIV-1 leader and the downstream gag sequences. Our data suggest a structural equilibrium exists in the dimeric viral RNA between a metastable structure that includes a U5–gag interaction and a more stable structure with a U5–AUG duplex. Our data provide clarification for the previously unexplained requirement for the 5′ region of gag in enhancing genomic RNA packaging and provide a basis for design of optimized HIV-1 based vectors.
Collapse
|
7
|
Direct long-read RNA sequencing identifies a subset of questionable exitrons likely arising from reverse transcription artifacts. Genome Biol 2021; 22:190. [PMID: 34183059 PMCID: PMC8240250 DOI: 10.1186/s13059-021-02411-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 06/16/2021] [Indexed: 11/24/2022] Open
Abstract
Resistance to CD19-directed immunotherapies in lymphoblastic leukemia has been attributed, among other factors, to several aberrant CD19 pre-mRNA splicing events, including recently reported excision of a cryptic intron embedded within CD19 exon 2. While "exitrons" are known to exist in hundreds of human transcripts, we discovered, using reporter assays and direct long-read RNA sequencing (dRNA-seq), that the CD19 exitron is an artifact of reverse transcription. Extending our analysis to publicly available datasets, we identified dozens of questionable exitrons, dubbed "falsitrons," that appear only in cDNA-seq, but never in dRNA-seq. Our results highlight the importance of dRNA-seq for transcript isoform validation.
Collapse
MESH Headings
- Alternative Splicing
- Antibodies, Bispecific/pharmacology
- Antineoplastic Agents, Immunological/pharmacology
- Artifacts
- B-Lymphocytes/drug effects
- B-Lymphocytes/immunology
- B-Lymphocytes/pathology
- Base Pairing
- Base Sequence
- Cell Line, Tumor
- Datasets as Topic
- Exons
- High-Throughput Nucleotide Sequencing
- Humans
- Immunotherapy/methods
- Introns
- Models, Biological
- Nucleic Acid Conformation
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/drug therapy
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/immunology
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/pathology
- Protein Isoforms/chemistry
- Protein Isoforms/genetics
- Protein Isoforms/immunology
- RNA, Messenger/chemistry
- RNA, Messenger/genetics
- RNA, Messenger/immunology
- Receptors, Antigen, T-Cell/genetics
- Receptors, Antigen, T-Cell/immunology
- Reverse Transcription
Collapse
|
8
|
Using an L7Ae-Tethered, Hydroxyl Radical-Mediated Footprinting Strategy to Identify and Validate Kink-Turns in RNAs. Methods Mol Biol 2021; 2167:147-169. [PMID: 32712919 DOI: 10.1007/978-1-0716-0716-9_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Kink-turns are important RNA structural modules that facilitate long-range tertiary interactions and form binding sites for members of the L7Ae family of proteins. Present in a wide variety of functional RNAs, kink-turns play key organizational roles in many RNA-based cellular processes, including translation, modification, and tRNA biogenesis. It is important to determine the contribution of kink-turns to the overall architecture of resident RNAs, as these modules dictate ribonucleoprotein (RNP) assembly and function. This chapter describes a site-directed, hydroxyl radical-mediated footprinting strategy that utilizes L7Ae-tethered chemical nucleases to experimentally validate computationally identified kink-turns in any RNA and under a wide variety of conditions. The work plan described here uses the catalytic RNase P RNA as an example to provide a blueprint for using this footprinting method to map RNA-protein interactions in other RNP complexes.
Collapse
|
9
|
Formation of HERV-K and HERV-Fc1 Envelope Family Members is Suppressed on Transcriptional and Translational Level. Int J Mol Sci 2020; 21:ijms21217855. [PMID: 33113941 PMCID: PMC7660216 DOI: 10.3390/ijms21217855] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 10/15/2020] [Accepted: 10/21/2020] [Indexed: 12/12/2022] Open
Abstract
The human genome comprises 8% sequences of retroviral origin, so-called human endogenous retroviruses (HERVs). Most of these proviral sequences are defective, but some possess open reading frames. They can lead to the formation of viral transcripts, when activated by intrinsic and extrinsic factors. HERVs are thought to play a pathological role in inflammatory diseases and cancer. Since the consequences of activated proviral sequences in the human body are largely unexplored, selected envelope proteins of human endogenous retroviruses associated with inflammatory diseases, namely HERV-K18, HERV-K113, and HERV-Fc1, were investigated in the present study. A formation of glycosylated envelope proteins was demonstrated in different mammalian cell lines. Nevertheless, protein maturation seemed to be incomplete as no transport to the plasma membrane was observed. Instead, the proteins remained in the ER where they induced the expression of genes involved in unfolded protein response, such as HSPA5 and sXBP1. Furthermore, low expression levels of native envelope proteins were increased by codon optimization. Cell-free expression systems showed that both the transcriptional and translational level is affected. By generating different codon-optimized variants of HERV-K113 envelope, the influence of single rare t-RNA pools in certain cell lines was demonstrated. The mRNA secondary structure also appears to play an important role in the translation of the tested viral envelope proteins. In summary, the formation of certain HERV proteins is basically possible. However, their complete maturation and thus full biologic activity seems to depend on additional factors that might be disease-specific and await elucidation in the future.
Collapse
|
10
|
A comparison between whole transcript and 3' RNA sequencing methods using Kapa and Lexogen library preparation methods. BMC Genomics 2019; 20:9. [PMID: 30616562 PMCID: PMC6323698 DOI: 10.1186/s12864-018-5393-3] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/18/2018] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND 3' RNA sequencing provides an alternative to whole transcript analysis. However, we do not know a priori the relative advantage of each method. Thus, a comprehensive comparison between the whole transcript and the 3' method is needed to determine their relative merits. To this end, we used two commercially available library preparation kits, the KAPA Stranded mRNA-Seq kit (traditional method) and the Lexogen QuantSeq 3' mRNA-Seq kit (3' method), to prepare libraries from mouse liver RNA. We then sequenced and analyzed the libraries to determine the advantages and disadvantages of these two approaches. RESULTS We found that the traditional whole transcript method and the 3' RNA-Seq method had similar levels of reproducibility. As expected, the whole transcript method assigned more reads to longer transcripts, while the 3' method assigned roughly equal numbers of reads to transcripts regardless of their lengths. We found that the 3' RNA-Seq method detected more short transcripts than the whole transcript method. With regard to differential expression analysis, we found that the whole transcript method detected more differentially expressed genes, regardless of the level of sequencing depth. CONCLUSIONS The 3' RNA-Seq method was better able to detect short transcripts, while the whole transcript RNA-Seq was able to detect more differentially expressed genes. Thus, both approaches have relative advantages and should be selected based on the goals of the experiment.
Collapse
|
11
|
Stimulation of reverse transcriptase generated cDNAs with specific indels by template RNA structure: retrotransposon, dNTP balance, RT-reagent usage. Nucleic Acids Res 2017; 45:10143-10155. [PMID: 28973469 PMCID: PMC5737552 DOI: 10.1093/nar/gkx689] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 07/24/2017] [Indexed: 01/03/2023] Open
Abstract
RNA dependent DNA-polymerases, reverse transcriptases, are key enzymes for retroviruses and retroelements. Their fidelity, including indel generation, is significant for their use as reagents including for deep sequencing. Here, we report that certain RNA template structures and G-rich sequences, ahead of diverse reverse transcriptases can be strong stimulators for slippage at slippage-prone template motif sequence 3′ of such ‘slippage-stimulatory’ structures. Where slippage is stimulated, the resulting products have one or more additional base(s) compared to the corresponding template motif. Such structures also inhibit slippage-mediated base omission which can be more frequent in the absence of a relevant stem–loop. Slippage directionality, base insertion and omission, is sensitive to the relative concentration ratio of dNTPs specified by the RNA template slippage-prone sequence and its 5′ adjacent base. The retrotransposon-derived enzyme TGIRT exhibits more slippage in vitro than the retroviral enzymes tested including that from HIV. Structure-mediated slippage may be exhibited by other polymerases and enrich gene expression. A cassette from Drosophila retrotransposon Dme1_chrX_2630566, a candidate for utilizing slippage for its GagPol synthesis, exhibits strong slippage in vitro. Given the widespread occurrence and importance of retrotransposons, systematic studies to reveal the extent of their functional utilization of RT slippage are merited.
Collapse
|
12
|
Transcriptomic comparison between two Vitis vinifera L. varieties (Trincadeira and Touriga Nacional) in abiotic stress conditions. BMC PLANT BIOLOGY 2016; 16:224. [PMID: 27733112 PMCID: PMC5062933 DOI: 10.1186/s12870-016-0911-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 09/28/2016] [Indexed: 05/20/2023]
Abstract
BACKGROUND Predicted climate changes announce an increase of extreme environmental conditions including drought and excessive heat and light in classical viticultural regions. Thus, understanding how grapevine responds to these conditions and how different genotypes can adapt, is crucial for informed decisions on accurate viticultural actions. Global transcriptome analyses are useful for this purpose as the response to these abiotic stresses involves the interplay of complex and diverse cascades of physiological, cellular and molecular events. The main goal of the present work was to evaluate the response to diverse imposed abiotic stresses at the transcriptome level and to compare the response of two grapevine varieties with contrasting physiological trends, Trincadeira (TR) and Touriga Nacional (TN). RESULTS Leaf transcriptomic response upon heat, high light and drought treatments in growth room controlled conditions, as well as full irrigation and non-irrigation treatments in the field, was compared in TR and TN using GrapeGene GeneChips®. Breakdown of metabolism in response to all treatments was evidenced by the functional annotation of down-regulated genes. However, circa 30 % of the detected stress-responsive genes are still annotated as «Unknown» function. Selected differentially expressed genes from the GrapeGene GeneChip® were analysed by RT-qPCR in leaves of growth room plants under the combination of individual stresses and of field plants, in both varieties. The transcriptomic results correlated better with those obtained after each individual stress than with the results of plants from field conditions. CONCLUSIONS From the transcriptomic comparison between the two Portuguese grapevine varieties Trincadeira and Touriga Nacional under abiotic stress main conclusions can be drawn: 1. A different level of tolerance to stress is evidenced by a lower transcriptome reprogramming in TN than in TR. Interestingly, this lack of response in TN associates with its higher adaptation to extreme conditions including environmental conditions in a changing climate; 2. A complex interplay between stress transcriptional cascades is evidenced by antagonistic and, in lower frequency, synergistic effects on gene expression when several stresses are imposed together; 3. The grapevine responses to stress under controlled conditions are not fully extrapolated to the complex vineyard scenario and should be cautiously considered for agronomic management decision purposes.
Collapse
|
13
|
Identification of RNA aptamers against recombinant proteins with a hexa-histidine tag. Methods Mol Biol 2014; 1111:41-56. [PMID: 24549611 DOI: 10.1007/978-1-62703-755-6_4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Artificial riboswitches that respond to the concentrations of intracellular proteins are a promising tool with a variety of applications. They can be designed and engineered using existing RNA aptamers that target proteins. Aptamers are generated via an iterative selection-amplification process, known as systematic evolution of ligands by exponential enrichment (SELEX). This chapter describes a SELEX procedure for the identification of RNA aptamers against hexa-histidine-tagged proteins. For the efficient enrichment of higher affinity aptamers, the selection stringency should be gradually increased. Undesired RNA species that bind to affinity resins can be eliminated from the pool by using a negative selection step and alternating different types of resins.
Collapse
|
14
|
Local and global factors affecting RNA sequencing analysis. Anal Biochem 2011; 419:317-22. [PMID: 21889483 DOI: 10.1016/j.ab.2011.08.013] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Revised: 08/08/2011] [Accepted: 08/08/2011] [Indexed: 10/17/2022]
Abstract
High-throughput RNA sequencing (RNA-seq) continues to provide unparalleled insight into transcriptome complexity. Now the "gold standard" for assessing global transcript levels, RNA-seq is poised to revolutionize our understanding of transcription and posttranscriptional regulation of RNA. Despite significant advantages over prior experimental strategies, RNA-seq is not without pitfalls. We have identified a number of confounding factors that significantly affect sequencing coverage. These include regional GC content, preferential sites of fragmentation, and read "pile-up" due to primer affinity and transcript end effects. Independent of cell type and laboratory, when ignored, these factors can bias analyses. Understanding the underlying principles responsible for producing these artifacts is key to recognizing both their presence and how their effects may be controlled. Here we outline the causes of and strategies to avoid several previously unreported complicating factors common to RNA-seq experiments.
Collapse
|
15
|
Abstract
Intact ribosomal RNAs (rRNAs) comprise the majority of somatic transcripts, yet appear conspicuously absent in spermatozoa, perhaps reflecting cytoplasmic expulsion during spermatogenesis. To discern their fate, total RNA retained in mature spermatozoa from three fertile donors was characterized by Next Generation Sequencing. In all samples, >75% of total sequence reads aligned to rRNAs. The distribution of reads along the length of these transcripts exhibited a high degree of non-uniformity that was reiterated between donors. The coverage of sequencing reads was inversely correlated with guanine-cytosine (GC)-richness such that sequences greater than ∼70% GC were virtually absent in all sperm RNA samples. To confirm the loss of sequence, the relative abundance of specific regions of the 28S transcripts in sperm was established by 7-Deaza-2'-deoxy-guanosine-5'-triphosphate RT-PCR. The inability to amplify specific regions of the 28S sequence from sperm despite the abundant representation of this transcript in the sequencing libraries demonstrates that approximately three-quarters of RNA retained in the mature male gamete are products of rRNA fragmentation. Hence, cleavage (not expulsion of the RNA component of the translational machinery) is responsible for preventing spurious translation following spermiogenesis. These results highlight the potential importance of those transcripts, including many mRNAs, which evade fragmentation and remain intact when sperm are delivered at fertilization. Sequencing data are deposited in GEO as: GSE29160.
Collapse
|
16
|
The rocks and shallows of deep RNA sequencing: Examples in the Vibrio cholerae RNome. RNA (NEW YORK, N.Y.) 2011; 17:1357-1366. [PMID: 21610211 PMCID: PMC3138571 DOI: 10.1261/rna.2682311] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2011] [Accepted: 04/15/2011] [Indexed: 05/30/2023]
Abstract
New deep RNA sequencing methodologies in transcriptome analyses identified a wealth of novel nonprotein-coding RNAs (npcRNAs). Recently, deep sequencing was used to delineate the small npcRNA transcriptome of the human pathogen Vibrio cholerae and 627 novel npcRNA candidates were identified. Here, we report the detection of 223 npcRNA candidates in V. cholerae by different cDNA library construction and conventional sequencing methods. Remarkably, only 39 of the candidates were common to both surveys. We therefore examined possible biasing influences in the transcriptome analyses. Key steps, including tailing and adapter ligations for generating cDNA, contribute qualitatively and quantitatively to the discrepancies between data sets. In addition, the state of 5'-end phosphorylation influences the efficiency of adapter ligation and C-tailing at the 3'-end of the RNA. Finally, our data indicate that the inclusion of sample-specific molecular identifier sequences during ligation steps also leads to biases in cDNA representation. In summary, even deep sequencing is unlikely to identify all RNA species, and caution should be used for meta-analyses among alternatively generated data sets.
Collapse
|
17
|
An efficient and rapid method for cDNA cloning from difficult templates using codon optimization and SOE-PCR: with human RANK and TIMP2 gene as examples. Biotechnol Lett 2011; 33:1939-47. [PMID: 21660576 DOI: 10.1007/s10529-011-0656-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2011] [Accepted: 05/26/2011] [Indexed: 10/18/2022]
Abstract
As gene cloning from difficult templates with regionalized high GC content is a long recognized problem, we have developed a novel and reliable method to clone such genes. Firstly, the high GC content region of the target cDNA was synthesized directly after codon optimization and the remaining cDNA fragment without high GC content was generated by routine RT-PCR. Then the entire redesigned coding sequence of the target gene was obtained by fusing the above available two cDNA fragments with SOE-PCR (splicing by overlapping extension-PCR). We have cloned the human RANK gene (ten exons; CDS 1851 bp) using this strategy. The redesigned cDNA was transfected into an eukaryotic expression system (A459 cells) to verify its expression. RT-PCR and western blotting confirmed this. To validate our method, we also successfully cloned human TIMP2 gene (five exons; CDS 660 bp) also having a regionalized high GC content. Our strategy for combining codon optimization and SOE-PCR to clone difficult genes is thus feasible and potentially universally applicable.
Collapse
|
18
|
Whole transcriptome analysis: what are we still missing? WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2010; 3:527-43. [PMID: 21197667 DOI: 10.1002/wsbm.135] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
New technologies such as tag-based sequencing and tiling arrays have provided unique insights into the transcriptional output of cells. Many new RNA classes have been uncovered in the past decade, despite limitations in current technologies. Even as the repertoire of known functional elements of the transcriptome increases and contemporary technologies become mainstream, inadequacies in conventional protocols for library preparation, sequencing and mapping continue to hamper revelation of the entire transcriptome of cells. In this article, we review current protocols and outline their deficiencies. We also provide our view on what we may be overlooking in the transcriptome, despite exhaustive investigations, and indicate future areas of technological development and research.
Collapse
|
19
|
Enhanced protocol for determining the 3' terminus of hepatitis C virus. J Virol Methods 2010; 167:158-64. [PMID: 20381536 DOI: 10.1016/j.jviromet.2010.03.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2010] [Revised: 03/27/2010] [Accepted: 03/30/2010] [Indexed: 11/22/2022]
Abstract
The determination of viral 3' ends is a routine practice in molecular biology. However, this has been a challenging task for hepatitis C virus (HCV), an enveloped single-stranded, positive-sense RNA virus classified into the Flaviviridae family. The extreme end of HCV 3' untranslated region (3'UTR), the so-called 3' X tail, was not identified at the time of HCV discovery. Complete HCV 3'UTR sequences occupy a very small percentage of the exponentially growing HCV sequence databases. Although commercial kits and experimental protocols are available, these methods are both tedious and not reproducible. A stepwise optimization procedure was developed as a simple and robust protocol for determining the complete HCV 3'UTR from clinical samples. The availability of abundant authentic sequence information for the complete HCV 3'UTR will allow full investigation of its biological role in the life cycle of HCV.
Collapse
|
20
|
Differential Gene Expression and Subcellular Targeting of Arabidopsis Glutathione S-Transferase F8 Is Achieved through Alternative Transcription Start Sites. J Biol Chem 2007; 282:28915-28928. [PMID: 17670748 DOI: 10.1074/jbc.m702207200] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Glutathione S-transferases (GSTs) play major roles in the protection of plants from biotic and abiotic stresses through the detoxification of xenobiotics and toxic endogenous products. This report describes additional complexity in the regulation of the well characterized stress-responsive Arabidopsis thaliana GSTF8 promoter. This complexity results from the use of multiple transcription start sites (TSS) to give rise to alternate GSTF8 transcripts with the potential to produce two in-frame proteins differing only in their N-terminal sequence. In addition to the originally mapped TSS (Chen, W., Chao, G., and Singh, K. B. (1996) Plant J. 10, 955-966), a further nine TSS have been identified, with the majority clustered into a distinct group. The most 3' TSS gives rise to the major message (GSTF8-S) and the shorter form of the protein, whereas those originating from upstream TSS (GSTF8-L) are more weakly expressed and encode for the larger form of the protein. Differential tissue-specific and stress-responsive expression patterns were observed (e.g. GSTF8-L is more highly expressed in leaves compared with roots, whereas GSTF8-S expression has the opposite pattern and is much more stress-responsive). Analysis of GSTF8-L and GSTF8-S proteins demonstrated that GSTF8-L is solely targeted to plastids, whereas GSTF8-S is cytoplasmic. In silico analysis revealed potential conservation of GSTF8-S across a wide range of plants; in contrast, conservation of GSTF8-L was confined to the Brassicaceae. These studies demonstrate that alternate TSS of the GSTF8 promoter are used to confer differential tissue-specific and stress-responsive expression patterns as well as to target the same protein to two different subcellular localizations.
Collapse
|
21
|
Abstract
KSHV has been established as the causative agent of KS, PEL, and MCD, malignancies occurring more frequently in AIDS patients. The aggressive nature of KSHV in the context of HIV infection suggests that interactions between the two viruses enhance pathogenesis. KSHV latent infection and lytic reactivation are characterized by distinct gene expression profiles, and both latency and lytic reactivation seem to be required for malignant progression. As a sophisticated oncogenic virus, KSHV has evolved to possess a formidable repertoire of potent mechanisms that enable it to target and manipulate host cell pathways, leading to increased cell proliferation, increased cell survival, dysregulated angiogenesis, evasion of immunity, and malignant progression in the immunocompromised host. Worldwide, approximately 40.3 million people are currently living with HIV infection. Of these, a significant number are coinfected with KSHV. The complex interplay between the two viruses dramatically elevates the risk for development of KSHV-induced malignancies, KS, PEL, and MCD. Although HAART significantly reduces HIV viral load, the entire T-cell repertoire and immune function may not be completely restored. In fact, clinically significant immune deficiency is not necessary for the induction of KSHV-related malignancy. Because of variables such as lack of access to therapy noncompliance with prescribed treatment, failure to respond to treatment and the development of drug-resistant strains of HIV, KSHV-induced malignancies will continue to present as major health concerns.
Collapse
|
22
|
A PCR-based method for detection and quantification of small RNAs. Biochem Biophys Res Commun 2006; 351:756-63. [PMID: 17084816 PMCID: PMC1934510 DOI: 10.1016/j.bbrc.2006.10.105] [Citation(s) in RCA: 124] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2006] [Accepted: 10/23/2006] [Indexed: 10/24/2022]
Abstract
Recent cloning efforts have identified hundreds of thousands of small RNAs including micro RNAs (miRNAs), Piwi-interacting RNAs (piRNAs), and small nucleolar RNAs (snoRNAs). These non-coding small RNAs need to be further validated and characterized by detecting and quantifying their expression in different tissues and during different developmental courses. A simple, accurate, and sensitive method for small RNA expression profiling is in high demand. Here, we report such a PCR-based method.
Collapse
|
23
|
Reverse transcription using random pentadecamer primers increases yield and quality of resulting cDNA. Biotechniques 2006; 40:649-57. [PMID: 16708763 DOI: 10.2144/000112153] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Reverse transcription of RNA is an invaluable method for gene expression analysis by real-time PCR or microarray methods. Random primers of varying lengths were compared with respect to their efficiency of priming reverse transcription reactions. The results showed that 15-nucleotide-long random oligonucleotides (pentadecamers) consistently yielded at least 2-fold as much cDNA as did random hexamers using either poly(A) RNA or an amplified version of messenger RNA (aRNA) as a template. The cDNA generated using pentadecamers did not differ in size distribution or the amount of incorporated label compared with cDNA generated with random hexamers. The increased efficiency of priming using random pentadecamers resulted in reverse transcription of >80% of the template aRNA, while random hexamers induced reverse transcription of only 40% of the template aRNA. This suggests a better coverage of the transcriptome when using random pentadecamers over random hexamers. Using the same amount of aRNA as starting material, random pentadecamer-primed reactions resulted in 11-fold more genes being detected in whole transcriptome DNA microarray experiments than random hexamer-primed reactions. The results indicate that random pentadecamers can replace random hexamers in reverse transcription reactions on both poly(A) RNA and amplified RNA, resulting in higher cDNA yields and quality.
Collapse
|
24
|
Template switching within exons 3 and 4 of KV11.1 (HERG) gives rise to a 5' truncated cDNA. Biochem Biophys Res Commun 2006; 345:1342-9. [PMID: 16723117 DOI: 10.1016/j.bbrc.2006.05.032] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2006] [Accepted: 05/02/2006] [Indexed: 10/24/2022]
Abstract
K(V)11.1 (HERG) channels contribute to membrane potential in a number of excitable cell types. We cloned a variant of K(V)11.1 from human jejunum containing a 171 bp deletion spanning exons 3 and 4. Expression of a full-length cDNA clone containing this deletion gave rise to protein that trafficked to the cell membrane and generated robust currents. The deletion occurred in a G/C-rich region and identical sequence elements of UGGUGG were located at the deletion boundaries. In recent studies these features have been implicated to cause deletions via template switching during cDNA synthesis. To examine this possibility we compared cDNAs from human brain, heart, and jejunum synthesized at lower (42 degrees C) and higher temperatures (70 degrees C). The 171 bp deletion was absent at the higher temperature. Our results suggest that the sequence and secondary structure of mRNA in the G/C rich region leads to template switching producing a cDNA product with a 171 bp deletion.
Collapse
|
25
|
Inverted Gcg/CGC trinucleotide microsatellites in the 5'-region of Mus IDS mRNA: recurrent induction of aberrant reverse transcripts. Mol Biol Rep 2004; 31:107-12. [PMID: 15293786 DOI: 10.1023/b:mole.0000031386.23515.c4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
An investigation was initiated to explore previously published results indicating that approximately 80 bp of the 5'-end of the iduronate sulfatase (IDS) cDNA sequence (Accession No L07291) are 100% homologous with the 3'-UTR of isoform I of the sodium hydrogen exchanger (Acc. No. U51112). 5'-RACE carried out on IDS mRNA demonstrated the apparent homology to be a cloning artifact. A sequence comparison of the IDS 5'-RACE product with a mouse BAC clone covering the region, and with various IDS ESTs, suggested that the region is highly susceptible to cloning artifacts, a common one of which is template switching by reverse transcriptase. The nucleotide sequence flanking the translation start site is unusual in containing two inverted repeats composed of the complementary trinucleotide microsatellites, (GCG)9 and (CGC)6. These likely form a highly stable stem of 20-21 nt, through which reverse transcription is compromised. Such a stem could be involved in the regulation of IDS expression by directly affecting translation, message turnover, or serving as a substrate for siRNA production. Though such mRNA features are relatively rare, they may be more abundant but overlooked due to difficulties in their reverse transcription.
Collapse
|
26
|
Identification of a novel cellular transcriptional repressor interacting with the latent nuclear antigen of Kaposi's sarcoma-associated herpesvirus. J Virol 2003; 77:9758-68. [PMID: 12941884 PMCID: PMC224565 DOI: 10.1128/jvi.77.18.9758-9768.2003] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The latent nuclear antigen (LNA) of Kaposi's sarcoma-associated herpesvirus (KSHV) has an essential role in viral latent infection. LNA maintains the stability of KSHV episomes and modulates the expression of cellular genes. A novel cellular protein KLIP1 was identified to interact with LNA through yeast two-hybrid screening, and confirmed by a glutathione S-transferase pull down assay. Domain mapping showed that KLIP1 interacted with the N-terminal domain of LNA. Northern blot hybridization with a KLIP1 probe identified a major transcript of 1.8 kb and a minor transcript of 2.8 kb. cDNA library screening and 5'-RACE revealed that the major transcript encoded an open-reading-frame of 1,257 bp and had a 5'-untranslated region of 73 nucleotides. The major KLIP1 transcript was ubiquitously present in different cell types examined. A KLIP1 synthetic peptide antibody detected a doublet of 58-kDa and 63-kDa proteins in a Western blot assay. KLIP1 had two putative nuclear localization signals and showed punctate nuclear localization when expressed as a GFP-fusion protein. KLIP1 interacted with LNA in vivo, as demonstrated by coimmunoprecipitation using KSHV-infected cells and colocalization when they were expressed as GFP- and DsRed-fusion proteins, respectively. Consistent with its interaction with LNA, nuclear localization, and possession of two leucine zipper motifs, KLIP1 behaved like a transcriptional factor and repressed herpes simplex virus thymidine kinase (TK) promoter activity in a mammalian one-hybrid assay. In addition, cotransfection with LNA alleviated the transcriptional repression effect of KLIP1 on TK promoter activity. These results suggest that KLIP1 is a new member of cellular transcriptional repressors, and that LNA is involved in deregulating cellular transcription process.
Collapse
|
27
|
Abstract
Transforming growth factor beta 4 (TGF-beta 4) is unique to avian species, though its roles in vivo have not yet been well established. In this paper we describe the expression and partial characterization of recombinant chicken TGF-beta 4. By using a GC-rich PCR system in a modified 5'RACE methodology we generated the 5'-end of cDNA sequence encoding the TGF-beta 4 precursor, which was in-frame cloned into pcDNA3.1/V5-His-TOPO and transfected into the Chinese hamster ovary cell line (CHO-K1). A cell line stably expressing TGF-beta 4 precursor protein was established from CHO-K1 cells. Acid-activated mature TGF-beta 4 inhibited the growth of mink lung epithelial (Mv1Lu) cell line. TGF-beta 4 also stimulated the expression of type I procollagen and enhanced heat shock protein 47 (Hsp47) expression in chicken tendon fibroblasts. Hsp47 expression by TGF beta 4 is likely regulated through activation of heat shock transcription factor 1 (HSF1). Because the presence of TGF-beta 1 has not been documented in avian cells and our data show that TGF-beta 4 elicits biological activities in chicken tendon cells, which closely parallel that of TGF-beta 1, we propose that TGF-beta 4 plays roles in avian species similar to what TGF-beta 1 plays in mammalian species.
Collapse
|
28
|
Analysis of the regulation of the A33 antigen gene reveals intestine-specific mechanisms of gene expression. J Biol Chem 2002; 277:34531-9. [PMID: 12114523 DOI: 10.1074/jbc.m204865200] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The A33 antigen is a transmembrane protein expressed almost exclusively by intestinal epithelial cells. The level of its expression is robust and uniform throughout the rostrocaudal axis of the human and mouse intestines. In the colon, strong expression is found in the basolateral membranes of both the proliferating cells in the lower regions of the crypts and the differentiating cells in the upper regions of crypts. Similarly, in the small intestine, the protein is highly expressed by all the epithelial cells in the crypts and by the differentiated cells migrating over the villi. Thus, the A33 antigen has emerged as a definitive marker for all intestinal epithelial cells, irrespective of cell lineage and differentiation status. To understand the molecular mechanisms mediating this rare tissue-specific expression pattern, we undertook a comprehensive analysis of the 5'-regulatory region of the human A33 antigen gene. This allowed us to point to positive cis-regulatory elements incorporating consensus Krüppel-like factor and caudal-related homeobox (CDX)-binding sites, located just upstream from the human A33 antigen transcription start site, as being important for the intestine-specific expression pattern of this gene. Further analysis provided evidence that the A33 antigen gene may be one of only a few target genes to be described thus far for the intestine-specific homeobox transcription factor, CDX1. Taken together, our data lead us to propose that the activity of CDX1 is pivotal in mediating the exquisite, intestine-specific expression pattern of the A33 antigen gene.
Collapse
|