1
|
Abebe JS, Alwie Y, Fuhrmann E, Leins J, Mai J, Verstraten R, Schreiner S, Wilson AC, Depledge DP. Nanopore Guided Annotation of Transcriptome Architectures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.02.587744. [PMID: 38617228 PMCID: PMC11014626 DOI: 10.1101/2024.04.02.587744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
High-resolution annotations of transcriptomes from all domains of life are essential for many sequencing-based RNA analyses, including Nanopore direct RNA sequencing (DRS), which would otherwise be hindered by misalignments and other analysis artefacts. DRS allows the capture and full-length sequencing of native RNAs, without recoding or amplification bias, and resulting data may be interrogated to define the identity and location of chemically modified ribonucleotides, as well as the length of poly(A) tails on individual RNA molecules. Existing software solutions for generating high-resolution transcriptome annotations are poorly suited to small gene dense organisms such as viruses due to the challenge of identifying distinct transcript isoforms where alternative splicing and overlapping RNAs are prevalent. To resolve this, we identified key characteristics of DRS datasets and developed a novel approach to transcriptome. We demonstrate, using a combination of synthetic and original datasets, that our novel approach yields a high level of precision and recall when reconstructing both gene sparse and gene dense transcriptomes from DRS datasets. We further apply this approach to generate a new high resolution transcriptome annotation of the neglected pathogen human adenovirus type F 41 for which we identify 77 distinct transcripts encoding at least 23 different proteins.
Collapse
Affiliation(s)
- Jonathan S. Abebe
- Department of Microbiology, New York University School of Medicine, New York, NY, USA
| | - Yasmine Alwie
- Institute of Virology, Hannover Medical School, Hannover, Germany
| | - Erik Fuhrmann
- Institute of Virology, Hannover Medical School, Hannover, Germany
| | - Jonas Leins
- Institute of Virology, Hannover Medical School, Hannover, Germany
| | - Julia Mai
- Institute of Virology, Hannover Medical School, Hannover, Germany
- Institute of Virology, University Medical Center, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Ruth Verstraten
- Institute of Virology, Hannover Medical School, Hannover, Germany
- German Center for Infection Research (DZIF), partner site Hannover-Braunschweig, Hannover, Germany
| | - Sabrina Schreiner
- Institute of Virology, Hannover Medical School, Hannover, Germany
- Institute of Virology, University Medical Center, Albert-Ludwigs-University Freiburg, Freiburg, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
| | - Angus C. Wilson
- Department of Microbiology, New York University School of Medicine, New York, NY, USA
| | - Daniel P. Depledge
- Department of Microbiology, New York University School of Medicine, New York, NY, USA
- Institute of Virology, Hannover Medical School, Hannover, Germany
- German Center for Infection Research (DZIF), partner site Hannover-Braunschweig, Hannover, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
| |
Collapse
|
2
|
Abstract
Pathogen entry into cells occurs by direct penetration of the plasma membrane, clathrin-mediated endocytosis, caveolar endocytosis, pinocytosis or macropinocytosis. For a particular agent, the infectious pathways are typically restricted, reflecting a tight relationship with the host. Here, we survey the uptake process of human adenovirus (Ad) type 2 and 5 and integrate it into the cell biology of endocytosis. Ad2 and Ad5 naturally infect respiratory epithelial cells. They bind to a primary receptor, the coxsackie virus B Ad receptor (CAR). The CAR-docked particles activate integrin coreceptors and this triggers a variety of cell responses, including endocytosis. Ad2/Ad5 endocytosis is clathrin-mediated and involves the large GTPase dynamin and the adaptor protein 2. A second endocytic process is induced simultaneously with viral uptake, macropinocytosis. Together, these pathways are associated with viral infection. Macropinocytosis requires integrins, F-actin, protein kinase C and small G-proteins of the Rho family, but not dynamin. Macropinocytosis per se is not required for viral uptake into epithelial cells, but it appears to be a productive entry pathway of Ad artificially targeted to the high-affinity Fcgamma receptor CD64 of hematopoietic cells lacking CAR. In epithelial and hematopoietic cells, the macropinosomal contents are released to the cytosol. This requires viral signalling from the surface and coincides with particle escape from endosomes and infection. It emerges that incoming Ad2 and Ad5 distinctly modulate the endocytic trafficking and disrupt selective cellular compartments. These features can be exploited for effective artificial targeting of Ad vectors to cell types of interest.
Collapse
Affiliation(s)
- Oliver Meier
- Zoologisches Institut der Universität Zürich, Winterthurerstrasse 190, 8057 Zürich, Switzerland
| | | |
Collapse
|
3
|
Affiliation(s)
- V Mautner
- Medical Research Council Virology Unit, Institute of Virology, University of Glasgow, Scotland, UK
| | | | | |
Collapse
|
4
|
Wang HG, Yaciuk P, Ricciardi RP, Green M, Yokoyama K, Moran E. The E1A products of oncogenic adenovirus serotype 12 include amino-terminally modified forms able to bind the retinoblastoma protein but not p300. J Virol 1993; 67:4804-13. [PMID: 8331729 PMCID: PMC237867 DOI: 10.1128/jvi.67.8.4804-4813.1993] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The cell growth-regulating properties of the adenovirus type 5 (Ad5) E1A oncogene correlate closely with the binding of the E1A products to specific cellular proteins. These proteins include the products of the retinoblastoma tumor susceptibility gene and a 300-kDa product, p300. pRB binds to E1A sequences that are highly conserved among the E1A products of various serotypes, while p300 binding requires sequences in the E1A amino terminus, a region that is not highly conserved. To help evaluate the roles of the E1A-associated proteins in cell growth control, we have compared the p300-binding abilities of the E1A products of Ad5 and of the more oncogenic Ad12 serotype. We show here that despite encoding a sequence that varies somewhat from the p300-binding sequences of Ad5 E1A, the Ad12 E1A products associate with p300 with an affinity similar to that of the Ad5 E1A products. Both the 12S and 13S splice products of Ad12 E1A, like those of Ad5 E1A, encode proteins able to associate with p300. Interestingly, though, both also give rise to prominent forms that are amino terminally modified and unable to associate with p300. This modification, at least in the 13S product, does not appear to diminish the affinity of this product for the retinoblastoma protein.
Collapse
Affiliation(s)
- H G Wang
- Cold Spring Harbor Laboratory, New York 11724-2206
| | | | | | | | | | | |
Collapse
|
5
|
Wang HG, Rikitake Y, Carter MC, Yaciuk P, Abraham SE, Zerler B, Moran E. Identification of specific adenovirus E1A N-terminal residues critical to the binding of cellular proteins and to the control of cell growth. J Virol 1993; 67:476-88. [PMID: 8416379 PMCID: PMC237385 DOI: 10.1128/jvi.67.1.476-488.1993] [Citation(s) in RCA: 230] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Adenovirus early region 1A (E1A) oncogene-encoded sequences essential for transformation- and cell growth-regulating activities are localized at the N terminus and in regions of highly conserved amino acid sequence designated conserved regions 1 and 2. These regions interact to form the binding sites for two classes of cellular proteins: those, such as the retinoblastoma gene product, whose association with the E1A products is specifically dependent on region 2, and another class which so far is known to include only a large cellular DNA-binding protein, p300, whose association with the E1A products is specifically dependent on the N-terminal region. Association between the E1A products and either class of cellular proteins can be disrupted by mutations in conserved region 1. While region 2 has been studied intensively, very little is known so far concerning the nature of the essential residues in the N-terminal region, or about the manner in which conserved region 1 participates in the binding of two distinct sets of cellular proteins. A combination of site-directed point mutagenesis and monoclonal antibody competition experiments reported here suggests that p300 binding is dependent on specific, conserved residues in the N terminus, including positively charged residues at positions 2 and 3 of the E1A proteins, and that p300 and pRB bind to distinct, nonoverlapping subregions within conserved region 1. The availability of precise point mutations disrupting p300 binding supports previous data linking p300 with cell cycle control and enhancer function.
Collapse
Affiliation(s)
- H G Wang
- Cold Spring Harbor Laboratory, New York 11724
| | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
Enteric adenovirus type 41 (Ad41) is defective for growth in conventional established cell lines. Ad41 is dependent on the Ad5 early regions E1A/E1B since it cannot grow in HEK cells but only in 293 HEK cells transformed by Ad5 E1 region. However, Hep-2 cells have also been shown to support the growth of Ad41 to some extent. The nucleotide sequence of the E1B region of the Ad41 strain D389 has been determined. When compared to the corresponding region of the Ad41 prototype strain (Tak) the degree of homology in the DNA sequences was close to 100%. The mRNAs from the E1B region of the Ad41 strain D389 have been studied by Northern blot, primer extension, and polymerase chain reaction-cDNA analysis. E1B transcripts corresponding to Ad2 14 S, 22 S, and 9 S mRNAs were identified but no 13 S mRNA equivalent was detected, a pattern similar to that seen in the Ad40 and Ad12 transcription maps. However, the Ad41 E1B 14S mRNA equivalent has one additional small exon of 23 nucleotides, created by a donor and an acceptor splice site located at positions not seen in other E1B transcripts of human adenoviruses analyzed so far. The coding potential for E1B 19K, 55K, and 15K proteins and for pIX is retained in the Ad41 transcripts. In contrast to other adenoviruses, except for the closely related Ad40, the ORF of pIX starts in the intron of the 22 S mRNA.
Collapse
Affiliation(s)
- A Allard
- Department of Virology, University of Umeå, Sweden
| | | |
Collapse
|
7
|
Steinthorsdottir V, Mautner V. Enteric adenovirus type 40:E1B transcription map and identification of novel E1A-E1B cotranscripts in lytically infected cells. Virology 1991; 181:139-49. [PMID: 1825250 DOI: 10.1016/0042-6822(91)90478-t] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Adenovirus 40 (Ad40) is defective for growth in tissue culture but is complemented when the Ad2/5 or Ad12 E1B 55K protein is supplied in trans. Ad40 E1B mRNA has not been detected in E1-transformed cells, or at early times in lytically infected cells. In cells constitutively expressing the E1B region of Ad2, Ad40 E1B mRNAs are detected at late times in infection, after the onset of DNA replication. We have determined the Ad40 E1B transcription map from RNA produced at late times in infected KB16 cells, using S1 nuclease, primer extension, PCR-cDNA analysis, and Northern blotting. E1B transcripts corresponding to Ad2 14 S, 22 S, and 9 S mRNAs were identified but no 13 S mRNA equivalent was detected, a pattern similar to that seen in the Ad12 transcription map. The coding potential for E1B 19K, 55K, and 15K proteins and for ppIX is retained in the Ad40 transcripts. In addition we find novel E1A-E1B cotranscript counterparts of the 14 S and 22 S mRNAs. These contain the first 40 codons of the E1A first exon linked to a site 4-5 nt downstream of the E1B cap site, retaining all the coding potential of the E1B mRNAs. No new open reading frames are created by the junction, and the E1A ORF terminates with one codon added after the junction. Each E1A-E1B cotranscript is present in abundance comparable to that of its authentic E1B counterpart. The E1A-E1B junction is unusual in that it does not conform to splice consensus sequences and thus may not be generated by a conventional splicing mechanism.
Collapse
MESH Headings
- Adenovirus Early Proteins
- Adenoviruses, Human/genetics
- Base Sequence
- Blotting, Northern
- Cell Transformation, Viral
- Cloning, Molecular
- Genetic Complementation Test
- Humans
- KB Cells
- Molecular Sequence Data
- Oligonucleotide Probes
- Oncogene Proteins, Viral/genetics
- Plasmids
- Polymerase Chain Reaction
- RNA, Messenger/genetics
- RNA, Messenger/isolation & purification
- RNA, Viral/genetics
- RNA, Viral/isolation & purification
- Restriction Mapping
- Transcription, Genetic
Collapse
|
8
|
Allard A, Girones R, Juto P, Wadell G. Polymerase chain reaction for detection of adenoviruses in stool samples. J Clin Microbiol 1990; 28:2659-67. [PMID: 2279998 PMCID: PMC268252 DOI: 10.1128/jcm.28.12.2659-2667.1990] [Citation(s) in RCA: 218] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The usefulness of the polymerase chain reaction (PCR) method for diagnosing adenovirus infections was investigated. Several primers, including primers specific for the hexon-coding region and for enteric adenovirus types 40 and 41, were evaluated. The PCR method was validated against cell culturing in routine diagnostic work and against restriction enzyme analysis of viral DNA. Sixty diagnostic specimens were selected for evaluation by the PCR method. Twenty of the 60 specimens were found positive on the basis of cytopathic effects and latex agglutination (Adenolex [Orion Diagnostica, Helsinki, Finland]), and 16 were identified and typed as adenoviruses by polyacrylamide gel electrophoresis. PCR was performed on all 60 specimens in parallel directly on diluted stool samples and on viral DNA extracted from cells inoculated with the same stool samples. When the general hexon primers were used 51 of the 60 specimens from infected cell cultures were found positive by PCR, whereas only 13 specimens were found positive when PCR was performed directly on stool samples. With the use of selective primers for enteric adenoviruses 16 of the 60 cell cultures were found to exhibit amplification products by PCR, whereas 4 were detected in stool samples. None of the 60 specimens were found positive by PCR when an adenovirus type 40-specific primer pair was used. PCR was found to be a fast, sensitive, and reliable method for the detection of adenoviruses in diarrheal disease, provided the amplifications were performed directly on diluted stool samples.
Collapse
Affiliation(s)
- A Allard
- Department of Virology, University of Umeå, Sweden
| | | | | | | |
Collapse
|
9
|
Abstract
The fiber gene of adenovirus type 41 was sequenced and compared to the fiber gene sequence of adenovirus type 40 (A. H. Kidd and M. J. Erasmus, 1989, Virology 172, 134-144), the other known member of subgroup F. The open reading frame, from map units 87 through 92 with transcription from the r-strand, comprised 1686 bases and was 45 bases longer than its counterpart on the Ad40 genome. The 45-base difference appears to have resulted from a block deletion on the Ad40 sequence. Apart from this one region, the Ad40 and Ad41 fiber genes showed remarkably high homology (95.6%), indicating a relatively recent evolutionary divergence. The deduced amino acid sequence of the Ad41 fiber polypeptide was analyzed according to the model of N. M. Green et al. (1983, EMBO J. 2, 1357-1365) for the structure of the adenovirus fiber. Ad41 had one more 15-residue repeat in the shaft region than Ad40, there being 22 repeat motifs. A detailed study of various Ad40 and Ad41 strains with proven genome differences indicated that the 15-amino acid difference in polypeptide length at the 14th repeat motif is a type-specific difference among the subgroup F adenoviruses. However, two uncommon Ad41 strains belonging to 2 of the 16 Ad41 genome types tested had a 15-amino acid block deletion which was different to that of the Ad40 polypeptide. The implication from this work is that the Ad40 fiber gene probably arose from its Ad41 counterpart, but the fiber gene sequences of both types of subgroup F adenovirus are so similar that genetic recombination between strains could occur with some frequency.
Collapse
Affiliation(s)
- A H Kidd
- Department of Virology, Auckland Hospital, New Zealand
| | | | | |
Collapse
|
10
|
Spibey N, McClory RS, Cavanagh HM. Identification and nucleotide sequence of the early region 1 from canine adenovirus types 1 and 2. Virus Res 1989; 14:241-55. [PMID: 2623943 DOI: 10.1016/0168-1702(89)90005-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The genome of canine adenovirus type 1 (CAV-1) has been cloned and restriction maps compiled. These maps are compared with those of canine adenovirus type 2 (CAV-2). The left ends of both genomes were further characterised by DNA sequence analysis. Several features of the DNA sequence and predicted polypeptide sequence are similar to those of the human adenoviruses. The level of homology observed across the E1 regions appears to be of the same order as the overall DNA similarity between CAV-1 and CAV-2 (75%). Transfection experiments using the presumptive E1a containing region of CAV-2 suggests that it encodes a transactivating function typical of the human adenovirus E1a genes.
Collapse
Affiliation(s)
- N Spibey
- Department of Veterinary Pathology, University of Glasgow Veterinary School, Scotland, U.K
| | | | | |
Collapse
|
11
|
Abstract
The fiber gene of human adenovirus type 40 has been characterized. The 6.1-kbp EcoRI fragment C of the Ad40 genome, from map units 74 through 92, was cloned and the right-most 2.8 kbp from 84 map units was sequenced. By analogy with Ad2, this region would be expected to contain the gene specifying the Ad40 fiber polypeptide. Sequencing revealed an open reading frame of 1641 bases on the r-strand, the first 53 bases of which had marked homology with the corresponding L5 (fiber) regions of Ad2 (77.2%), Ad5 (75.0%), and Ad3 (64.3%). In addition, base positions 1114 to 1146 of this open reading frame had 85% homology with base positions 1198 to 1230 of the Ad2 fiber gene. The predicted polypeptide sequence of 547 amino acids showed marked homology with the Ad2, Ad5, and Ad3 fiber polypeptides in two regions, in the first 55 amino acids from the N-terminus and from amino acids 372 through 382. Analysis of hydrophobic amino acid positions revealed a repeating pattern of approximately 15 residues between positions 42 and 374, with 21 repeats. The sequence of the Ad40 polypeptide thus fits the model of Green et al. [1983), EMBO J. 2, 1357-1365) for the structure of the adenovirus fiber, but is 35 amino acids shorter than the Ad2 fiber polypeptide, with one less 15-residue repeat in the shaft region. According to this model, the regions of highest homology between the Ad40 fiber polypeptide and those of Ad2, Ad5, and Ad3 correspond to the tail of the shaft and the base of the knob. The results of this analysis are in agreement with previously published EM data on the fiber length of subgroup F adenoviruses.
Collapse
Affiliation(s)
- A H Kidd
- Department of Virology, University of the Witwatersrand, South Africa
| | | |
Collapse
|
12
|
Ball AO, Beard CW, Redick SD, Spindler KR. Genome organization of mouse adenovirus type 1 early region 1: a novel transcription map. Virology 1989; 170:523-36. [PMID: 2543128 DOI: 10.1016/0042-6822(89)90444-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Mouse adenovirus type 1 (MAV-1) genomic DNA from 8.9 to 13.7 map units was sequenced and the early region 1 (E1) transcription map was determined by S1 nuclease, primer extension, and Northern analyses, and cDNA sequencing. The E1 transcription map of MAV-1 had marked dissimilarities from the conserved transcription maps of primate adenovirus E1s. One major E1A and two E1B mRNAs were identified in overlapping transcription units. The single E1A mRNA was composed of three exons; the last exon was coincident with the last exon of the E1B mRNAs. While human adenovirus type 2 (Ad2) utilizes alternate splice donors for the first E1A mRNA exon, MAV-1 does not. Thus, no protein is predicted that would correspond to the Ad2 243 amino acid protein, although MAV-1 can encode a protein similar to the Ad2 289 amino acid protein (A. O. Ball, M. E. Williams, and K. R. Spindler, 1988, J. Virol. 62, 3947-3957). Two spliced E1B mRNAs differed from each other in an intron near the 5' end of the smaller E1B mRNA. This smaller mRNA could encode only the 55K E1B protein, while the larger mRNA could encode both the 21K and 55K E1B proteins.
Collapse
Affiliation(s)
- A O Ball
- Department of Genetics, University of Georgia, Athens 30602
| | | | | | | |
Collapse
|