1
|
Nangia-Makker P, Sarvis R, Visscher DW, Bailey-Penrod J, Raz A, Sarkar FH. Galectin-3 and L1 retrotransposons in human breast carcinomas. Breast Cancer Res Treat 1998; 49:171-83. [PMID: 9696400 DOI: 10.1023/a:1005913810250] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Galectin-3 is a galactoside binding protein found at elevated levels in a wide variety of neoplastic cells and thought to be involved in cognitive cellular interactions during transformation and metastasis. Previously, we have shown that introduction of human galectin-3 (Mr 31,000) cDNA into the human breast cancer cells BT-549 which are galectin-3 null and non-tumorigenic in nude mice resulted in the establishment of four galectin-3 expressing clones. Three of them acquired tumorigenicity when inoculated in the mammary fat pad of nude mice. Here, we questioned what is the molecular difference between the nude mouse tumorigenic and non-tumorigenic galectin-3 expressing BT-549 cell clones. Differential display analysis and Northern blotting revealed that, unlike the tumorigenic clones, neither the parental cells nor the non-tumorigenic clone expressed a 6.5 Kb transcript. A 607 bp PCR (polymerase chain reaction) product from the differentially displayed mRNA revealed a 93% sequence homology with the human L1 retrotransposon previously suggested to play a role in the pathobiology of some breast cancers. In addition, we show that the two gene products, i.e., galectin-3 and L1, are co-expressed in breast carcinoma specimens and in other nude mouse tumorigenic cell lines.
Collapse
Affiliation(s)
- P Nangia-Makker
- Tumor Progression and Metastasis Program, Karmanos Cancer Institute, Detroit, MI 48201, USA
| | | | | | | | | | | |
Collapse
|
2
|
Ogura T, Okano K, Tsuchida K, Miyajima N, Tanaka H, Takada N, Izumi S, Tomino S, Maekawa H. A defective non-LTR retrotransposon is dispersed throughout the genome of the silkworm, Bombyx mori. Chromosoma 1994; 103:311-23. [PMID: 7821086 DOI: 10.1007/bf00417878] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The presence of long repetitive sequences is demonstrated in the genome of the silkworm, Bombyx mori. Members of this BMC1 family reveal several features typical of the L1 (long interspersed sequence one) family of mammals, except for species specific elements. The number of BMC1 elements is estimated to be approximately 3500 per haploid genome. Elements containing the full length unit of 5.1 kb are dispersed throughout the genome and their restriction sites are conserved, although most members are preferentially truncated to varying extents at their 5' ends. DNA sequencing indicates that this element contains six tandem repeats of 15 bp CpG-rich sequence in the 5' proximal region. It terminates with a 3' oligo(A) stretch, and is flanked at both ends by a 7-10 bp target sequence duplication. In addition, there is significant evidence for amino acid sequence homology with reverse transcriptase domains of other L1 families, especially F, Doc and Jockey of Drosophila melanogaster. No large open reading frame is present. The BMC1 element is suggested to be dispersed in the genome by a transposition mechanism involving RNA intermediates.
Collapse
Affiliation(s)
- T Ogura
- Department of Molecular Cell Biology, Kumamoto University School of Medicine, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Abstract
Reverse transcriptase catalyses the conversion of RNA into DNA. This operation seems to have largely contributed to the evolution of complex genomes. More than 10% of a mammalian genome is composed of sequences with reverse transcribed origin, most of which consists of repeated sequences (SINEs, LINEs). In spite of their simplicity, these sequences can play a key role in evolution by favoring illegitimate recombination. In addition to this abundant material, retrotransposed sequences include retrotransposons, retroviruses and genes depleted from intervening sequences, known as pseudogenes. Some of these sequences can be functional or involved in the regulation of neighbouring genes. These hallmarks of reverse transcription activity indicate that it has largely contributed to the fluidity of modern genomes.
Collapse
Affiliation(s)
- P Nouvel
- Unité de Génétique des Mammifères, Institut Pasteur, Paris, France
| |
Collapse
|
4
|
Uitterlinden AG, Slagboom EP, Mullaart E, Meulenbelt I, Vijg J. Genome scanning by two-dimensional DNA typing: the use of repetitive DNA sequences for rapid mapping of genetic traits. Electrophoresis 1991; 12:119-34. [PMID: 1674908 DOI: 10.1002/elps.1150120206] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The existence of repetitive DNA sequences offers the possibility to assess the mammalian genome for individual variation in its entirety rather than at one or only a few sites. In order to fully explore the various sets of mammalian repeat sequences for this purpose, analytical tools are required which allow many if not all individual members of sets of repetitive elements to be resolved and identified in terms of location and allelic variation. We have applied and further developed an electrophoretic system, two-dimensional DNA typing, which may fulfill these requirements. The two-dimensional system combines separation of DNA fragments by size in a neutral gel, with separation by sequence composition in a denaturing gradient gel. By hybridization with minisatellite- and simple-sequence core probes and by inter-repeat polymerase chain reaction techniques, it is possible to obtain individual--and even chromosome-specific separation patterns that consist of hundreds of spots. Computerized image analysis and matching of such spot patterns allows the rapid assessment of multiple polymorphisms, spread over the genome, to monitor genetic variability in populations. When coupled to databases of polymorphic DNA markers with a known genomic location, two-dimensional DNA typing can greatly accelerate the mapping of genetic traits in humans, animals, and plants.
Collapse
Affiliation(s)
- A G Uitterlinden
- Department of Genetic Diagnostics Medscand Ingeny, Rijswijk, The Netherlands
| | | | | | | | | |
Collapse
|
5
|
Belmaaza A, Wallenburg JC, Brouillette S, Gusew N, Chartrand P. Genetic exchange between endogenous and exogenous LINE-1 repetitive elements in mouse cells. Nucleic Acids Res 1990; 18:6385-91. [PMID: 1978749 PMCID: PMC332516 DOI: 10.1093/nar/18.21.6385] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The repetitive LINE (L1) elements of the mouse, which are present at about 10(5) copies per genome and share over 80% of sequence homology, were examined for their ability to undergo genetic exchange with exogenous L1 sequences. The exogenous L1 sequences, carried by a shuttle vector, consisted of an internal fragment from L1Md-A2, a previously described member of the L1 family of the mouse. Using an assay that does not require the reconstitution of a selectable marker we found that this vector, in either circular or linear form, acquired DNA sequences from endogenous L1 elements at a frequency of 10(-3) to 10(-4) per rescued vector. Physical analysis of the acquired L1 sequences revealed that distinct endogenous L1 elements acted as donors and that different subfamilies participated. These results demonstrate that L1 elements are readily capable of genetic exchange. Apart from gene conversion events, the acquisition of L1 sequences outside the region of homology suggested that a second mechanism was also involved in the genetic exchange. A model which accounts for this mechanism is presented and its potential implication on the rearrangement of L1 elements is discussed.
Collapse
Affiliation(s)
- A Belmaaza
- Canadian Red Cross Society, Research and Development, Montreal Centre, Quebec
| | | | | | | | | |
Collapse
|
6
|
Boccaccio C, Deschatrette J, Meunier-Rotival M. Empty and occupied insertion site of the truncated LINE-1 repeat located in the mouse serum albumin-encoding gene. Gene X 1990; 88:181-6. [PMID: 1971802 DOI: 10.1016/0378-1119(90)90030-u] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Taking advantage of the polymorphism created by the presence or the absence of a LINE-1 repeat in intron 12 of the mouse serum albumin-encoding gene, we sequenced the repeat (Alb-L1Md), as well as the flanking regions in BALB/c DNA. The empty insertion site in a wild-type mouse of the same species Mus domesticus was amplified using PCR and sequenced. The Alb-L1Md was truncated at its 5' end and bordered by two 14-bp repeats, which represented the duplication of the empty insertion site. The absence of mutations in the two direct repeats as well as in the poly(dA) tail suggests that the Alb-L1Md sequence had been inserted very recently. On the basis of the insertion sequence of intron 12 and of the sequence of the consensus L1Md repeat, 5' of the insertion, we discuss a model of integration of full-length L1Md-RNA leading to the truncation of the inserted repeat.
Collapse
Affiliation(s)
- C Boccaccio
- INSERM U56, Hôpital de Bicêtre, Le Kremlin, France
| | | | | |
Collapse
|
7
|
Papadopoulos PC, Greenstein AM, Gaffney RA, Westbrook CA, Wiedemann LM. Characterization of the translocation breakpoint sequences in Philadelphia-positive acute lymphoblastic leukemia. Genes Chromosomes Cancer 1990; 1:233-9. [PMID: 2083218 DOI: 10.1002/gcc.2870010308] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
We have previously described a patient in whom the breakpoint occurred within the first intron of the BCR gene and have cloned the 9q+ and 22q- junctions. We have now determined the nucleotide sequence around the breakpoints on both translocation products from this patient as well as the corresponding regions from the normal chromosomes 9 and 22. We have compared the sequence with that of the breakpoint regions in the Ph1-positive leukemic patients in order to check for the presence of conserved motifs. A + T-rich sequences and ALU repeat elements are the only sequence characteristics which appear to be very common around translocation regions. The chromosome 9 ABL sequences at or adjacent to the breakpoints present in the 22q- product show homology to the consensus ALU sequence while the chromosome 22 sequences do not, suggesting a non-homologous recombination mechanism. While no sequences are deleted, there is a two-base-pair "homology" at the junction. Therefore, staggered breaks followed by ligation and repair could be part of the mechanism involved in the process of translocation in some cases of Ph1-positive ALL.
Collapse
MESH Headings
- Base Sequence
- Chromosomes, Human, Pair 22
- Chromosomes, Human, Pair 9
- DNA Damage
- DNA Repair
- DNA, Neoplasm/genetics
- Fusion Proteins, bcr-abl/genetics
- Humans
- Models, Genetic
- Molecular Sequence Data
- Philadelphia Chromosome
- Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics
- Protein-Tyrosine Kinases/genetics
- Proto-Oncogenes
- Recombination, Genetic
- Repetitive Sequences, Nucleic Acid
- Sequence Homology, Nucleic Acid
- Translocation, Genetic
Collapse
Affiliation(s)
- P C Papadopoulos
- Leukaemia Research Fund Centre, Institute of Cancer Research, London, England
| | | | | | | | | |
Collapse
|
8
|
|
9
|
The Structure of the Guanine-rich Polypurine: Polypyrimidine Sequence at the Right End of the Rat L1 (LINE) Element. J Biol Chem 1989. [DOI: 10.1016/s0021-9258(19)84886-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
10
|
Nur I, Pascale E, Furano AV. Demethylation and specific remethylation of the promoter-like region of the L family of mammalian transposable elements. CELL BIOPHYSICS 1989; 15:61-6. [PMID: 2476227 DOI: 10.1007/bf02991579] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- I Nur
- Orgenics, Ltd., Yavne, Israel
| | | | | |
Collapse
|
11
|
Bertling WM. Full length L1 retroposons contain tRNA-like sequences near the 5' termini--hypothesis on the replication mechanism of retroposons. J Theor Biol 1989; 138:185-94. [PMID: 2558257 DOI: 10.1016/s0022-5193(89)80138-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Retrotransposons replicate via a complex mechanism which depends on, among other things, the presence of long terminal repeats (LTRs) and a tRNA binding site just 3' of the 5' LTR. The LINES 1 (L1) family of sequences, which similar to retrotransposons in many other properties, represents a new class of retroposon which does not possess LTRs. However, we show here that the repetitive 5' motif associated with murine L1 elements contains a tRNA-like sequence in a location analogous to the position of the retro-transposon tRNA binding site. Although the repetition of such a 5' motif has only been found associated with murine L1 elements, we have found an analogous tRNA-like sequence near the 5' ends of the L1 elements from each of the other analyzed species for which the L1 family has been characterized, that is rat (L1Rr), human (L1Hs), drosophila (I element) and trypanosome (INGI). The conservation of this tRNA-like sequence near the 5' terminus of L1 elements from such diverse species suggests that it plays a functional role in the life of the L1 class of retroposon.
Collapse
Affiliation(s)
- W M Bertling
- Max-Planck-Society, University of Erlangen, West Germany
| |
Collapse
|
12
|
Woods-Samuels P, Wong C, Mathias SL, Scott AF, Kazazian HH, Antonarakis SE. Characterization of a nondeleterious L1 insertion in an intron of the human factor VIII gene and further evidence of open reading frames in functional L1 elements. Genomics 1989; 4:290-6. [PMID: 2497061 DOI: 10.1016/0888-7543(89)90332-7] [Citation(s) in RCA: 56] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
We have characterized an insertional event in IVS-10 of the factor VIII gene in a pedigree containing a hemophilia A patient (JH-25). The inserted DNA is a 5' truncated L1 element that is 681 bp long followed by a 3'66-bp poly(A) tract. The L1 element is inserted 154 bp 5' to the start of exon 11 and is flanked by a 13- to 17-bp target site duplication. The L1 insertion is present in four generations of the patient's family. The maternal grandfather who carries the insertion does not have hemophilia A, indicating that the insertion is not the cause of hemophilia A in the patient. We have sequenced this insertion and two previously reported de novo L1 insertions in the factor VIII gene in patients JH-27 (3785 bp) and JH-28 (2132 bp). The three nucleotide sequences differ by 0.2-0.8%. All three of these L1 insertions have open reading frames (ORFs) (1192, 642, and 157 aa) and the three derived amino acid sequences are 98-99% identical. The previously reported sequence similarity between L1 3' ORFs and the polymerase domain of reverse transcriptases is maintained in the ORFs of the JH-27 and JH-28 L1 insertions. The presence of open reading frames and the close sequence similarity of these recently inserted L1 elements provide indirect evidence for the existence of a set of functional L1 elements that encode one or more proteins necessary for their retrotransposition.
Collapse
Affiliation(s)
- P Woods-Samuels
- Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| | | | | | | | | | | |
Collapse
|
13
|
Extensive movement of LINES ONE sequences in beta-globin loci of Mus caroli and Mus domesticus. Mol Cell Biol 1989. [PMID: 2905421 DOI: 10.1128/mcb.8.11.4669] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
LINES ONE (L1) is a family of movable DNA sequences found in mammals. To measure the rate of their movement, we have compared the positions of L1 elements within homologous genetic loci that are separated by known divergence times. Two models that predict different outcomes of this analysis have been proposed for the behavior of L1 sequences. (i) Previous theoretical studies of concerted evolution in L1 have indicated that the majority of the 100,000 extant L1 elements may have inserted as recently as within the last 3 million years. (ii) Gene conversion has been proposed as an alternative to a history of prolific recent insertions. To distinguish between these two models, we cloned and characterized two embryonic beta-globin haplotypes from Mus caroli and compared them with those of M. domesticus. In 9 of 10 instances, we observed an L1 element to be present in one chromosome and absent at the same site in a homologous chromosome. This frequency is quantitatively consistent with the known rate of concerted evolution. Therefore, we conclude that gene conversion is not required for concerted evolution of the L1 family in the mouse. Furthermore, we show that the extensive movement of L1 sequences contributes to restriction fragment length polymorphism. L1 insertions may be the predominant cause of restriction fragment length polymorphisms in closely related haplotypes.
Collapse
|
14
|
Shehee WR, Loeb DD, Adey NB, Burton FH, Casavant NC, Cole P, Davies CJ, McGraw RA, Schichman SA, Severynse DM. Nucleotide sequence of the BALB/c mouse beta-globin complex. J Mol Biol 1989; 205:41-62. [PMID: 2926808 DOI: 10.1016/0022-2836(89)90363-x] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The nucleotide sequence of 55,856 base-pairs containing all seven beta-globin homologous structures from chromosome 7 of the BALB/c mouse is reported. This sequence links together previously published sequences of the beta-globin genes, pseudogenes and repetitive elements. Using low stringency computer searches, we found no additional beta-globin homologous sequences, but did find many more long interspersed repetitive sequences (L1) than predicted by hybridization. L1 is a major component of the mouse beta-globin complex with at least 15 elements comprising about 22% of the reported sequence. Most open reading frames greater than 300 base-pairs in the cluster overlap with L1 repeats or globin genes. Polypurine, polypyrimidine and alternating purine/pyrimidine tracts are not evenly dispersed throughout the complex, but they do not appear to be excluded from or restricted to particular regions. Several regions of intergenic homology were detected in dot-plot comparisons of the mouse sequence with itself and with the human beta-globin sequence. The significance of these homologies is unclear, but these regions are candidates for further study in functional assays in erythroid cell lines or transgenic animals.
Collapse
Affiliation(s)
- W R Shehee
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27514
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Casavant NC, Hardies SC, Funk FD, Comer MB, Edgell MH, Hutchison CA. Extensive movement of LINES ONE sequences in beta-globin loci of Mus caroli and Mus domesticus. Mol Cell Biol 1988; 8:4669-74. [PMID: 2905421 PMCID: PMC365556 DOI: 10.1128/mcb.8.11.4669-4674.1988] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
LINES ONE (L1) is a family of movable DNA sequences found in mammals. To measure the rate of their movement, we have compared the positions of L1 elements within homologous genetic loci that are separated by known divergence times. Two models that predict different outcomes of this analysis have been proposed for the behavior of L1 sequences. (i) Previous theoretical studies of concerted evolution in L1 have indicated that the majority of the 100,000 extant L1 elements may have inserted as recently as within the last 3 million years. (ii) Gene conversion has been proposed as an alternative to a history of prolific recent insertions. To distinguish between these two models, we cloned and characterized two embryonic beta-globin haplotypes from Mus caroli and compared them with those of M. domesticus. In 9 of 10 instances, we observed an L1 element to be present in one chromosome and absent at the same site in a homologous chromosome. This frequency is quantitatively consistent with the known rate of concerted evolution. Therefore, we conclude that gene conversion is not required for concerted evolution of the L1 family in the mouse. Furthermore, we show that the extensive movement of L1 sequences contributes to restriction fragment length polymorphism. L1 insertions may be the predominant cause of restriction fragment length polymorphisms in closely related haplotypes.
Collapse
Affiliation(s)
- N C Casavant
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27514
| | | | | | | | | | | |
Collapse
|
16
|
Nur I, Pascale E, Furano AV. The left end of rat L1 (L1Rn, long interspersed repeated) DNA which is a CpG island can function as a promoter. Nucleic Acids Res 1988; 16:9233-51. [PMID: 2459662 PMCID: PMC338703 DOI: 10.1093/nar/16.19.9233] [Citation(s) in RCA: 60] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Here we report that the 600 bp promoter-like region at the left end of a newly isolated and characterized rat L1 DNA element can activate the prokaryotic chloramphenicol acyltransferase gene in a rat cell line. Activation only occurs when the promoter region is oriented to the transferase gene as it is to the L1 protein encoding sequences and is 75% inhibited by methylation of just 5 of the 22 CpGs present in the promoter. The G + C rich promoter contains enough CpGs to qualify it as a CpG island, but in contrast to other CpG islands, genomic L1 promoters are fully methylated in both somatic cell and sperm DNA as judged by restriction enzyme analysis. Partial demethylation of the genomic promoters by treatment with 5-azacytidine failed to produce discrete L1 transcripts. The relationship of methylation to the evolutionary history and fate of the rat L1 promoter is discussed.
Collapse
Affiliation(s)
- I Nur
- Section on Genomic Structure and Function, National Institute of Diabetes, and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | | | |
Collapse
|
17
|
Furano AV, Robb SM, Robb FT. The structure of the regulatory region of the rat L1 (L1Rn, long interspersed repeated) DNA family of transposable elements. Nucleic Acids Res 1988; 16:9215-31. [PMID: 2845369 PMCID: PMC338702 DOI: 10.1093/nar/16.19.9215] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Here we report the DNA structure of the left 1.5 kb of two newly isolated full length members of the rat L1 DNA family (L1Rn, long interspersed repeated DNA). In contrast to earlier isolated rat L1 members, both of these contain promoter-like regions that are most likely full length. In addition, the promoter-like region of both members has undergone a partial tandem duplication. A second internal region of the left end of one of the reported members is also tandemly duplicated. The propensity of the left end of rat L1 elements to undergo this form of genetic rearrangement, as well as other structural features revealed by the present work, is discussed in light of the fact that during evolution the otherwise conserved mammalian L1 DNA families have each acquired completely different promoter-like regions. In an accompanying paper [Nur, I., Pascale, E., and Furano, A. V. (1988) Nucleic Acids Res. 16, submitted], we report that one of the rat promoter-like regions can function as a promoter in rat cells when fused to the Escherichia coli chloramphenicol acyltransferase gene.
Collapse
Affiliation(s)
- A V Furano
- Section on Genomic Structure and Function, National Institute of Diabetes, and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | | | |
Collapse
|
18
|
Burton FH, Loeb DD, McGraw RA, Edgell MH, Hutchison CA. A directed nucleotide-sequencing approach for single-stranded vectors based on recloning intermediates of a progressive DNA synthesis reaction. Gene 1988; 67:159-68. [PMID: 3169575 DOI: 10.1016/0378-1119(88)90393-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
A simple method for site-directed nucleotide sequencing is presented that uses a novel procedure for generating nested 'deletions' within inserts of single-stranded clones. In this method, single-stranded template, sequencing primer, and the Klenow fragment of Escherichia coli DNA polymerase I are used to initiate progressive DNA synthesis of the entire insert of the clone. By time-dependent sampling and pooling of intermediates from the synthesis reaction a series of nested double-stranded DNA subfragments of the insert can be created. Nested subclones are then produced by S1-endonuclease treatment and oriented subcloning methods. First, smaller quantities of template DNA can be used, equivalent to a fraction of a small DNA sequencing prep. Second, it works with single-stranded M13 phage DNA rather than requiring the preparation of double-stranded replicative form DNA as in ExoIII-based methods. Third, the 'deletions' it generates can span areas of simple nucleotide sequence or secondary structure that often halt digestion in the single-stranded exonuclease-based method. Last, the method is adaptable to a larger variety of insert cloning sites than the ExoIII-based method. The main disadvantage of the method is that, due to the lower efficiency of subcloning larger DNA fragments, subclone inserts larger than 3 kb are generated only infrequently.
Collapse
Affiliation(s)
- F H Burton
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27514
| | | | | | | | | |
Collapse
|
19
|
Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Collapse
|
20
|
Usdin K, Furano AV. Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA. Proc Natl Acad Sci U S A 1988; 85:4416-20. [PMID: 2837766 PMCID: PMC280440 DOI: 10.1073/pnas.85.12.4416] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed.
Collapse
Affiliation(s)
- K Usdin
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | |
Collapse
|
21
|
Skowronski J, Fanning TG, Singer MF. Unit-length line-1 transcripts in human teratocarcinoma cells. Mol Cell Biol 1988; 8:1385-97. [PMID: 2454389 PMCID: PMC363295 DOI: 10.1128/mcb.8.4.1385-1397.1988] [Citation(s) in RCA: 134] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Collapse
Affiliation(s)
- J Skowronski
- Laboratory of Biochemistry, National Cancer Institute, Bethesda, Maryland 20892
| | | | | |
Collapse
|
22
|
Lubsen NH, Aarts HJ, Schoenmakers JG. The evolution of lenticular proteins: the beta- and gamma-crystallin super gene family. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 1988; 51:47-76. [PMID: 3064189 DOI: 10.1016/0079-6107(88)90010-7] [Citation(s) in RCA: 155] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
23
|
Fanning TG, Singer MF. LINE-1: a mammalian transposable element. BIOCHIMICA ET BIOPHYSICA ACTA 1987; 910:203-12. [PMID: 2445384 DOI: 10.1016/0167-4781(87)90112-6] [Citation(s) in RCA: 180] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- T G Fanning
- Laboratory of Biochemistry, National Cancer Institute, Bethesda, MD 20892
| | | |
Collapse
|
24
|
Wincker P, Jubier-Maurin V, Roizès G. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies. Nucleic Acids Res 1987; 15:8593-606. [PMID: 3684566 PMCID: PMC306393 DOI: 10.1093/nar/15.21.8593] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies.
Collapse
Affiliation(s)
- P Wincker
- CRBM-L.P 8402 CNRS, U.249 INSERM, Institut de Biologie, Montpellier, France
| | | | | |
Collapse
|
25
|
Jubier-Maurin V, Wincker P, Cuny G, Roizès G. The relationships between the 5' end repeats and the largest members of the L1 interspersed repeated family in the mouse genome. Nucleic Acids Res 1987; 15:7395-410. [PMID: 3658697 PMCID: PMC306256 DOI: 10.1093/nar/15.18.7395] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Analysis of a few large L1 elements has revealed two types of tandem repeats at the 5' end: A and F. In this study, the relationships between these repeats and a series of large L1 elements has been analysed. Most of cloned L1 repeats were shown to lack either A or F sequences at their 5' ends. F sequences are found less frequently associated than A sequences to the 5' ends of L1 and an evolutionary comparison shows that the A type was introduced more recently during the evolution of the mouse genome than the F type.
Collapse
Affiliation(s)
- V Jubier-Maurin
- CNRS (CRBM-LP 8402), INSERM U.249, Institut de Biologie, Montpellier, France
| | | | | | | |
Collapse
|
26
|
Shehee WR, Chao SF, Loeb DD, Comer MB, Hutchison CA, Edgell MH. Determination of a functional ancestral sequence and definition of the 5' end of A-type mouse L1 elements. J Mol Biol 1987; 196:757-67. [PMID: 3681977 DOI: 10.1016/0022-2836(87)90402-5] [Citation(s) in RCA: 58] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The complete nucleotide sequence of L1Md-A13, a 6372 base-pair (bp) member of the L1Md repetitive family isolated from a BALB/c mouse genomic DNA library, is reported. The nucleotide sequence of 4331 bp from the 5' end of L1Md-9, which is located in the beta-globin complex of the C57BL/10 mouse, is also reported. Parsimony analysis of these sequences plus two previously reported L1Md sequences allows the determination of an ancestral L1Md sequence. Analysis of the L1Md population indicates that this ancestral sequence is likely to represent a functional L1 sequence. This ancestral sequence confirms that the length (1137 bp and 3900 bp) and relationship (14 bp overlap) of the two large open reading frames previously reported are conserved features of the L1Md family. It also allows the determination of an ancestral amino acid sequence for these two open reading frames. Full-length L1Md elements have one of two sequences tandemly repeated at the 5' end. These two monomers are called A-type and F-type. Our data define the 5' end of A-type full-length L1Md elements. L1Md elements of the A-type have varying numbers of tandemly repeated 208 bp monomers, but each element ends about 78 bp from the 5' end of the terminal 208 bp monomer.
Collapse
Affiliation(s)
- W R Shehee
- Department of Microbiology and Immunology, Curriculum in Genetics, University of North Carolina, Chapel Hill 27514
| | | | | | | | | | | |
Collapse
|
27
|
Nucleotide sequence analysis of a cloned DNA fragment from human cells reveals homology to retrotransposons. Mol Cell Biol 1987. [PMID: 3031462 DOI: 10.1128/mcb.7.1.231] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
During molecular cloning of proviral DNA of human spumaretrovirus, various recombinant clones were established and analyzed. Blot hybridization revealed that one of the recombinant plasmids had the characteristic features of a member of the long interspersed repetitive sequences family. The DNA element was analyzed by restriction mapping and nucleotide sequencing. It showed a high degree of amino acid sequence homology of 54.3% when compared with the 5'-terminal part of the pol gene product of the murine retrotransposon LIMd. The 3' region of the cloned DNA element encodes proteins with an even higher degree of homology of 67.4% in comparison to the corresponding parts of a member of the primate KpnI sequence family.
Collapse
|
28
|
The sequence of a large L1Md element reveals a tandemly repeated 5' end and several features found in retrotransposons. Mol Cell Biol 1987. [PMID: 3023821 DOI: 10.1128/mcb.6.1.168] [Citation(s) in RCA: 178] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The complete nucleotide sequence of a 6,851-base pair (bp) member of the L1Md repetitive family from a selected random isolate of the BALB/c mouse genome is reported here. Five kilobases of the element contains two overlapping reading frames of 1,137 and 3,900 bp. The entire 3,900-bp frame and the 3' 600 bp of the 1,137-bp frame, when compared with a composite consensus primate L1 sequence, show a ratio of replacement to silent site differences characteristic of protein coding sequences. This more closely defines the protein coding capacity of this repetitive family, which was previously shown to possess a large open reading frame of undetermined extent. The relative organization of the 1,137- and 3,900-bp reading frames, which overlap by 14 bp, bears resemblance to protein-coding, mobile genetic elements. Homology can be found between the amino acid sequence of the 3,900-bp frame and selected domains of several reverse transcriptases. The 5' ends of the two L1Md elements described in this report have multiple copies, 4 2/3 copies and 1 2/3 copy, of a 208-bp direct tandem repeat. The sequence of this 208-bp element differs from the sequence of a previously defined 5' end for an L1Md element, indicating that there are at least two different 5' end motifs for L1Md.
Collapse
|
29
|
Flügel RM, Maurer B, Bannert H, Rethwilm A, Schnitzler P, Darai G. Nucleotide sequence analysis of a cloned DNA fragment from human cells reveals homology to retrotransposons. Mol Cell Biol 1987; 7:231-6. [PMID: 3031462 PMCID: PMC365061 DOI: 10.1128/mcb.7.1.231-236.1987] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
During molecular cloning of proviral DNA of human spumaretrovirus, various recombinant clones were established and analyzed. Blot hybridization revealed that one of the recombinant plasmids had the characteristic features of a member of the long interspersed repetitive sequences family. The DNA element was analyzed by restriction mapping and nucleotide sequencing. It showed a high degree of amino acid sequence homology of 54.3% when compared with the 5'-terminal part of the pol gene product of the murine retrotransposon LIMd. The 3' region of the cloned DNA element encodes proteins with an even higher degree of homology of 67.4% in comparison to the corresponding parts of a member of the primate KpnI sequence family.
Collapse
|
30
|
Holdener-Kenny B, Weaver S. A naturally occurring deletion in the mouse Hbbs beta-globin gene cluster. Proc Natl Acad Sci U S A 1986; 83:4374-8. [PMID: 3459181 PMCID: PMC323735 DOI: 10.1073/pnas.83.12.4374] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
A restriction fragment size difference of 1.9 kilobases exists between the C57BL/10 (Hbbs) and BALB/c (Hbbd) beta-globin gene clusters in the vicinity of the Hbb-y and Hbb-bh0 genes. This length difference is the result of a deletion that removes the majority of the C57BL/10 Hbb-bh0 gene. The Hbb-bh0 deletion appears to be widespread among Hbbs-carrying mice. In addition to the deletion, 119 base pairs of DNA of unknown origin are found in C57BL/10 DNA at the point of discontinuity. The existence of this deletion is inconsistent with reports of transcripts homologous to the third exon of the Hbb-bh0 gene in both C57BL/10 embryos and in a cell line carrying the Hbbs haplotype. We propose that the Hbb-bh0 gene is a recently disabled pseudogene.
Collapse
|
31
|
Callahan R, Gallahan D, D'Hoostelaere LA, Potter M. Endogenous MMTV proviral genomes in feral Mus musculus domesticus. Curr Top Microbiol Immunol 1986; 127:362-70. [PMID: 3015501 DOI: 10.1007/978-3-642-71304-0_44] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
32
|
Loeb DD, Padgett RW, Hardies SC, Shehee WR, Comer MB, Edgell MH, Hutchison CA. The sequence of a large L1Md element reveals a tandemly repeated 5' end and several features found in retrotransposons. Mol Cell Biol 1986; 6:168-82. [PMID: 3023821 PMCID: PMC367496 DOI: 10.1128/mcb.6.1.168-182.1986] [Citation(s) in RCA: 141] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The complete nucleotide sequence of a 6,851-base pair (bp) member of the L1Md repetitive family from a selected random isolate of the BALB/c mouse genome is reported here. Five kilobases of the element contains two overlapping reading frames of 1,137 and 3,900 bp. The entire 3,900-bp frame and the 3' 600 bp of the 1,137-bp frame, when compared with a composite consensus primate L1 sequence, show a ratio of replacement to silent site differences characteristic of protein coding sequences. This more closely defines the protein coding capacity of this repetitive family, which was previously shown to possess a large open reading frame of undetermined extent. The relative organization of the 1,137- and 3,900-bp reading frames, which overlap by 14 bp, bears resemblance to protein-coding, mobile genetic elements. Homology can be found between the amino acid sequence of the 3,900-bp frame and selected domains of several reverse transcriptases. The 5' ends of the two L1Md elements described in this report have multiple copies, 4 2/3 copies and 1 2/3 copy, of a 208-bp direct tandem repeat. The sequence of this 208-bp element differs from the sequence of a previously defined 5' end for an L1Md element, indicating that there are at least two different 5' end motifs for L1Md.
Collapse
|