51
|
Characterization of the multigene family encoding the mouse S16 ribosomal protein: strategy for distinguishing an expressed gene from its processed pseudogene counterparts by an analysis of total genomic DNA. Mol Cell Biol 1986. [PMID: 3915781 DOI: 10.1128/mcb.5.12.3560] [Citation(s) in RCA: 103] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Two genes from the family encoding mouse ribosomal protein S16 were cloned, sequenced, and analyzed. One gene was found to be a processed pseudogene, i.e., a nonfunctional gene presumably derived from an mRNA intermediate. The other S16 gene contained introns and had exonic sequences identical to those of a cloned S16 cDNA. The expression of this gene was demonstrated by Northern blot analysis of nuclear poly(A)+ RNA with cDNA and unique sequence intron probes. Each S16 intron contains a well-preserved remnant of the TACTAAC motif, which is ubiquitous in yeast introns and known to play a critical role in intron splicing. A sequence comparison with two other mouse ribosomal protein genes analyzed in our laboratory, L30 and L32, revealed common structural features which might be involved in the control and coordination of ribosomal protein gene expression. These include the lack of a canonical TATA box in the -20 to -30 region and a remarkably similar 12-nucleotide pyrimidine sequence (CTTCCYTYYTC) that spans the cap site and is flanked by C + G-rich sequences. The nature of the other members of the S16 family was evaluated by three types of experiment: a DNase I sensitivity analysis to measure the extent of chromatin condensation; an analysis of the thermal stability of cDNA-gene hybrids to estimate the extent of divergence of each gene sequence from that of the expressed gene; and a restriction fragment analysis which distinguishes intron-containing genes from intronless processed genes. The results of these analyses show that all genes except the expressed S16 gene are in a condensed chromatin configuration associated with transcriptional quiescence; that most of the genes within the S16 family have sequences greater than 7% divergent from the expressed S16 gene; and that at least 7 of the 10 S16 genes lack introns. We conclude that the ribosomal protein S16 multigene family contains one expressed intron-containing gene and nine inactive pseudogenes, most or all of which are of the processed type.
Collapse
|
52
|
|
53
|
Alsip GR, Konkel DA. A processed chicken pseudogene (CPS1) related to the ras oncogene superfamily. Nucleic Acids Res 1986; 14:2123-38. [PMID: 3083400 PMCID: PMC339647 DOI: 10.1093/nar/14.5.2123] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We describe the first polyA-containing processed pseudogene reported in the chicken. It includes a 0.52 kb open reading frame which could encode a 175 amino acid protein. The putative protein shows extensive homology to the ras oncogene superfamily, being most closely related to the yeast protein YP2. It is one of the two most divergent members of the ras superfamily yet described and is the most homologous of any ras-related protein to the G-protein alpha-transducin. The chicken genome contains at least one other gene highly homologous to CPS1; at least one member of the CPS1 family is active, but only early in chicken development. This pattern of expression, and the presence of mutations in regions known to activate human c-ras genes to oncogenicity, suggest that CPS1 may represent a new oncogene family.
Collapse
|
54
|
Lerman MI, Hegamyer GA, Colburn NH. Cloning and characterization of putative genes that specify sensitivity to neoplastic transformation by tumor promoters. Int J Cancer 1986; 37:293-302. [PMID: 3002990 DOI: 10.1002/ijc.2910370219] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Two putative genes (termed pro 1 and pro 2) specifying sensitivity to induction of neoplastic transformation by TPA in mouse epidermal JB6 cells were cloned by sib selection from a size-selected genomic library of clonal cells sensitive to promotion of transformation. By restriction analysis, heteroduplex analysis, direct hybridization, and sequencing, the putative genes are different from and have no homology to known oncogenes. Both genes are independently and equally active as total DNA in the transfection assay. The transformation-promoting potential of these putative genes does not appear to result from gene amplification or detectable rearrangements, suggesting that small structural changes might confer the promoting activity. The mouse pro sequences are also found in monkey and human DNAs. The pro-1 sequence is homologous to middle repetitive elements in the mouse genome, namely the BAM 5 and B1 repeats. The sequence of pro-1 was determined and suggests that it contains the signals to be transcribed by RNA polymerase II and to encode a protein of 7.1 kDa.
Collapse
|
55
|
Costanzo F, Colombo M, Staempfli S, Santoro C, Marone M, Frank R, Delius H, Cortese R. Structure of gene and pseudogenes of human apoferritin H. Nucleic Acids Res 1986; 14:721-36. [PMID: 3003694 PMCID: PMC339460 DOI: 10.1093/nar/14.2.721] [Citation(s) in RCA: 105] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Ferritin is composed of two subunits, H and L. cDNA's coding for these proteins from human liver (1,2,3), lymphocytes (4) and from the monocyte-like cell line U937 (5) have been cloned and sequenced. Southern blot analysis on total human DNA reveals that there are many DNA segments hybridizing to the apoferritin H and L cDNA probes (1,2,4,6). In view of the tissue heterogeneity of ferritin molecules (7,8), it appeared possible that apoferritin molecules could be coded by a family of genes differentially expressed in various tissues (1,2). In this paper we describe the cloning and sequencing of the gene coding for human apoferritin H. This gene has three introns; the exon sequence is identical to that of cDNA's isolated from human liver, lymphocytes, HeLa cells and endothelial cells. In addition we show that at least 15 intronless pseudogenes exist, with features suggesting that they were originated by reverse transcription and insertion. On the basis of these results we conclude that only one gene is responsible for the synthesis of the majority of apoferritin H mRNA in various tissues examined, and that probably all the other DNA segments hybridizing with apoferritin cDNA are pseudogenes.
Collapse
|
56
|
Wagner M, Perry RP. Characterization of the multigene family encoding the mouse S16 ribosomal protein: strategy for distinguishing an expressed gene from its processed pseudogene counterparts by an analysis of total genomic DNA. Mol Cell Biol 1985; 5:3560-76. [PMID: 3915781 PMCID: PMC369187 DOI: 10.1128/mcb.5.12.3560-3576.1985] [Citation(s) in RCA: 65] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Two genes from the family encoding mouse ribosomal protein S16 were cloned, sequenced, and analyzed. One gene was found to be a processed pseudogene, i.e., a nonfunctional gene presumably derived from an mRNA intermediate. The other S16 gene contained introns and had exonic sequences identical to those of a cloned S16 cDNA. The expression of this gene was demonstrated by Northern blot analysis of nuclear poly(A)+ RNA with cDNA and unique sequence intron probes. Each S16 intron contains a well-preserved remnant of the TACTAAC motif, which is ubiquitous in yeast introns and known to play a critical role in intron splicing. A sequence comparison with two other mouse ribosomal protein genes analyzed in our laboratory, L30 and L32, revealed common structural features which might be involved in the control and coordination of ribosomal protein gene expression. These include the lack of a canonical TATA box in the -20 to -30 region and a remarkably similar 12-nucleotide pyrimidine sequence (CTTCCYTYYTC) that spans the cap site and is flanked by C + G-rich sequences. The nature of the other members of the S16 family was evaluated by three types of experiment: a DNase I sensitivity analysis to measure the extent of chromatin condensation; an analysis of the thermal stability of cDNA-gene hybrids to estimate the extent of divergence of each gene sequence from that of the expressed gene; and a restriction fragment analysis which distinguishes intron-containing genes from intronless processed genes. The results of these analyses show that all genes except the expressed S16 gene are in a condensed chromatin configuration associated with transcriptional quiescence; that most of the genes within the S16 family have sequences greater than 7% divergent from the expressed S16 gene; and that at least 7 of the 10 S16 genes lack introns. We conclude that the ribosomal protein S16 multigene family contains one expressed intron-containing gene and nine inactive pseudogenes, most or all of which are of the processed type.
Collapse
|
57
|
Sundaralingam M, Drendel W, Greaser M. Stabilization of the long central helix of troponin C by intrahelical salt bridges between charged amino acid side chains. Proc Natl Acad Sci U S A 1985; 82:7944-7. [PMID: 3865207 PMCID: PMC390886 DOI: 10.1073/pnas.82.23.7944] [Citation(s) in RCA: 69] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The unusual dumbbell shape of troponin C is due to the presence of a long alpha-helix of nine turns that connects the amino- and carboxyl-terminal calcium-binding domains. The center of the long helix appears to be stabilized by several salt bridges. The long helix is also bent about 16 degrees at glycine-92. Calmodulin, which lacks the central glycine, also is predicted to be stabilized by salt bridges in the central helix. The presence of a proline residue in the center of the long helix of ascidian troponin C and the myosin regulatory light chains suggests that a sharper bend may occur in these molecules. The conservation of the bend and salt bridges in the related calcium-binding proteins suggests they may have an important biological function. The structure of troponin C suggests that intrahelix salt bridges between neighboring charged residues may be involved in the stabilization of protein secondary structure. The preponderance of potential salt bridges in other muscle proteins as well may be related to their elongated structures and their participation in the contractile process.
Collapse
|
58
|
Soares MB, Schon E, Henderson A, Karathanasis SK, Cate R, Zeitlin S, Chirgwin J, Efstratiadis A. RNA-mediated gene duplication: the rat preproinsulin I gene is a functional retroposon. Mol Cell Biol 1985; 5:2090-103. [PMID: 2427930 PMCID: PMC366927 DOI: 10.1128/mcb.5.8.2090-2103.1985] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Rats and mice have two, equally expressed, nonallelic genes encoding preproinsulin (genes I and II). Cytological hybridization with metaphase chromosomes indicated that both genes reside on rat chromosome I but are approximately 100,000 kilobases apart. In mice the two genes reside on two different chromosomes. DNA sequence comparisons of the gene-flanking regions in rats and mice indicated that the preproinsulin gene I has lost one of the two introns present in gene II, is flanked by a long (41-base) direct repeat, and has a remnant of a polydeoxyadenylate acid tract preceding the downstream direct repeat. These structural features indicated that gene I was generated by an RNA-mediated duplication-transposition event involving a transcript of gene II which was initiated upstream from the normal capping site. Sequence divergence analysis indicated that the pair of the original gene and its retroposed, but functional, counterpart (which appeared about 35 million years ago) is maintained by strong negative selection operating primarily on the segments encoding the chains of the mature hormone, whereas the segments encoding the parts of the polypeptide that are eliminated during processing and also the introns and the flanking regions are evolving neutrally.
Collapse
|
59
|
Abstract
A simple kinetic model is developed that describes the accumulation of processed pseudogenes in a functional gene family. Insertion of new pseudogenes occurs at rate v per gene and is countered by spontaneous deletion (at rate delta per DNA segment) of segments containing processed pseudogenes. If there are k functional genes in a gene family, the equilibrium number of processed pseudogenes is k(v/delta), and the percentage of functional genes in the gene family at equilibrium is 1/[1 + (v/delta)]. v/delta values estimated for five gene families ranged from 1.7 to 15. This fairly narrow range suggests that the rates of formation and deletion of processed pseudogenes may be positively correlated for these families. If delta is sufficiently large relative to the per nucleotide mutation rate mu (delta greater than 20 mu), processed pseudogenes will show high homology with each other, even in the absence of gene conversion between pseudogenes. We argue that formation of processed pseudogenes may share common pathways with transposable elements and retroviruses, creating the potential for correlated responses in the evolution of processed pseudogenes due to direct selection for control of transposable elements and/or retroviruses. Finally, we discuss the nature of the selective forces that may act directly or indirectly to influence the evolution of processed pseudogenes.
Collapse
|
60
|
Tschudi C, Young AS, Ruben L, Patton CL, Richards FF. Calmodulin genes in trypanosomes are tandemly repeated and produce multiple mRNAs with a common 5' leader sequence. Proc Natl Acad Sci U S A 1985; 82:3998-4002. [PMID: 3858856 PMCID: PMC397921 DOI: 10.1073/pnas.82.12.3998] [Citation(s) in RCA: 125] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
In Trypanosoma brucei gambiense, the Ca2+ binding protein calmodulin is encoded by three identical tandemly repeated genes. The transcripts of these genes consist of several RNA species similar in size. A 35-nucleotide spliced leader sequence is present at the 5' end of each mRNA but is not encoded by DNA contiguous to these genes. We have identified two different sites for the fusion of the leader to the mRNA. These results strongly support the idea that a novel, possibly discontinuous, transcription mechanism is used by these parasites.
Collapse
|
61
|
Bacterial expression and characterization of proteins derived from the chicken calmodulin cDNA and a calmodulin processed gene. J Biol Chem 1985. [DOI: 10.1016/s0021-9258(18)89127-x] [Citation(s) in RCA: 163] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
62
|
Rogers JH. The origin and evolution of retroposons. INTERNATIONAL REVIEW OF CYTOLOGY 1985; 93:187-279. [PMID: 2409043 DOI: 10.1016/s0074-7696(08)61375-3] [Citation(s) in RCA: 421] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
63
|
|
64
|
Stumph WE, Hodgson CP, Tsai MJ, O'Malley BW. Genomic structure and possible retroviral origin of the chicken CR1 repetitive DNA sequence family. Proc Natl Acad Sci U S A 1984; 81:6667-71. [PMID: 6593723 PMCID: PMC391991 DOI: 10.1073/pnas.81.21.6667] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
We have analyzed the sequence and structure of three CR1 family repetitive elements found in the region adjoining the 3' end of a chicken calmodulin gene. Members of this family are approximately equal to 300 base pairs long and are dispersed throughout the chicken genome. The present data, when taken together with that from four CR1s sequenced previously, reveal that the CR1 family has an overall structure possessing several features associated with the long terminal repeats of avian retroviruses. This finding implies that a retroviral mechanism may be responsible for the dispersion of CR1 sequences throughout the chicken genome. The seven different CR1 repeats that have been analyzed exist at defined locations in the chicken genome relative to nearby structural genes. A directional polarity has been assigned to the CR1 family based upon limited sequence homology to mammalian Alu-type sequences. Interestingly, whether present in 5' - or 3'-flanking DNA, the CR1 sequences have an inverse orientation such that they all "point toward" the nearby structural genes. This is consistent with the previously proposed concept that chicken CR1 sequences may be involved in defining the boundaries of active chromosomal domains of gene expression.
Collapse
|
65
|
Anagnou NP, O'Brien SJ, Shimada T, Nash WG, Chen MJ, Nienhuis AW. Chromosomal organization of the human dihydrofolate reductase genes: dispersion, selective amplification, and a novel form of polymorphism. Proc Natl Acad Sci U S A 1984; 81:5170-4. [PMID: 6089182 PMCID: PMC391659 DOI: 10.1073/pnas.81.16.5170] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
The human dihydrofolate reductase (DHFR; tetrahydrofolate dehydrogenase; 5,6,7,8-tetrahydrofolate: NADP+ oxidoreductase, EC 1.5.1.3) gene family includes a functional gene (hDHFR) and at least four intronless genes. Three intronless genes (hDHFR-psi 2, hDHFR-psi 3, and hDHFR-psi 4) are identifiable as pseudogenes because of DNA sequence divergence from the functional gene with introns, while one intronless gene (hDHFR-psi 1) is completely homologous to the coding sequences of the functional gene. Analysis of genomic DNA from two panels of somatic human-rodent cell hybrids with specific molecular probes provide insight into the chromosomal organization and assignment of these genes. The five genes are dispersed in that each one is found on a different chromosome. The functional gene hDHFR has been assigned to chromosome 5, and one pseudogene (hDHFR-psi 4), to chromosome 3. In a human cell line (HeLa) that was selected for methotrexate resistance, the functional locus became amplified, while there was no amplification of the four intronless pseudogenes. hDHFR-psi 1 was found to be present in DNA of some individuals and absent from DNA of others, consistent with a recent evolutionary origin of this gene originally suggested by its sequence identity to the coding portions of the functional gene. The presence or absence of this intronless pseudogene represents a previously unreported form of DNA polymorphism.
Collapse
|
66
|
Means AR, Slaughter GR, Putkey JA. Postreceptor signal transduction by cyclic adenosine monophosphate and the Ca2+-calmodulin complex. J Cell Biol 1984; 99:226s-231s. [PMID: 6086667 PMCID: PMC2275604 DOI: 10.1083/jcb.99.1.226s] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
|
67
|
Abstract
Two cDNAs derived from Xenopus laevis calmodulin mRNA have been cloned. Both cDNAs contain the complete protein-coding region and various lengths of untranslated segments. The two cDNAs encode an identical protein but differ from each other by 5% nucleotide substitutions. The 5' and 3' untranslated regions, to the extent available, are highly homologous between the two cDNAs. The predicted sequence of X. laevis calmodulin is identical to that of vertebrate calmodulins from mammals and chickens and shows one substitution compared with electric eel calmodulin. Genomic DNA sequences homologous to each of the two cDNA clones have been isolated and were shown to account for the major calmodulin-coding DNA sequences in X. laevis. These data suggest that X. laevis carries two active, nonallelic calmodulin genes. Although no complete analysis has been carried out, it appears that the X. laevis calmodulin genes are interrupted by at least four introns. The relative concentrations of calmodulin mRNA have been estimated in different embryonic stages and adult tissues and found to vary by up to a factor of 10. The highest levels of calmodulin mRNA were found in ovaries, testes, and brains. In these three tissues, the two calmodulin genes appear to be expressed at approximately equal levels.
Collapse
|
68
|
Klein A, Meyuhas O. A multigene family of intron lacking and containing genes, encoding for mouse ribosomal protein L7. Nucleic Acids Res 1984; 12:3763-76. [PMID: 6328436 PMCID: PMC318788 DOI: 10.1093/nar/12.9.3763] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Mouse ribosomal protein L7 is encoded by a multigene family. Screening of two mouse genomic libraries with cloned L7 cDNA, has resulted in the isolation of nine independent lambda Charon 4A recombinant phages which include seven different L7 genes. Restriction enzyme mapping of six of these genes (L7-1, L7-16, L7-18, L7-28, L7-35 and L7- 16b ) reveals dissimilarity in sites within the L7 sequences as well as in the flanking regions. Electron microscopic analysis of heteroduplex and S1 nuclease mapping demonstrate that the first five genes contain the entire L7 mRNA sequence but lack introns. Based on these features we propose that these are processed genes. Of the L7 genes described here only one (L7- 16b ) exhibits a high degree of homology with L7 mRNA and contains introns. We discuss the possibility that this low representation of intron containing L7 genes may reflect the proportion of functional L7 genes in this multigene family.
Collapse
|
69
|
Abstract
Two cDNAs derived from Xenopus laevis calmodulin mRNA have been cloned. Both cDNAs contain the complete protein-coding region and various lengths of untranslated segments. The two cDNAs encode an identical protein but differ from each other by 5% nucleotide substitutions. The 5' and 3' untranslated regions, to the extent available, are highly homologous between the two cDNAs. The predicted sequence of X. laevis calmodulin is identical to that of vertebrate calmodulins from mammals and chickens and shows one substitution compared with electric eel calmodulin. Genomic DNA sequences homologous to each of the two cDNA clones have been isolated and were shown to account for the major calmodulin-coding DNA sequences in X. laevis. These data suggest that X. laevis carries two active, nonallelic calmodulin genes. Although no complete analysis has been carried out, it appears that the X. laevis calmodulin genes are interrupted by at least four introns. The relative concentrations of calmodulin mRNA have been estimated in different embryonic stages and adult tissues and found to vary by up to a factor of 10. The highest levels of calmodulin mRNA were found in ovaries, testes, and brains. In these three tissues, the two calmodulin genes appear to be expressed at approximately equal levels.
Collapse
|