1
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
2
|
Dridi S. Alu mobile elements: from junk DNA to genomic gems. SCIENTIFICA 2012; 2012:545328. [PMID: 24278713 PMCID: PMC3820591 DOI: 10.6064/2012/545328] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2012] [Accepted: 11/06/2012] [Indexed: 06/02/2023]
Abstract
Alus, the short interspersed repeated sequences (SINEs), are retrotransposons that litter the human genomes and have long been considered junk DNA. However, recent findings that these mobile elements are transcribed, both as distinct RNA polymerase III transcripts and as a part of RNA polymerase II transcripts, suggest biological functions and refute the notion that Alus are biologically unimportant. Indeed, Alu RNAs have been shown to control mRNA processing at several levels, to have complex regulatory functions such as transcriptional repression and modulating alternative splicing and to cause a host of human genetic diseases. Alu RNAs embedded in Pol II transcripts can promote evolution and proteome diversity, which further indicates that these mobile retroelements are in fact genomic gems rather than genomic junks.
Collapse
Affiliation(s)
- Sami Dridi
- Nutrition Research Institute, The University of North Carolina at Chapel Hill, 500 Laureate Way, Kannapolis, NC 28081, USA
| |
Collapse
|
3
|
Losch FO, Bredenbeck A, Hollstein VM, Walden P, Wrede P. Evidence for a large double-cruciform DNA structure on the X chromosome of human and chimpanzee. Hum Genet 2007; 122:337-43. [PMID: 17638018 DOI: 10.1007/s00439-007-0405-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2007] [Accepted: 07/09/2007] [Indexed: 01/05/2023]
Abstract
The human X chromosome consists of a high number of large inverted repeat (IR) DNA sequences which fulfill all requirements for formation of cruciform DNA structures. Such alternative DNA structures are suggested to have a great impact in altering the chromatin architecture and function. Our comprehensive analysis of the corresponding orthologous nucleotide sequences of an IR sequence from Homo sapiens and Pan troglodytes revealed that most of the nucleotide differences between the two species are symmetrical to the apex of the IR, and that the spacer region of the orthologous IRs are in reverse orientation. We provide evidence that this IR forms a large non-B DNA structure containing two Holliday junctions, allowing intrastrand nucleotide pairing of the arms and interstrand pairing of the spacer region of the IR. This structure would extrude into a large double-cruciform DNA structure providing the molecular basis of translocation events and regulation of gene expression.
Collapse
MESH Headings
- Animals
- Antigens, Neoplasm/genetics
- Base Sequence
- Chromosomes, Human, X/chemistry
- Chromosomes, Human, X/genetics
- DNA/chemistry
- DNA/genetics
- DNA, Cruciform/chemistry
- DNA, Cruciform/genetics
- Humans
- Models, Molecular
- Nucleic Acid Conformation
- Pan troglodytes/genetics
- Phylogeny
- Repetitive Sequences, Nucleic Acid
- Sequence Homology, Nucleic Acid
- Species Specificity
- X Chromosome/chemistry
- X Chromosome/genetics
Collapse
Affiliation(s)
- Florian O Losch
- Clinical Research Group Tumor Immunology, Department of Dermatology, Charité - Universitätsmedizin Berlin, 10098 Berlin, Germany.
| | | | | | | | | |
Collapse
|
4
|
Abstract
During the past 65 million years, Alu elements have propagated to more than one million copies in primate genomes, which has resulted in the generation of a series of Alu subfamilies of different ages. Alu elements affect the genome in several ways, causing insertion mutations, recombination between elements, gene conversion and alterations in gene expression. Alu-insertion polymorphisms are a boon for the study of human population genetics and primate comparative genomics because they are neutral genetic markers of identical descent with known ancestral states.
Collapse
Affiliation(s)
- Mark A Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana 70803, USA.
| | | |
Collapse
|
5
|
Pavlícek A, Jabbari K, Paces J, Paces V, Hejnar JV, Bernardi G. Similar integration but different stability of Alus and LINEs in the human genome. Gene 2001; 276:39-45. [PMID: 11591470 DOI: 10.1016/s0378-1119(01)00645-x] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Alus and LINEs (LINE1) are widespread classes of repeats that are very unevenly distributed in the human genome. The majority of GC-poor LINEs reside in the GC-poor isochores whereas GC-rich Alus are mostly present in GC-rich isochores. The discovery that LINES and Alus share similar target site duplication and a common AT-rich insertion site specificity raised the question as to why these two families of repeats show such a different distribution in the genome. This problem was investigated here by studying the isochore distributions of subfamilies of LINES and Alus characterized by different degrees of divergence from the consensus sequences, and of Alus, LINEs and pseudogenes located on chromosomes 21 and 22. Young Alus are more frequent in the GC-poor part of the genome than old Alus. This suggests that the gradual accumulation of Alus in GC-rich isochores has occurred because of their higher stability in compositionally matching chromosomal regions. Densities of Alus and LINEs increase and decrease, respectively, with increasing GC levels, except for the telomeric regions of the analyzed chromosomes. In addition to LINEs, processed pseudogenes are also more frequent in GC-poor isochores. Finally, the present results on Alu and LINE stability/exclusion predict significant losses of Alu DNA from the GC-poor isochores during evolution, a phenomenon apparently due to negative selection against sequences that differ from the isochore composition.
Collapse
Affiliation(s)
- A Pavlícek
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo 2, CZ-16637, Prague, Czech Republic
| | | | | | | | | | | |
Collapse
|
6
|
Roy-Engel AM, Carroll ML, Vogel E, Garber RK, Nguyen SV, Salem AH, Batzer MA, Deininger PL. Alu insertion polymorphisms for the study of human genomic diversity. Genetics 2001; 159:279-90. [PMID: 11560904 PMCID: PMC1461783 DOI: 10.1093/genetics/159.1.279] [Citation(s) in RCA: 93] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Genomic database mining has been a very useful aid in the identification and retrieval of recently integrated Alu elements from the human genome. We analyzed Alu elements retrieved from the GenBank database and identified two new Alu subfamilies, Alu Yb9 and Alu Yc2, and further characterized Yc1 subfamily members. Some members of each of the three subfamilies have inserted in the human genome so recently that about a one-third of the analyzed elements are polymorphic for the presence/absence of the Alu repeat in diverse human populations. These newly identified Alu insertion polymorphisms will serve as identical-by-descent genetic markers for the study of human evolution and forensics. Three previously classified Alu Y elements linked with disease belong to the Yc1 subfamily, supporting the retroposition potential of this subfamily and demonstrating that the Alu Y subfamily currently has a very low amplification rate in the human genome.
Collapse
Affiliation(s)
- A M Roy-Engel
- Tulane Cancer Center, Department of Environmental Health Sciences, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Roy AM, Gong C, Kass DH, Deininger PL. Recent B2 element insertions in the mouse genome. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 2001; 8:343-8. [PMID: 10993604 DOI: 10.3109/10425179809034078] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
B2 elements are a family of short interspersed repeats that have amplified within rodent genomes. Recent mobility of only two individual B2 elements has been reported to date. We identified an additional recent B2 insertion occurring within intron 4 of the murine beta-glucuronidase gene (Gus-s) of the BalbC strain of mouse by analyzing orthologous loci of a nonrandomly selected B2 element. The basis of selection for the B2 element was its high level of sequence identity to the B2 consensus. The selected B2 element was amplified by the polymerase chain reaction (PCR) using primers to the unique flanking sequences from genomic DNA of several species and laboratory strains of mice. Our results demonstrated the presence of the selected B2 element only in the genome of Mus musculus BalbC strain. Cloning and sequencing of a representative sample of the products obtained confirmed the absence of the B2 element within this intron in addition to other variations in the sequence. The detection of the B2 element only in the BalbC strain suggests that the element recently inserted within this mouse population when the initial laboratory colony was formed. Sequence comparison of the two previously identified recent B2 inserts also shows a low divergence in relation to the B2 type II consensus. The data presented confirms that recently inserted B2 elements closely match their consensus sequence, potentially allowing for their identification.
Collapse
Affiliation(s)
- A M Roy
- Department of Biochemistry and Molecular Biology, Louisiana State University--Medical Center, New Orleans 70112, USA
| | | | | | | |
Collapse
|
8
|
Abstract
Several distinct families of endogenous retrovirus-like elements (ERVs) exist in the genomes of primates. Despite the important evolutionary consequences that carrying these intragenomic parasites may have for their hosts, our knowledge about their evolution is still scarce. A matter of particular interest is whether evolution of ERVs occurs via a master lineage or through several lineages coexisting over long periods of time. In this work, the paleogenomic approach has been applied to the study of the evolution of ERV9, one of the human endogenous retrovirus families mobilized during primate evolution. By searching the GenBank database with the first 676 bp of the ERV9 long terminal repeat, we identified 156 different element insertions into the human genome. These elements were grouped into 14 subfamilies based on several characteristic nucleotide differences. The age of each subfamily was roughly estimated based on the average sequence divergence of its members from the subfamily consensus sequence. Determination of the sequential order of diagnostic substitutions led to the identification of four distinct lineages, which retained their capacity of transposition over extended periods of evolution. Strong evidence for mosaic evolution of some of these lineages is presented. Taken altogether, the available data indicate that the possibility of ERV9 still being active in the human lineage can not be discarded.
Collapse
Affiliation(s)
- J Costas
- Departamento de Bioloxía Fundamental, Facultade de Bioloxía, Universidade de Santiago de Compostela, Spain.
| | | |
Collapse
|
9
|
Sng JH, Heaton VJ, Bell M, Maini P, Austin CA, Fisher LM. Molecular cloning and characterization of the human topoisomerase IIalpha and IIbeta genes: evidence for isoform evolution through gene duplication. BIOCHIMICA ET BIOPHYSICA ACTA 1999; 1444:395-406. [PMID: 10095062 DOI: 10.1016/s0167-4781(99)00020-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Human DNA topoisomerase II is essential for chromosome segregation and is the target for several clinically important anticancer agents. It is expressed as genetically distinct alpha and beta isoforms encoded by the TOP2alpha and TOP2beta genes that map to chromosomes 17q21-22 and 3p24, respectively. The genes display different patterns of cell cycle- and tissue-specific expression, with the alpha isoform markedly upregulated in proliferating cells. In addition to the fundamental role of TOP2alpha and TOP2beta genes in cell growth and development, altered expression and rearrangement of both genes are implicated in anticancer drug resistance. Here, we report the complete structure of the human topoisomerase IIalpha gene, which consists of 35 exons spanning 27.5 kb. Sequence data for the exon-intron boundaries were determined and examined in the context of topoisomerase IIalpha protein structure comprising three functional domains associated with energy transduction, DNA breakage-reunion activity and nuclear localization. The organization of the 3' half of human TOP2beta, including sequence specifying the C-terminal nuclear localization domain, was also elucidated. Of the 15 introns identified in this 20 kb region of TOP2beta, the first nine and the last intron align in identical positions and display the same phases as introns in TOP2alpha. Though their extreme 3' ends differ, the striking conservation suggests the two genes diverged recently in evolutionary terms consistent with a gene duplication event. Access to TOP2alpha and TOP2beta gene structures should aid studies of mutations and gene rearrangements associated with anticancer drug resistance.
Collapse
Affiliation(s)
- J H Sng
- Molecular Genetics Group, Department of Biochemistry, St. George's Hospital Medical School, University of London, London SW17 0RE, UK
| | | | | | | | | | | |
Collapse
|
10
|
Lundqvist E, Johansson I, Ingelman-Sundberg M. Genetic mechanisms for duplication and multiduplication of the human CYP2D6 gene and methods for detection of duplicated CYP2D6 genes. Gene 1999; 226:327-38. [PMID: 9931507 DOI: 10.1016/s0378-1119(98)00567-8] [Citation(s) in RCA: 118] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The polymorphic CYP2D6 gene determines the rates at which several different classes of clinically important drugs are metabolized in vivo. A specific phenotype whereby a subject metabolizes drugs very rapidly (ultrarapid metabolizer, UM) has been shown to be caused by the presence of multiple active CYP2D6 genes on one allele. Hitherto, individuals with 1, 2, 3, 4, 5, or 13 CYP2D6 genes in tandem have been described for various ethnic groups. In the present investigation, we present results from restriction mapping of the CYP2D loci of individuals with two or more consecutive CYP2D6 genes, along with sequence analysis of this gene (CYP2D6*2). Our results indicate that alleles with duplicated or multiduplicated genes have occurred through unequal crossover at a specific breakpoint in the 3'-flanking region of the CYP2D6*2B allele with a specific repetitive sequence. In contrast, alleles with 13 copies of the gene are proposed to have been formed by unequal segregation and extrachromosomal replication of the acentric DNA. We present a rapid and efficient PCR-based allele-specific method for the detection of duplicated, multiduplicated, or amplified CYP2D6 genes.
Collapse
Affiliation(s)
- E Lundqvist
- Division of Molecular Toxicology, Institute for Environmental Medicine, Karolinska Institutet, S-171 77, Stockholm, Sweden
| | | | | |
Collapse
|
11
|
Jendraschak E, Kaminski WE. Isolation of human promoter regions by Alu repeat consensus-based polymerase chain reaction. Genomics 1998; 50:53-60. [PMID: 9628822 DOI: 10.1006/geno.1998.5290] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Knowledge of the promoter structure is critical for an understanding of the regulation of genes. We demonstrate by analysis of 405 human genes that human promoter regions are flanked by upstream Alu repeat elements, typically at a distance of 0.5-5 kb from their protein-coding areas. We identified common Alu repeat consensus sequences (ARC) among the different members of the Alu subfamilies that can be used as universal anchor sites for polymerase chain reaction (PCR) amplification. Utilizing ARC-specific primers and oligonucleotides specific for the 5' end of a selected target gene, we show that sequences spanning unknown human gene promoter regions can be directly amplified by PCR from genomic DNA. This novel technique, termed ARC-PCR, allowed us to characterize the proximal promoters of the human LTA4 hydrolase and SPARC genes, each within 1 day.
Collapse
Affiliation(s)
- E Jendraschak
- Department of Pathology, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | | |
Collapse
|
12
|
Komonyi O, Mink M, Csiha J, Maróy P. Genomic organization of DHR38 gene in Drosophila: presence of Alu-like repeat in a translated exon and expression during embryonic development. ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 1998; 38:185-192. [PMID: 9704500 DOI: 10.1002/(sici)1520-6327(1998)38:4<185::aid-arch4>3.0.co;2-q] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The recombinant lambda clone 4-2 containing the genomic region of the Drosophila hormone receptor 38 (DHR 38) gene, homologous to mammalian neuronal growth factor I-B (NGFI-B), was isolated by radioactive labelled oligonucleotide hybridization. The nucleotide sequence of the genomic clone revealed three exons that encode the functional domains of the protein. The N-terminal exon1 had no homology at the amino acid level with NGFI-B, the mammalian homologue. A glutamine-rich region, probably involved in transcriptional activation, was observed at the C-terminal part of this exon. A similar motif is also present upstream in another reading frame of the same strand. Both motifs are preceded by a repetitive non-anucleotide sequence containing an AluI site, resembling a duplicated human Alu-sequence. A monomeric version of this sequence, coding similarly for an oligoglutamine peptide, occurs at a surprisingly high frequency in other regulatory genes in Drosophila. In contrast to mammalian Alu sequences, this sequence is found almost exclusively in the coding regions of Drosophila genes, but not in the non-coding parts of the genes. The DNA-binding domain with two zinc-fingers, and at least part of the ligand-binding peptide, is coded by the largest middle exon2 in DHR38 and exhibits up to 100% homology in short peptide motifs to its mammalian counterpart, where these domains are split into exons 3, 4, 5, and 6. However, the length, information content, stop codon, and splice site are conserved in the last exons in both fly and rat. In situ hybridization to 0-24 h wholemount embryos showed strong expression of DHR38 in neurogenic regions and in the intestinal tract during embryogenesis, suggesting a spatial and temporal control of transcription, partially analogous to the central nervous system-specific expression of NGFI-B in mammals.
Collapse
Affiliation(s)
- O Komonyi
- Department of Genetics, A.J. University, Szeged, Hungary
| | | | | | | |
Collapse
|
13
|
von Sternberg R. The role of constrained self-organization in genome structural evolution. Acta Biotheor 1996; 44:95-118. [PMID: 9028019 DOI: 10.1007/bf00048418] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A hypothesis of genome structural evolution is explored. Rapid and cohesive alterations in genome organization are viewed as resulting from the dynamic and constrained interactions of chromosomal subsystem components. A combination of macromolecular boundary conditions and DNA element involvement in far-from-equilibrium reactions is proposed to increase the complexity of genomic subsystems via the channelling of genome turnover; interactions between subsystems create higher-order subsystems expanding the phase space for further genetic evolution. The operation of generic constraints on structuration in genome evolution is suggested by i) universal, homoplasic features of chromosome organization and ii) the metastable nature of genome structures where lower-level flux is constrained by higher-order structures. Phenomena such as 'genomic shock', bursts of transposable element activity, concerted evolution, etc., are hypothesized to result from constrained systemic responses to endogenous/exogenous, micro/macro perturbations. The constraints operating on genome turnover are expected to increase with chromosomal structural complexity, the number of interacting subsystems, and the degree to which interactions between genomic components are tightly ordered.
Collapse
Affiliation(s)
- R von Sternberg
- Center for Intelligent Systems, T.J. Watson School, State University of New York at Binghamton 13902, USA
| |
Collapse
|
14
|
Tishkoff SA, Ruano G, Kidd JR, Kidd KK. Distribution and frequency of a polymorphic Alu insertion at the plasminogen activator locus in humans. Hum Genet 1996; 97:759-64. [PMID: 8641693 DOI: 10.1007/bf02346186] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
We have investigated the frequency distribution, across a broad range of geographically dispersed populations, of alleles of the polymorphic Alu insertion that occurs within the 8th intron of the tissue plasminogen, activator gene (PLAT). This Alu is a member of a recently derived subfamily of Alu elements that has been expanding during human evolution and continues to be transpositionally active. We used a "population tube" approach to screen 10 chromosomes from each of 19 human populations for presence or absence of this Alu in the PLAT locus and found that all tested populations are dimorphic for presence/absence of this insertion. We show that the previously published EcoRI, HincII, PstI, TaqI, and XmnI polymorphisms at the PLAT locus all result from insertion of this Alu and we use both restriction fragment length polymorphism and polymerase chain reaction analysis to examine the frequency of Alu(+) and Alu(-) alleles in a sample of 1003 individuals from 27 human populations and in 38 nonhuman primates. Nonhuman primates are monomorphic for the Alu(-) allele. Human populations differ substantially in allele frequency, and in several populations both alleles are common. Our results date the insertion event prior to the spread and diversification of modern humans.
Collapse
Affiliation(s)
- S A Tishkoff
- Department of Genetics, Yale University, New Haven, CT 06510, USA
| | | | | | | |
Collapse
|
15
|
Englander EW, Howard BH. A naturally occurring T14A11 tract blocks nucleosome formation over the human neurofibromatosis type 1 (NF1)-Alu element. J Biol Chem 1996; 271:5819-23. [PMID: 8621451 DOI: 10.1074/jbc.271.10.5819] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The nature of chromatin organization over Alu repetitive elements is of interest with respect to the maintenance of their transcriptional silencing as well as their potential to influence local chromatin structure. We previously demonstrated that the pattern of nucleosomal organization over Alu elements in native chromatin is specific and similar to the pattern observed with an in vitro reconstituted Alu template. This pattern, distinguished by a nucleosome centered over the 5 -end of the Alu element, is associated with repression of polymerase III-dependent transcription in vitro (Englander, E. W., Wolffe, A. P., and Howard, B. H. (1993) J. Biol. Chem. 268, 19565-19573; Englander, E. W., and Howard, B. H. (1995) J. Biol. Chem. 270, 10091-10096). In the current study, additional templates representing both evolutionarily old and young Alu subfamilies were found to direct a similar pattern of nucleosome assembly, consistent with the view that nucleosome positioning in vitro is shared by a majority of Alus. We discovered however, that the specific nucleosome positioning pattern was disrupted over one member of a young Alu subfamily, which recently transposed immediately downstream to a T14A11 sequence in the neurofibromatosis type 1 locus (Wallace, M. R., Andersen, L. B., Saulino, A. M., Gregory, P. E., Glover, T. W., and Collins, F. S. (1991) Nature 353, 864-866). Upon removal of this sequence motif, the expected pattern of assembly was restored to the neurofibromatosis type 1-Alu template. This finding indicates that, at least in vitro, certain sequences can override the propensity for positioning nucleosomes that is inherent to Alu elements. The finding also raises the possibility that a similar situation may occur in vivo, with potential implications for understanding mechanisms by which certain Alu elements may evade chromatin-mediated transcriptional silencing.
Collapse
Affiliation(s)
- E W Englander
- Laboratory of Molecular Growth Regulation, NICHD, National Institutes of Health, Bethesda, Maryland 20892-2753, USA
| | | |
Collapse
|
16
|
Murata S, Takasaki N, Saitoh M, Tachida H, Okada N. Details of retropositional genome dynamics that provide a rationale for a generic division: the distinct branching of all the pacific salmon and trout (Oncorhynchus) from the Atlantic salmon and trout (Salmo). Genetics 1996; 142:915-26. [PMID: 8849897 PMCID: PMC1207028 DOI: 10.1093/genetics/142.3.915] [Citation(s) in RCA: 51] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Salmonid species contain numerous short interspersed repetitive elements (SINEs), known collectively as the HpaI family, in their genomes. Amplification and successive integration of individual SINEs into the genomes have occurred during the evolution of salmonids. We reported previously a strategy for determining the phylogenetic relationships among the Pacific salmonids in which these SINEs were used as temporal landmarks of evolution. Here, we provide evidence for extensive genomic rearrangements that involved retropositions and deletions in a common ancestor of all the Pacific salmon and trout. Our results provide genetic support for the recent phylogenetic reassignment of steelhead and related species from the genus Salmo to the genus Oncorhynchus. Several other informative loci identified by insertions of HpaI SINEs have been isolated, and previously proposed branching orders of the Oncorhynchus species have been confirmed. The authenticity of our phylogenetic tree is supported both by the isolation of more than two informative loci per branching point and by the congruence of all our data, which suggest that the period between successive speciations was sufficiently long for each SINE that had been amplified in the original species to become fixed in all individuals of that species.
Collapse
Affiliation(s)
- S Murata
- Faculty of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
| | | | | | | | | |
Collapse
|
17
|
Milewicz DM, Byers PH, Reveille J, Hughes AL, Duvic M. A dimorphic Alu Sb-like insertion in COL3A1 is ethnic-specific. J Mol Evol 1996; 42:117-23. [PMID: 8919863 DOI: 10.1007/bf02198836] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Alu elements are a class of repetitive DNA sequences found throughout the human genome that are thought to be duplicated via an RNA intermediate in a process termed retroposition. Recently inserted Alu elements are closely related, suggesting that they are derived from a single source gene or closely related source genes. Analysis of the type III collagen gene (COL3A1) revealed a polymorphic Alu insertion in intron 8 of the gene. The Alu insertion in the COL3A1 gene had a high degree of nucleotide identity to the Sb family of Alu elements, a family of older Alu elements. The Alu sequence was less similar to the consensus sequence for the PV or Sb2 subfamilies, subfamilies of recently inserted Alu elements. These data support the observations that at least three source genes are active in the human genome, one of which is distinct from the PV and Sb2 subfamilies and predates either of these two subfamilies. Appearance of the Alu insertion in different ethnic populations suggests that the insertion may have occurred in the last 100,000 years. This Alu insert should be a useful marker for population studies and for marking COL3A1 alleles.
Collapse
Affiliation(s)
- D M Milewicz
- Department of Internal Medicine, University of Texas Medical School at Houston 77030, USA
| | | | | | | | | |
Collapse
|
18
|
Abstract
Using Kimura's distance measure we have calculated the average age of all major Alu subfamilies based on the most recent available data. We conclude that AluJ sequences are some 26 Myr older than previously thought. Furthermore, the origin of the FLA (Free Left Arm) Alu family can be traced back to the very beginning of the mammalian radiation. One new minor subfamily is reported and discussed in the context of sequence diversity in major Alu subfamilies.
Collapse
Affiliation(s)
- V Kapitonov
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306, USA
| | | |
Collapse
|
19
|
Batzer MA, Arcot SS, Phinney JW, Alegria-Hartman M, Kass DH, Milligan SM, Kimpton C, Gill P, Hochmeister M, Ioannou PA, Herrera RJ, Boudreau DA, Scheer WD, Keats BJ, Deininger PL, Stoneking M. Genetic variation of recent Alu insertions in human populations. J Mol Evol 1996; 42:22-9. [PMID: 8576959 DOI: 10.1007/bf00163207] [Citation(s) in RCA: 134] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The Alu family of interspersed repeats is comprised of over 500,000 members which may be divided into discrete subfamilies based upon mutations held in common between members. Distinct subfamilies of Alu sequences have amplified within the human genome in recent evolutionary history. Several individual Alu family members have amplified so recently in human evolution that they are variable as to presence and absence at specific loci within different human populations. Here, we report on the distribution of six polymorphic Alu insertions in a survey of 563 individuals from 14 human population groups across several continents. Our results indicate that these polymorphic Alu insertions probably have an African origin and that there is a much smaller amount of genetic variation between European populations than that found between other population groups.
Collapse
Affiliation(s)
- M A Batzer
- Human Genome Center, Lawrence Livermore National Laboratory, CA 94551, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Batzer MA, Deininger PL, Hellmann-Blumberg U, Jurka J, Labuda D, Rubin CM, Schmid CW, Zietkiewicz E, Zuckerkandl E. Standardized nomenclature for Alu repeats. J Mol Evol 1996; 42:3-6. [PMID: 8576960 DOI: 10.1007/bf00163204] [Citation(s) in RCA: 186] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- M A Batzer
- Human Genome Center, Lawrence Livermore National Laboratory, CA 94551, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Kido Y, Saitoh M, Murata S, Okada N. Evolution of the active sequences of the HpaI short interspersed elements. J Mol Evol 1995; 41:986-95. [PMID: 8587145 DOI: 10.1007/bf00173180] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Ninety-nine members of the salmonid HpaI and AvaIII families of short interspersed repetitive elements (SINEs) were aligned and a general consensus sequence was deduced. The presence of 26 correlated changes in nucleotides (diagnostic nucleotides) from those in the consensus sequence allowed us to divide the members of the HpaI family into 12 subfamilies and those of the AvaIII family into two subfamilies. On the basis of the average sequence divergences and the phylogenetic distributions of the subfamilies, the relative antiquity of the subfamilies and the process of sequential changes in the respective source sequences were inferred. Despite the higher mutation rates of CG dinucleotides in individual dispersed members, no hypermutability of CG positions was observed in changes in the source sequences. This result suggests that sequences of SINEs located in a nonmethylated or hypomethylated genomic region could have been selected as source sequences for retroposition and/or that some CG sites are the parts of recognition sequences of retropositional machineries.
Collapse
Affiliation(s)
- Y Kido
- Faculty of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
| | | | | | | |
Collapse
|
22
|
Arcot SS, Fontius JJ, Deininger PL, Batzer MA. Identification and analysis of a 'young' polymorphic Alu element. BIOCHIMICA ET BIOPHYSICA ACTA 1995; 1263:99-102. [PMID: 7632743 DOI: 10.1016/0167-4781(95)00080-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
A polymorphic Alu element belonging to a young subfamily of Alu repeats has been identified. Sequence analysis showed that this Alu element is flanked by perfect direct repeats and a 3' oligo(dA)-rich tail. The Alu element, designated A25, is deleted by 34 nucleotides at the 5' end and has a single CpG mutation compared to the human-specific consensus sequence. Using a PCR-based assay, we demonstrated that the A25 Alu repeat is localized to human chromosome 8 and is polymorphic in humans.
Collapse
Affiliation(s)
- S S Arcot
- Human Genome Center, Lawrence Livermore National Laboratory, CA 94551, USA
| | | | | | | |
Collapse
|
23
|
Deininger PL, Batzer MA, Hutchison CA, Edgell MH. Master genes in mammalian repetitive DNA amplification. Trends Genet 1995; 8:307-11. [PMID: 1365396 DOI: 10.1016/0168-9525(92)90262-3] [Citation(s) in RCA: 223] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The analysis of species-specific subfamilies of both the LINE and SINE mammalian repetitive DNA families suggests that such subfamilies have arisen by amplification of an extremely small group of 'master' genes. In contrast to the master genes, the vast majority of both SINEs and LINEs appear to behave like psudogenes in their inability to undergo extensive amplification.
Collapse
Affiliation(s)
- P L Deininger
- Department of Biochemistry and Molecular Biology, Louisiana State University Medical Center, New Orleans 70112
| | | | | | | |
Collapse
|
24
|
Tachida H, Iizuka M. A population genetic study of the evolution of SINEs. I. Polymorphism with regard to the presence or absence of an element. Genetics 1993; 133:1023-30. [PMID: 7681799 PMCID: PMC1205395 DOI: 10.1093/genetics/133.4.1023] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
SINEs are short interspersed repeated DNA elements which are considered to spread throughout genomes via RNA intermediates. Polymorphisms with regard to the presence or absence of SINE are occasionally observed in a specific location of a genome. We modeled the evolution of SINEs with regard to this type of polymorphism. Because SINEs are rarely deleted, multiplication of elements is confined to a certain period, and a few master copies are considered to be responsible for their multiplication, the usual population genetic models of transposable elements assuming the equilibrium state are not applicable to describe the evolution of SINEs. Taking into account these properties and assuming selective neutrality, we computed conditional probabilities of finding a SINE at a specific site given that this site is first found because it is occupied by a SINE in an original sample. Using these probabilities, we investigated ways to estimate the multiplication period and infer relationships among populations. The latter inference procedures are shown to be strongly dependent on the multiplication period.
Collapse
Affiliation(s)
- H Tachida
- National Institute of Genetics, Shizuoka-ken, Japan
| | | |
Collapse
|
25
|
Affiliation(s)
- M A Batzer
- Human Genome Center, Lawrence Livermore National Laboratory, Livermore, California 94551
| | | | | |
Collapse
|
26
|
Novick GE, Gonzalez T, Garrison J, Novick CC, Batzer MA, Deininger PL, Herrera RJ. The use of polymorphic Alu insertions in human DNA fingerprinting. EXS 1993; 67:283-91. [PMID: 8400699 DOI: 10.1007/978-3-0348-8583-6_26] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
We have characterized several Human Specific (HS) Alu insertions as either dimorphic (TPA25, PV92, APO), slightly dimorphic (C2N4 and C4N4) or monomorphic (C3N1, C4N6, C4N2, C4N5, C4N8) based on studies of Caucasian, Asian, American Black and African Black populations. Our approach is based upon: 1) PCR amplification using primers complementary to the unique DNA sequences that flank the site of insertion of the different Alu elements studied; 2) gel electrophoresis and scoring according to the presence or absence of an Alu insertion in one or both homologous chromosomes; 3) allele frequencies determined by gene counting and compared to Hardy-Weinberg expectations. Our DNA fingerprinting procedure using PCR amplification of diallelic polymorphic (dimorphic) Human Specific Alu insertions, may be used as a tool for genetic mapping, to characterize populations, study human migrational patterns, and track the inheritance of human genetic disorders.
Collapse
Affiliation(s)
- G E Novick
- Department of Biological Sciences, Florida International University, Miami 33199
| | | | | | | | | | | | | |
Collapse
|
27
|
|
28
|
Abstract
The molecular phylogeny of Alu and other repeated sequences in the human genome provides clues to events during primate evolution. A subclass of human Alu's has been previously identified as dimorphic insertions within members of the medium reiteration frequency (mer) class of repeats, reflecting the complicated sequence of insertion and radiation events leading to the current human genome structure. One dimorphic Alu is located within a previously unidentified mer family member, in the first intron of the human T4 (CD4) gene. The insertion (Alu+ allele) has a frequency of approximately 70% in Europeans and Africans and is homozygous in 20 Asian samples. Polymerase chain reaction amplification, direct DNA sequencing, and Southern analysis using oligonucleotide probes revealed that the Alu- allele was derived from the Alu+ allele by loss of part of the inserted sequence. Comparison with a tightly linked marker within the human genome and studies of baboon DNA samples revealed that the original insertion was a relatively early event in primate evolution, but that the Alu sequence loss leading to the dimorphism has occurred much more recently. Loss of Alu insertions therefore represents one mechanism for the generation of human Alu dimorphisms.
Collapse
Affiliation(s)
- M C Edwards
- Institute for Molecular Genetics, Baylor College of Medicine, Houston, Texas 77030
| | | |
Collapse
|
29
|
Sinnett D, Richer C, Deragon JM, Labuda D. Alu RNA transcripts in human embryonal carcinoma cells. Model of post-transcriptional selection of master sequences. J Mol Biol 1992; 226:689-706. [PMID: 1507221 DOI: 10.1016/0022-2836(92)90626-u] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Alu master sequences colonized the human genome using RNA as amplification intermediate. To understand this phenomenon better we isolated and analyzed Alu RNA from NTera2D1 pluripotential cells. Northern hybridization, primer extension, cDNA cloning and sequencing data are congruent and demonstrate a low level of Alu specific transcription. These bona fide RNA Polymerase III Alu transcripts, although enriched in the cytoplasm, are not dominated by a single master species but rather originate from a variety of loci. However, when compared with the genomic average, or to repeats from RNA Polymerase II co-transcripts, they belong to the youngest group of Alu subfamilies (p less than 0.001) and have a higher content of intact CpG-dinucleotides. This suggests that Alu transcription is influenced both by mutations and the genomic context, and points to a possible role of DNA methylation in silencing the bulk of genomic repeats. Because of the heterogeneity of Alu transcripts a post-transcriptional selection mechanism recruiting Alu master sequences for retroposition is required. We propose that Alu RNA masters could have evolved as selfish satellites to a more complex retroposition system equipped with a reverse transcriptase activity and that their structure was conserved through "phenotypic" selection of the RNA level.
Collapse
MESH Headings
- Base Sequence
- Blotting, Northern
- Cell Line
- Cell Nucleus/metabolism
- Cloning, Molecular
- Cytoplasm/metabolism
- Gene Library
- Genetic Variation
- Genome, Human
- Humans
- Models, Genetic
- Molecular Sequence Data
- Oligodeoxyribonucleotides
- Plasmids
- Polymerase Chain Reaction
- RNA Polymerase II/metabolism
- RNA Processing, Post-Transcriptional
- RNA, Neoplasm/genetics
- RNA, Neoplasm/isolation & purification
- Repetitive Sequences, Nucleic Acid
- Ribonucleoproteins/metabolism
- Sequence Homology, Nucleic Acid
- Templates, Genetic
- Teratoma
- Transcription, Genetic
Collapse
Affiliation(s)
- D Sinnett
- Hôpital Sainte-Justine, Département de Pédiatrie, Université de Montréal, Québec, Canada
| | | | | | | |
Collapse
|
30
|
Leeflang EP, Liu WM, Hashimoto C, Choudary PV, Schmid CW. Phylogenetic evidence for multiple Alu source genes. J Mol Evol 1992; 35:7-16. [PMID: 1325566 DOI: 10.1007/bf00160256] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
A member of the young PV Alu subfamily is detected in chimpanzee DNA showing that the PV subfamily is not specific to human DNA. This particular Alu is absent from the orthologous loci in both human and gorilla DNAs, indicating that PV subfamily members transposed within the chimpanzee lineage following the divergence of chimpanzee from both gorilla and human. These findings and previous reports describing the transpositional activity of other Alu sequences within the human, gorilla, and chimpanzee lineages provide phylogenetic evidence for the existence of multiple Alu source genes. Sequences surrounding this particular Alu resemble known transcriptional control elements associated with RNA polymerase III, suggesting a mechanism by which cis-acting elements might be acquired upon retrotransposition.
Collapse
Affiliation(s)
- E P Leeflang
- Department of Chemistry, University of California, Davis 95616
| | | | | | | | | |
Collapse
|
31
|
Lidberg U, Nilsson J, Strömberg K, Stenman G, Sahlin P, Enerbäck S, Bjursell G. Genomic organization, sequence analysis, and chromosomal localization of the human carboxyl ester lipase (CEL) gene and a CEL-like (CELL) gene. Genomics 1992; 13:630-40. [PMID: 1639390 DOI: 10.1016/0888-7543(92)90134-e] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The gene encoding human carboxyl ester lipase (CEL), including 1628 bp of the 5'-flanking region, has been isolated and characterized from two overlapping lambda phage clones. The gene spans 9832 bp and contains 11 exons interrupted by 10 introns. The exons range in size from 88 to 204 bp, except for the last exon, which is 841 bp. A major and a minor transcription initiation site were determined 13 and 7 bp, respectively, upstream of the initiator methionine. The nucleotide sequence is identical with that of the previously reported cDNA, except for the third nucleotide in the 5'-untranslated sequence, a C, which in the cDNA is a T. A TAAATA sequence is present 26 nt upstream from the major CAP site, and within the 5'-flanking region there are several putative transcription factor binding sites. Seven Alu repetitive sequence elements are present in the region analyzed. The organization of the human CEL gene is similar to that of the recently reported rat pancreatic cholesterol esterase gene. The CEL gene was assigned to chromosome 9q34-qter, which confirms the recently reported results of Tayler et al. (1991, Genomics 10: 425-431). A previously unknown gene with a striking homology to the human CEL gene, here called the CEL-like gene (CELL), has also been isolated and characterized, including 1724 bp of the 5'-flanking region. The CELL gene, which most likely is a psuedogene, spans 4846 bp, and due to the absence of a 4.8-kb segment, the CEL gene exons 2-7 are not present in the CELL gene. Despite these differences, the CELL gene is transcribed. We have also assigned the CELL gene to a separate locus at chromosome 9q34-qter.
Collapse
Affiliation(s)
- U Lidberg
- Department of Molecular Biology, University of Göteborg, Sweden
| | | | | | | | | | | | | |
Collapse
|
32
|
Plass C, Hellwig T, Traut W, Winking H. Evolution of a B2 tagged sequence from a long-range repeat family in the genus Mus. Mamm Genome 1992; 3:197-201. [PMID: 1611215 DOI: 10.1007/bf00355719] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
A long-range repeat family of more than 50 kb repeat size is clustered in Chromosomes (Chr) 1 of Mus musculus and M. spretus. In M. musculus this long-range repeat family shows considerable variation of copy-number frequency and contains coding regions for at least two genes. In an intron of a gene, which is part of the repeat, a B2 small interspersed repetitive element (SINE) is inserted at identical positions. The B2 element is present in all copies of the long-range repeat family; it was presumably a component of the ancestral single-copy precursor sequence that gave rise by amplification to the repeat family. Copies of the long-range repeat family vary with respect to the number of TAAA tandem repeats in the A-rich 3' end region of the B2 element. As inferred from polymerase chain reaction (PCR) data, presence and frequency of repeat number variants in the (TAAA)n block are strain and species specific. The B2 element and its flanking regions were sequenced from two copies of the long-range repeat family. Sequence divergence between the two copies (only non-CG base substitutions and deletions/insertions) was determined to be 2.6%. Based on the drift rate in human Alu elements and a correction for the higher drift rates in rodents, an estimate for the divergence time of 1.7 million years was calculated. Since the long-range repeat family is present in M. musculus and M. spretus, it must have evolved by amplification before the separation of the two species about 1-4 million years ago.
Collapse
Affiliation(s)
- C Plass
- Institut für Biologie, Medizinische Universität zu Lübeck, FRG
| | | | | | | |
Collapse
|
33
|
Driesen MS, Dauwerse JG, Wapenaar MC, Meershoek EJ, Mollevanger P, Chen KL, Fischbeck KH, van Ommen GJ. Generation and fluorescent in situ hybridization mapping of yeast artificial chromosomes of 1p, 17p, 17q, and 19q from a hybrid cell line by high-density screening of an amplified library. Genomics 1991; 11:1079-87. [PMID: 1783377 DOI: 10.1016/0888-7543(91)90035-d] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
A yeast artificial chromosome (YAC) library has been constructed from a somatic cell hybrid containing a t(1p;19q) chromosome and chromosome 17. After amplification, part of this library was analyzed by high-density colony filter screening with a repetitive human DNA probe (Alu). The human YACs distinguished by the screening were further analyzed by Alu fingerprinting and Alu PCR. Fluorescent in situ hybridization (FISH) was performed to localize the YACs to subchromosomal regions of chromosome 1p, 17, or 19q. We have obtained a panel of 123 individual YACs with a mean size of 160 kb, and 77 of these were regionally localized by FISH: 33 to 1p, 10 to 17p, 25 to 17q, and 9 to 19q. The YACs cover a total of 19.7 Mb or 9% of the 220 Mb of human DNA contained in the hybrid. No overlapping YACs have yet been detected. These YACs are available upon request and should be helpful in mapping studies of disease loci, e.g., Charcot-Marie-Tooth disease, Miller-Dieker syndrome, hereditary breast tumor, myotonic dystrophy, and malignant hyperthermia.
Collapse
MESH Headings
- Chromosome Mapping
- Chromosomes, Fungal
- Chromosomes, Human, Pair 1
- Chromosomes, Human, Pair 17
- Chromosomes, Human, Pair 19
- Cloning, Molecular
- DNA Fingerprinting
- Gene Amplification
- Gene Library
- Genome, Human
- Humans
- Hybrid Cells
- Microscopy, Fluorescence
- Nucleic Acid Hybridization
- Polymerase Chain Reaction
- Repetitive Sequences, Nucleic Acid
Collapse
Affiliation(s)
- M S Driesen
- Department of Human Genetics, State University Leiden, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
SINEs with internal promoters for RNA polymerase III are ubiquitous in the genomes of the animal kingdom, including invertebrates. Although the human Alu family, and related families, originates from 7SL RNA, all other SINEs originate from tRNA. SINEs have been amplified many times, altered in genomic organization and fixed in the population at certain stages of evolution. They can therefore be regarded as time-landmarks of evolution. It is proposed that both population genetics and molecular biology are required for understanding the expansion of SINEs.
Collapse
Affiliation(s)
- N Okada
- Institute of Biological Sciences, University of Tsukuba, Ibaraki, Japan
| |
Collapse
|
35
|
Abstract
A comparison of Alu sequences that comprise more recently amplified Alu subfamilies was made. There are 18 individual diagnostic mutations associated with the different subfamilies. This analysis confirmed that the formation of each subfamily can be explained by the sequential accumulation of mutations relative to the previous subfamily. Polymerase chain reaction amplification of orthologous loci in several primate species allowed us to determine the time of insertion of Alu sequences in individual loci. These data suggest that the vast majority of Alu elements amplified at any given time comprised a single Alu subfamily. We find that, although the individual divergence relative to a consensus sequence correlate reasonably well with sequence age, the diagnostic mutations are a more accurate measure of the age of any individual Alu family member. Our data are consistent with a model in which all Alu family members have been made from a single master gene or from a series of sequential master genes. This master gene(s) accumulated diagnostic base changes, resulting in the amplification of different subfamilies from the master gene at different times in primate evolution. The changes in the master gene(s) probably occurred individually, but their appearance is clearly punctuated. Ten of them have occurred within an approximately 15-million-year time span, 40-25 million years ago, and 8 changes have occurred within the last 5 million years. Surprisingly, no changes appeared in the 20 million years separating these periods.
Collapse
Affiliation(s)
- M R Shen
- Department of Biochemistry and Molecular Biology, Louisiana State University Medical Center, New Orleans 70112
| | | | | |
Collapse
|
36
|
Filatov LV, Mamayeva SE, Tomilin NV. Alu family variations in neoplasia. CANCER GENETICS AND CYTOGENETICS 1991; 56:11-22. [PMID: 1747866 DOI: 10.1016/0165-4608(91)90357-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Human chromosomes contain about one million copies of dispersed repeats of the Alu family which are distributed non-randomly. In this study we have compared the pattern of hybridization of tritiated Alu-probes on chromosomes of PHA-stimulated lymphocytes of normal donors and of non-stimulated bone marrow cells of acute leukemia patients, and found regular differences in this pattern over some chromosome bands (3q26, 8p11-p12, 14q24, 15q21, 6q22) between normal individuals and leukemia patients. These data were interpreted as indicative of somatic variation of the Alu family in acute leukemia. Possible mechanisms of the variation and the role of the Alu family in chromosome rearrangements in neoplasia are discussed.
Collapse
MESH Headings
- Acute Disease
- Bone Marrow/pathology
- Cells, Cultured
- Chromosome Aberrations
- Chromosomes, Human, Pair 10
- Chromosomes, Human, Pair 14
- Chromosomes, Human, Pair 3
- Chromosomes, Human, Pair 6
- Chromosomes, Human, Pair 8
- Gene Rearrangement
- Genetic Variation
- Humans
- Karyotyping
- Leukemia/genetics
- Leukemia/pathology
- Leukemia, Myeloid/genetics
- Leukemia, Myeloid/pathology
- Lymphocytes/cytology
- Lymphocytes/pathology
- Metaphase
- Multigene Family
- Reference Values
- Repetitive Sequences, Nucleic Acid
Collapse
Affiliation(s)
- L V Filatov
- Institute of Cytology, Academy of Sciences of the USSR, Leningrad
| | | | | |
Collapse
|
37
|
Myöhänen S, Kauppinen L, Wahlfors J, Alhonen L, Jänne J. Human spermidine synthase gene: structure and chromosomal localization. DNA Cell Biol 1991; 10:467-74. [PMID: 2069720 DOI: 10.1089/dna.1991.10.467] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The human spermidine synthase (EC 2.5.1.16) gene was isolated from a genomic library constructed with DNA obtained from a human immunoglobulin G (IgG) myeloma cell line. Subsequent sequence analyses revealed that the gene comprised of 5,818 nucleotides from the cap site to the last A of the putative polyadenylation signal with 8 exons and 7 intervening sequences. The 5'-flanking region of the gene was extremely GC rich, lacking any TATA box but containing CCAAT consensus sequences. No perfect consensus sequence for the cAMP-responsive element for the AP-1 binding site was found, yet the gene contained seven AP-2 binding site consensus sequences. The putative polyadenylation signal was an unusual AATACA instead of AATAAA. Polymerase chain reaction analysis with DNA obtained from human x hamster somatic cell hybrids indicated that human spermidine synthase genomic sequences segregate with human chromosome 1. Transfection of the genomic clone into Chinese hamster ovary cells displaying a low endogenous spermidine synthase activity revealed that the gene was transiently expressed and hence in all likelihood represents a functional gene.
Collapse
Affiliation(s)
- S Myöhänen
- Department of Biochemistry & Biotechnology, University of Kuopio, Finland
| | | | | | | | | |
Collapse
|
38
|
Rad MR, Lützenkirchen K, Xu G, Kleinhans U, Hollenberg CP. The complete sequence of a 11,953 bp fragment from C1G on chromosome III encompasses four new open reading frames. Yeast 1991; 7:533-8. [PMID: 1897318 DOI: 10.1002/yea.320070513] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Affiliation(s)
- M R Rad
- Institut für Mikrobiologie, Heinrich-Heine-Universität, Düsseldorf, Germany
| | | | | | | | | |
Collapse
|
39
|
Labuda D, Sinnett D, Richer C, Deragon JM, Striker G. Evolution of mouse B1 repeats: 7SL RNA folding pattern conserved. J Mol Evol 1991; 32:405-14. [PMID: 1710278 DOI: 10.1007/bf02101280] [Citation(s) in RCA: 56] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
In a recent report mouse B1 genomic repeats were divided into six families representing different waves of fixation of B1 variants, consistent with the retroposition model of human Alu elements. These data are used to examine the distribution of nucleotide substitutions in individual genomic repeats with respect to family consensus sequences and to compare the minimal energy structures of the corresponding B1 RNAs. By an enzymatic approach the predicted structure of B1 RNAs is experimentally confirmed using as a model sequence an RNA of a young B1 family member transcribed in vitro by T7 RNA polymerase. B1 RNA preserves folding domains of the Alu fragment of 7SL RNA, its progenitor molecule. Our results reveal similarities among 7SL-like retroposons, human Alu, and rodent B1 repeats, and relate the evolutionary conservation of B1 family consensus sequences to selection at the RNA level.
Collapse
Affiliation(s)
- D Labuda
- Service de Génétique Médicale, Hôpital Ste-Justine, Département de Pédiatrie, Université de Montréal, Québec, Canada
| | | | | | | | | |
Collapse
|
40
|
Lomax MI, Hsieh CL, Darras BT, Francke U. Structure of the human cytochrome c oxidase subunit Vb gene and chromosomal mapping of the coding gene and of seven pseudogenes. Genomics 1991; 10:1-9. [PMID: 1646156 DOI: 10.1016/0888-7543(91)90476-u] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Subunit Vb of mammalian cytochrome c oxidase (COX; EC 1.9.3.1) is encoded by a nuclear gene and assembled with the other 12 COX subunits encoded in both mitochondrial and nuclear DNA. We have cloned the gene for human COX subunit Vb (COX5B) and determined the exon-intron structure by both hybridization analysis and DNA sequencing. The gene contains five exons and four introns; the four coding exons span a region of approximately 2.4 kb. The 5' end of the COX5B gene is GC-rich and contains many HpaII sites. Genomic Southern blot analysis of human DNA probed with the human COX Vb cDNA identified eight restriction fragments containing COX Vb-related sequences that were mapped to different chromosomes with panels of human x Chinese hamster somatic cell hybrids. Because only one of these fragments hybridized with a 210-bp probe from intron 4, we conclude that there is a single expressed gene for COX subunit Vb in the human genome. We have mapped this gene to chromosome 2, region cen-q13.
Collapse
Affiliation(s)
- M I Lomax
- Department of Anatomy and Cell Biology, University of Michigan, Ann Arbor 48109
| | | | | | | |
Collapse
|
41
|
Abstract
Of a total of 500,000 Alu family members, approximately 500 are present as a human-specific (HS) subfamily. Each of the HS subfamily members shares a high degree of nucleotide identity and is not present at orthologous positions in other primate genomes, suggesting that HS subfamily members have recently inserted within the human genome. This confirms the hypothesis that the majority of Alu family members are amplified copies of a "master" gene(s). This master gene appears to be amplifying at a rate much slower than that seen earlier in primate evolution. Some of the HS Alu subfamily members have amplified so recently that they are dimorphic in the human population, making them a potentially powerful tool for studies of human populations.
Collapse
Affiliation(s)
- M A Batzer
- Department of Biochemistry and Molecular Biology, Louisiana State University Medical Center, New Orleans 70112
| | | |
Collapse
|
42
|
Abstract
The existing classification of human Alu sequences is revised and expanded using a novel methodology and a larger set of sequence data. Our study confirms that there are two major Alu subfamilies, Alu-J and Alu-S. The Alu-S subfamily consists of at least five distinct subfamilies referred to as Alu-Sx, Alu-Sq, Alu-Sp, Alu-Sc, and Alu-Sb. The Alu-Sp and Alu-Sq subfamilies have been revealed by this study. Alu subfamilies differ from one another in a number of positions called diagnostic. In this paper the diagnostic positions are defined in quantitative terms and are used to evaluate statistical significance of the observed subfamilies. Each Alu subfamily most likely represents pseudogenes retroposed from evolving functional source Alu genes. Evidence presented in this paper indicates that Alu-Sp and Alu-Sc pseudogenes were retroposed from different source genes, during overlapping periods of time, and at different rates. Our analysis also indicates that the previously identified Alu-type transcript BC200 comes from an active Alu gene that might have existed even before the origin of dimeric Alu sequences. The source genes for Alu pseudogene families are reconstructed. It is assumed that diagnostic differences between reconstructed source genes reflect mutations that have occurred in true source Alu genes under natural selection. Some of these mutations are compensatory and are used to reconstruct a common secondary structure of Alu RNAs transcribed from the source genes. The biological function of Alu RNA is discussed in the context of its homology to the elongation-arresting domain of 7SL RNA.(ABSTRACT TRUNCATED AT 250 WORDS)
Collapse
Affiliation(s)
- J Jurka
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
| | | |
Collapse
|
43
|
Wells D, Bains W. Characterization of an unusual human histone H3.3 pseudogene. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1991; 2:125-7. [PMID: 1777675 DOI: 10.3109/10425179109039681] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The analysis of a genomic loci containing human histone H3.3 processed pseudogenes, has revealed two regions that are unusually rich in other retroposons. At one of the loci the H3.3 pseudogene is itself interrupted by 2 Alu repetitive sequences. The characterization of these two recently transposed Alus provides confirmation of the "multiple origin" hypothesis of these repetitive elements. The unusual occurrence of 3 different types of retroposons in a small region suggests that there may be particular chromosomal regions that are hot spots for retroposon insertion.
Collapse
Affiliation(s)
- D Wells
- Dept. of Biology, University of Houston, TX 77204
| | | |
Collapse
|
44
|
Stanford DR, Holicky EL, Perry CA, Rehder K, Harvey SE, Rohleder AM, Wieben ED. The snRNP E protein multigene family contains five pseudogenes with common mutations. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1991; 1:357-63. [PMID: 1839226 DOI: 10.3109/10425179109020790] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Sequence data from three previously-uncharacterized members of the snRNP E protein multigene family suggest that each is a non-transcribed processed pseudogene, even though one clone has the potential to code for a full-length protein with greater than 90% similarity to previously-characterized E protein cDNAs. Each of the newly-analyzed family members is without introns, contains a tract of polyadenylic acid residues, and is flanked by short direct repeats. In addition, the three sequences all contain point mutations that distinguish them from the E protein coding sequence. Seven point mutations are common to the three sequences described here and to two previously-described E protein pseudogenes. Although all of these mutations are transitions, only 5 of 7 could have been generated by deamination of methylated cytosines in inactive genes. Thus, the common mutations in the pseudogenes suggest an origin other than the expressed gene that we have described. Allelic variants for two of the pseudogenes were detected and repetitive elements are located near four of the five E protein pseudogenes that have been characterized.
Collapse
Affiliation(s)
- D R Stanford
- Department of Biochemistry and Molecular Biology, Mayo Clinic/Foundation, Rochester, MN 55905
| | | | | | | | | | | | | |
Collapse
|
45
|
Hickok NJ, Wahlfors J, Crozat A, Halmekytö M, Alhonen L, Jänne J, Jänne OA. Human ornithine decarboxylase-encoding loci: nucleotide sequence of the expressed gene and characterization of a pseudogene. Gene X 1990; 93:257-63. [PMID: 2227439 DOI: 10.1016/0378-1119(90)90233-h] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Previous studies have shown that human ornithine decarboxylase (ODC)-encoding sequences map to two chromosome regions: 2pter-p23 and 7cen-qter. In the present work we have cloned the expressed human ODC gene from a genomic library of myeloma cells that overproduce ODC protein due to selective gene amplification and determined its entire nucleotide sequence. The gene comprises 12 exons and 11 introns and spans about 8 kb of chromosome 2 DNA. The organization of the human gene is very similar to that of the mouse and rat, with the major difference being the presence of longer intronic sequences in the human gene. Some of these differences can be accounted for by the insertion of four Alu sequences in the human gene. Several potential regulatory elements are present in the promoter region and in 5'-proximal introns, including a TATA box; GC boses; AP-1-, AP-2- and NF-1-binding sites; and a cAMP-responsive element. The 5'-untranslated sequence of ODC mRNA is extremely GC-rich, and computer predictions suggest a very stable secondary structure for this region, with an overall free energy of formation of -225.4 kcal/mol. In addition to the active ODC gene on chromosome 2, ODC gene-related sequences were isolated from human chromosome 7-specific libraries and shown to represent a processed ODC pseudogene.
Collapse
Affiliation(s)
- N J Hickok
- Department of Dermatology, Thomas Jefferson University, Philadelphia, PA 19107
| | | | | | | | | | | | | |
Collapse
|
46
|
Del Pozzo G, Guardiola J. A SINE insertion provides information on the divergence of the HLA-DQA1 and HLA-DQA2 genes. Immunogenetics 1990; 31:229-32. [PMID: 2158485 DOI: 10.1007/bf00204892] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The class II region of the human major histocompatibility complex (MHC) contains a cluster of highly polymorphic genes organized into at least three subloci (DR, DQ, and DP), each encoding a subset of surface antigens participating in the modulation of the immune response. Genetic diversity in this system is brought about by two major mechanisms, hypermutation and trans-species evolution. The DQ subregion contains a pair of closely related A genes, HLA-DQA1 and HLA-DQA2, whose phylogenetic relationship is uncertain, although their generation by duplication of an ancestral A gene before or after speciation can be implied. We report here the presence of a member of the Alu repetitive family immediately 5' to the HLA-DQA1 gene. The sequence of this element indicates that it may have integrated by transposition at the time of divergence of hominoids from Old World monkeys. HLA-DQA2 carries an empty integration target site in place of the Alu, thereby suggesting that the insertion of Alu near HLA-DQA1 was preceded by the separation of the two genes.
Collapse
Affiliation(s)
- G Del Pozzo
- International Institute of Genetics and Biophysics, CNR, Napoli, Italy
| | | |
Collapse
|
47
|
Zuckerkandl E, Latter G, Jurka J. Maintenance of function without selection: Alu sequences as "cheap genes". J Mol Evol 1989; 29:504-12. [PMID: 2515297 DOI: 10.1007/bf02602922] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Continued insertion into the genome of functional Alu sequences is expected to compensate for the functional eclipse of older sequences attributable to structural adulteration and can be presumed to establish a renewable store of functional sequences at a relatively elevated numerical level. This store of functional sequences could be maintained at almost no selective cost. A strategy of maintaining function in multiple sequence copies with selection limited to a very few master (source) sequences may be resorted to also by other types of DNA sequences that are generated repeatedly during evolution and that are spread over many sectors of the genome.
Collapse
Affiliation(s)
- E Zuckerkandl
- Linus Pauling Institute of Science and Medicine, Palo Alto, California 94306
| | | | | |
Collapse
|
48
|
Neckelmann N, Warner CK, Chung A, Kudoh J, Minoshima S, Fukuyama R, Maekawa M, Shimizu Y, Shimizu N, Liu JD. The human ATP synthase beta subunit gene: sequence analysis, chromosome assignment, and differential expression. Genomics 1989; 5:829-43. [PMID: 2687158 DOI: 10.1016/0888-7543(89)90125-0] [Citation(s) in RCA: 60] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
In humans, the functional F0F1-ATP synthase beta subunit gene is located on chromosome 12 in the p13----qter region. Other partially homologous sequences have been detected on chromosomes 2 and 17. The bona fide beta subunit gene has 10 exons encoding a leader peptide of 49 amino acids and a mature protein of 480 amino acids. Thirteen Alu family DNA repeats are found upstream from the gene and in four introns. The gene has four "CCAAT" sequences upstream and in close proximity to the transcriptional initiation site. A 13-bp motif is found in the 5' nontranscribed region of both the beta subunit gene and an ADP/ATP translocator gene that is expressed in high levels in cardiac and skeletal muscle. Analysis of the beta subunit mRNA levels reveals marked differences among tissues. The highest levels are found in heart, lower levels in skeletal muscle, and the lowest levels in liver and kidney. These findings suggest that the tissue-specific levels of ATP synthase beta subunit mRNA may be generated through transcriptional control.
Collapse
MESH Headings
- Amino Acid Sequence
- Base Sequence
- Chromosome Mapping
- Chromosomes, Human, Pair 12
- Chromosomes, Human, Pair 17
- Chromosomes, Human, Pair 2
- Exons
- Humans
- Introns
- Molecular Sequence Data
- Organ Specificity/genetics
- Proton-Translocating ATPases/biosynthesis
- Proton-Translocating ATPases/genetics
- RNA, Messenger/analysis
- Repetitive Sequences, Nucleic Acid
- Sequence Homology, Nucleic Acid
- Transcription, Genetic
- Translocation, Genetic
Collapse
Affiliation(s)
- N Neckelmann
- Department of Biochemistry, Emory University School of Medicine, Atlanta, Georgia 30322
| | | | | | | | | | | | | | | | | | | |
Collapse
|