1
|
Vívenes De Lugo M, Rodríguez-Larralde A, Castro De Guerra D. Beta-globin gene cluster haplotypes as evidence of African gene flow to the northeastern coast of Venezuela. Am J Hum Biol 2003; 15:29-37. [PMID: 12552576 DOI: 10.1002/ajhb.10120] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
In order to study the origin of mutation HBB*S in Sucre and Anzoátegui states and the genetic affinities of these Venezuelan populations with other human groups, the beta-globin gene cluster haplotypes were determined for 28 sickle cell and/or S-beta thalassemia patients and for 37 individuals with normal hematological parameters. Bantu, Benin, Senegal, and atypical haplotypes were identified in 50%, 36%, 2%, and 12% of the HBB*S chromosomes, respectively. Similar results have been published for Venezuelan patients from the central states, but a different trend is shown in a publication based on a group of patients from different regions of the country. For HBB*A, haplotype 2 (+ - - - -), characteristic of non-African groups, was the most common (39%), followed by haplotype 3 (- - - - +) of African origin, and haplotype 6 (- + + - +), also typical of non-Africans. The results reveal a high level of admixture of the Sucre-Anzoátegui population. The importance of specific conditions which have acted differently in the Venezuelan populations, such as founder effect, genetic drift, isolation, and endogamy are discussed. Genetic distances between the Sucre-Anzoátegui sample and several other human populations calculated on the basis of the HBB*S and HBB*A haplotypes revealed similar results, the closest genetic relationships being observed in relation to Bantu-speaking groups. These results confirm the utility of the beta-globin haplotypes for population studies and contribute to knowledge of the Venezuelan gene pool.
Collapse
Affiliation(s)
- Merlyn Vívenes De Lugo
- Instituto Venezolano de Investigaciones Científicas (IVIC), Centro de Medicina Experimental, Laboratorio de Genética Humana, Caracas 1020 A, Venezuela
| | | | | |
Collapse
|
2
|
|
3
|
Long Q, Bengra C, Li C, Kutlar F, Tuan D. A long terminal repeat of the human endogenous retrovirus ERV-9 is located in the 5' boundary area of the human beta-globin locus control region. Genomics 1998; 54:542-55. [PMID: 9878258 DOI: 10.1006/geno.1998.5608] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Transcription of the human beta-like globin genes in erythroid cells is regulated by the far-upstream locus control region (LCR). In an attempt to define the 5' border of the LCR, we have cloned and sequenced 5 kb of new upstream DNA. We found an LTR retrotransposon belonging to the ERV-9 family of human endogenous retroviruses in the apparent 5' boundary area of the LCR. This ERV-9 LTR contains an unusual U3 enhancer region composed of 14 tandem repeats with recurrent GATA, CACCC, and CCAAT motifs. This LTR is conserved in human and gorilla, indicating its evolutionary stability in the genomes of the higher primates. In both recombinant constructs and the endogenous human genome, the LTR enhancer and promoter activate the transcription of cis-linked DNA preferentially in erythroid cells. Our findings suggest the possibility that this LTR retrotransposon may serve a relevant host function in regulating the transcription of the beta-globin LCR.
Collapse
Affiliation(s)
- Q Long
- Department of Medicine, Medical College of Georgia, Augusta, Georgia, 30912, USA
| | | | | | | | | |
Collapse
|
4
|
Hardison R, Riemer C, Chui DH, Huisman TH, Miller W. Electronic access to sequence alignments, experimental results, and human mutations as an aid to studying globin gene regulation. Genomics 1998; 47:429-37. [PMID: 9480762 DOI: 10.1006/geno.1997.5147] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- R Hardison
- Department of Biochemistry, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| | | | | | | | | |
Collapse
|
5
|
Chiu CH, Schneider H, Slightom JL, Gumucio DL, Goodman M. Dynamics of regulatory evolution in primate beta-globin gene clusters: cis-mediated acquisition of simian gamma fetal expression patterns. Gene 1997; 205:47-57. [PMID: 9461379 DOI: 10.1016/s0378-1119(97)00476-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Phylogenetic reconstructions by parsimony were carried out on an enlarged body of primate gamma1 and gamma2-globin sequences. The results confirm that gamma1 and gamma2 arose from a tandem duplication in an ancient simian lineage ancestral to both platyrrhines (New World monkeys) and catarrhines (Old World monkeys and hominoids). Gene conversions between the two gamma homologs were frequent over the gamma gene proper but less frequent over the 5' flanking and very infrequent over the 3' flanking regions. The ancient platyrrhine conversion in the most distal 5' flanking region had the polarity of gamma2-->gamma1. Recent platyrrhine conversions between 5' regulatory sequences were very infrequent, in striking contrast to catarrhines which have large, uninterrupted stretches of converted 5' regulatory sequences. Comparisons of reconstructed ancestral primate and simian gamma promoter sequences revealed an accumulation of 21 nucleotide substitutions concentrated in or near cis-elements that may have mediated the change from embryonic to fetal gamma expression. Almost all 21 substitutions were retained in the lineages leading to functional gamma genes of extant catarrhines (both gamma1 and gamma2) and platyrrhines (most often gamma2). Fewer of these simian specific substitutions were retained in the platyrrhine gamma1 genes and new mutations occurred more often in the platyrrhine gamma1 than gamma2 promoters.
Collapse
Affiliation(s)
- C H Chiu
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI 48201, USA
| | | | | | | | | |
Collapse
|
6
|
Hardison R, Slightom JL, Gumucio DL, Goodman M, Stojanovic N, Miller W. Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene X 1997; 205:73-94. [PMID: 9461381 DOI: 10.1016/s0378-1119(97)00474-5] [Citation(s) in RCA: 175] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Locus control regions (LCRs) are cis-acting DNA segments needed for activation of an entire locus or gene cluster. They are operationally defined as DNA sequences needed to achieve a high level of gene expression regardless of the position of integration in transgenic mice or stably transfected cells. This review brings together the large amount of DNA sequence data from the beta-globin LCR with the vast amount of functional data obtained through the use of biochemical, cellular and transgenic experimental systems. Alignment of orthologous LCR sequences from five mammalian species locates numerous conserved regions, including previously identified cis-acting elements within the cores of nuclease hypersensitive sites (HSs) as well as conserved regions located between the HS cores. The distribution of these conserved sequences, combined with the effects of LCR fragments utilized in expression studies, shows that important sites are more widely distributed in the LCR than previously anticipated, especially in and around HS2 and HS3. We propose that the HS cores plus HS flanking DNAs comprise a 'unit' to which proteins bind and form an optimally functional structure. Multiple HS units (at least three: HS2, HS3 and HS4 cores plus flanking DNAs) together establish a chromatin structure that allows the proper developmental regulation of genes within the cluster.
Collapse
Affiliation(s)
- R Hardison
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park 16802, USA.
| | | | | | | | | | | |
Collapse
|
7
|
Cavallesco R, Tuan D. Modulatory subdomains of the HS2 enhancer differentially regulate enhancer activity in erythroid cells at different developmental stages. Blood Cells Mol Dis 1997; 23:8-26. [PMID: 9215747 DOI: 10.1006/bcmd.1997.0115] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The HS2 enhancer in the locus control region of human beta-like globin genes displays developmental-stage-independent enhancer function. The mechanism by which it regulates the transcription of the globin genes in erythroid cells throughout development is not fully understood. In this paper we dissect the HS2 enhancer into an enhancer core and five modulatory subdomains M1 to M5. The enhancer core possesses developmental-stage-independent enhancer activity. The modulatory subdomains by themselves do not possess such enhancer activity, but they apparently respond to environmental signals and modulate enhancer core activity in a developmental-stage specific manner. M1 located 5' of the core strongly stimulates core activity in K562 cells at the embryonic stage. M2 and M3 located 3' of the core strongly stimulate core activity in MEL cells at the adult stage. Moreover, M3 suppresses core activity at the embryonic stage and exhibits an adult-stage-selector activity. These findings indicate that the apparent developmental-stage-independence of the HS2 enhancer is a result of multiple interactions between the core and the modulatory subdomains located both near and far from the core.
Collapse
Affiliation(s)
- R Cavallesco
- Harvard-MIT Division of Health Science and Technology and Center for Cancer Research, Massachusetts Institute of Technology, Cambridge 02139, USA
| | | |
Collapse
|
8
|
Benlian P, Etienne J, de Gennes JL, Noé L, Brault D, Raisonnier A, Arnault F, Hamelin J, Foubert L, Chuat JC. Homozygous deletion of exon 9 causes lipoprotein lipase deficiency: possible intron-Alu recombination. J Lipid Res 1995. [DOI: 10.1016/s0022-2275(20)39913-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
|
9
|
Koop B. Human and rodent genomes: Discrepant patterns of DNA sequence similarity. Anim Biotechnol 1994. [DOI: 10.1080/10495399409525810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
10
|
Winichagoon P, Fucharoen S, Wilairat P, Chihara K, Fukumaki Y. Nondeletional type of hereditary persistence of fetal haemoglobin: molecular characterization of three unrelated Thai HPFH. Br J Haematol 1994; 87:797-804. [PMID: 7527242 DOI: 10.1111/j.1365-2141.1994.tb06740.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The beta-globin gene clusters of three unrelated Thai families with a nondeletional type of hereditary persistence of fetal haemoglobin (HPFH) were studied using polymerase chain reaction-related techniques. All appeared to have normal nucleotide sequences from the Cap site to position -400 of both the G gamma- and A gamma-globin genes. Two individuals suspected of having a beta-thalassaemia gene linked to the high HbF condition also had a normal beta-globin gene sequence, spanning from position -108 from the Cap site to the polyadenylation site. Deletion of four nucleotides, AGCA, at positions -225 to -222 of one A gamma-globin allele was detected in one subject and was confirmed by dot-blot hybridization. Restriction fragment length polymorphisms in the beta-globin gene cluster showed that the 5' haplotype (-+-++) and the presence (+) of an Xmm 1 polymorphic site at -158 of the G gamma-globin gene are associated with the high F phenotype in these families. Direct sequencing of the 5' hypersensitive-2 (5' HS-2) site of the locus control region (LCR) showed that this Xmn 1 (+) site is also linked to a specific rearrangement of TA repeats (TA)9CACATATACG(TA)10, in HS-2 segment.
Collapse
Affiliation(s)
- P Winichagoon
- Department of Medicine, Faculty of Medicine, Siriraj Hospital, Thailand
| | | | | | | | | |
Collapse
|
11
|
Trepicchio WL, Dyer MA, Hardison RC, Baron MH. Upstream regulatory region of the human embryonic beta-like globin gene, epsilon. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1994; 4:409-12. [PMID: 7841465 DOI: 10.3109/10425179409010191] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The sequence of the human embryonic beta-like globin gene (epsilon) upstream regulatory region has been reported previously. In the course of our own work, we found a significant number of discrepancies between our sequence and the data base sequence, which we show here to contain large clusters of errors within functional epsilon-globin regulatory domains.
Collapse
Affiliation(s)
- W L Trepicchio
- Department of Cellular and Developmental Biology, Harvard University, Cambridge, MA 02138
| | | | | | | |
Collapse
|
12
|
Boulikas T. Homeodomain protein binding sites, inverted repeats, and nuclear matrix attachment regions along the human beta-globin gene complex. J Cell Biochem 1993; 52:23-36. [PMID: 8320272 DOI: 10.1002/jcb.240520105] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
beta-Globin genes in primates arose during evolution by duplication of an ancestral gene, and their order of arrangement along the DNA is related to their timing of expression during development. We believe that nuclear matrix anchorage sites (MARs) along the beta-globin gene complex considered to be mass binding sites for transcription protein factors, some of which are developmental stage specific and others ubiquitous, play a decisive role in cell memory by determining the developmental stage-specific expression of the genes. The AT-rich class of MARs appears to possess a significant number of ATTA and ATTTA motifs known to be mass binding sites for homeodomain proteins that determine body formation in development. MARs also appear to harbor origins of replication, to be enriched in inverted repeats (dyad symmetry motifs) and were proposed to include the DNase I hypersensitive sites of a particular gene determined at the chromatin level. This study is an attempt to finely identify MARs at the nucleotide level along the beta-globin gene complex. Searches of a contiguous stretch of about 73.3 kb of human sequences comprising and surrounding the epsilon, gamma G-, gamma A-, delta-, and beta-globin genes of the human beta-globin gene complex for homeotic protein binding sites as well as for inverted repeats has shown that these elements are clustered nonrandomly at particular sites within the beta-globin gene complex. These sites are presumed to be the AT-rich class of MARs of the beta-globin gene complex. The inverted repeats which are characteristic of origins of replication and some promoter/enhancer regions and the homeotic protein sites seem to include the DNase I hypersensitive sites of the gene complex. Indeed, dyad symmetry sequences are present close to the four DNase I HS sites in the locus control region (LCR) of the gene complex as well as in the 5' flanking regions and the large introns of the delta- and beta-globin genes. A search of the putative MAR regions of the gene complex suggests that, in addition to their enrichment in ATTA motifs, palindromes, and DNase I hypersensitive sites, these regions may comprise TG-rich motifs and potential Z-DNA as well as polypurine and polypyrimidine blocks. From the positions of palindromes and clusters of homeodomain protein sites along the complex we propose that an extended origin of replication able to initiate at several sites is present in the LCR and two others surrounding the delta- and beta-globin genes.(ABSTRACT TRUNCATED AT 400 WORDS)
Collapse
Affiliation(s)
- T Boulikas
- Institute of Molecular Medical Sciences, Palo Alto, California 94306
| |
Collapse
|
13
|
Hardison R, Chao KM, Adamkiewicz M, Price D, Jackson J, Zeigler T, Stojanovic N, Miller W. Positive and negative regulatory elements of the rabbit embryonic epsilon-globin gene revealed by an improved multiple alignment program and functional analysis. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1993; 4:163-76. [PMID: 8161819 DOI: 10.3109/10425179309015629] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The epsilon-globin genes of mammals are expressed in early embryos, but are silenced during fetal and adult erythropoiesis. As a guide to defining the regulatory elements involved in this developmental switch, we have searched the sequences of epsilon-globin genes from different mammals for highly conserved segments. The search was facilitated by the development of a new program, called yama, to generate a multiple alignment of very long sequences using an improved scoring scheme. This allowed us to generate a multiple alignment of sequences from a more divergent group than previously analyzed, as demonstrated here for representatives of four mammalian orders. In parallel experiments, we constructed a series of deletion mutations in the 5' flank of the rabbit epsilon-globin gene and tested their effect on an epsilon-globin-luciferase hybrid reporter gene. These results show that 121 bp of 5' flank, containing CACC, CCAAT and ATA motifs, is sufficient for expression in erythroid K562 cells. Both positive and negative cis-acting control sequences are located between 218 and 394 bp 5' to the cap site, in a region previously proposed to be a silencer. The positive regulatory sequence contains conserved binding sites for the nuclear protein YY1 adjacent to another highly conserved sequence. The negative element contains a conserved sequence followed by a purine-rich segment. This analysis maps the upstream control sequences more precisely and points to a very complex regulatory scheme for this gene.
Collapse
Affiliation(s)
- R Hardison
- Department of Molecular and Cell Biology, Pennsylvania State University, University Park 16802
| | | | | | | | | | | | | | | |
Collapse
|
14
|
Tagle DA, Stanhope MJ, Siemieniak DR, Benson P, Goodman M, Slightom JL. The beta globin gene cluster of the prosimian primate Galago crassicaudatus: nucleotide sequence determination of the 41-kb cluster and comparative sequence analyses. Genomics 1992; 13:741-60. [PMID: 1639402 DOI: 10.1016/0888-7543(92)90150-q] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The nucleotide sequence of the beta globin gene cluster of the prosimian Galago crassicaudatus has been determined. A total sequence spanning 41,101 bp contains and links together previously published sequences of the five galago beta-like globin genes (5'-epsilon-gamma-psi eta-delta-beta-3'). A computer-aided search for middle interspersed repetitive sequences identified 10 LINE (L1) elements, including a 5' truncated repeat that is orthologous to the full-length L1 element found in the human epsilon-gamma intergenic region. SINE elements that were identified included one Alu type I repeat, four Alu type II repeats, and two methionine tRNA-derived Monomer (type III) elements. Alu type II and Monomer sequences are unique to the galago genome. Structural analyses of the cluster sequence reveals that it is relatively A+T rich (about 62%) and regions with high G+C content are associated primarily with globin coding regions. Comparative analyses with the beta globin cluster sequences of human, rabbit, and mouse reveal extensive sequence homologies in their genic regions, but only human, galago, and rabbit sequences share extensive intergenic sequence homologies. Divergence analyses of aligned intergenic and flanking sequences from orthologous human, galago, and rabbit sequences show a gradation in the rate of nucleotide sequence evolution along the cluster where sequences 5' of the epsilon globin gene region show the least sequence divergence and sequences just 5' of the beta globin gene region show the greatest sequence divergence.
Collapse
Affiliation(s)
- D A Tagle
- Department of Molecular Biology, Wayne State University School of Medicine, Detroit, Michigan 48201
| | | | | | | | | | | |
Collapse
|
15
|
Simon MC, Pevny L, Wiles MV, Keller G, Costantini F, Orkin SH. Rescue of erythroid development in gene targeted GATA-1- mouse embryonic stem cells. Nat Genet 1992; 1:92-8. [PMID: 1302015 DOI: 10.1038/ng0592-92] [Citation(s) in RCA: 205] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Development of definitive (fetal liver-derived) red cells is blocked by a targeted mutation in the gene encoding the transcription factor GATA-1. We used in vitro differentiation of GATA-1- mouse embryonic stem (ES) cells to reveal a requirement for GATA-1 during primitive (yolk sac-derived) erythropoiesis and to establish a rescue assay. We show that the block to development includes primitive, as well as definitive, erythroid cells and is complete at the level of globin RNA expression; that the introduction of a normal GATA-1 gene restores developmental potential both in vivo and in vitro; and that efficient rescue is dependent on a putative autoregulatory GATA-motif in the distal promoter. Use of in vitro differentiated ES cells bridges a gap between conventional approaches to gene function in cell lines and analysis of loss of function mutations in the whole animal.
Collapse
Affiliation(s)
- M C Simon
- Division of Hematology-Oncology, Children's Hospital, Boston, Massachusetts
| | | | | | | | | | | |
Collapse
|
16
|
Gutman PD, Cao SX, Dave HP, Mittelman M, Schechter AN. Binding of erythroid and non-erythroid nuclear proteins to the silencer of the human epsilon-globin-encoding gene. Gene 1992; 110:197-203. [PMID: 1537556 DOI: 10.1016/0378-1119(92)90648-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
To clarify the molecular mechanisms involved in the developmental control of hemoglobin-encoding genes we have been studying the expression of these genes in human cells in continuous culture. We have previously reported the presence of a transcriptional control element with the properties of a silencer extending from -392 to -177 bp relative to the cap site of the human epsilon-globin-encoding gene [Cao et al., Proc. Natl. Acad. Sci. USA 86 (1989) 5306-5309]. We also showed that this silencer has stronger inhibitory activity in HeLa cells, as compared to K562 human erythroleukemia cells. Using deletion mutants and cis-cloned synthetic oligodeoxyribonucleotides in transient expression assays, nucleotide sequences responsible for this effect have now been further delimited to 44 bp located from -294 to -251 bp. Gel electrophoresis mobility shift assays and DNaseI footprinting assays demonstrate that these negative regulatory sequences are recognized differently by proteins present in nuclear extracts obtained from HeLa and K562 cells. Two binding proteins are detected in K562 nuclear extracts, while only one is found in extracts from HeLa cells. Possible mechanisms by which these proteins may regulate transcription of the epsilon-globin-encoding gene in erythroid and non-erythroid cells are discussed.
Collapse
Affiliation(s)
- P D Gutman
- National Institute of Diabetes, Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892
| | | | | | | | | |
Collapse
|
17
|
Sankaranarayanan K. Ionizing radiation and genetic risks. I. Epidemiological, population genetic, biochemical and molecular aspects of Mendelian diseases. Mutat Res 1991; 258:3-49. [PMID: 2023599 DOI: 10.1016/0165-1110(91)90027-s] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
This paper reviews the currently available information on naturally occurring Mendelian diseases in man; it is aimed at providing a background and framework for discussion of experimental data on radiation-induced mutations (papers II and III) and for the estimation of the risk of Mendelian disease in human populations exposed to ionizing radiation (paper IV). Current consensus estimates indicate that a total of about 125 per 10(4) livebirths are directly affected by one or another naturally occurring Mendelian disease (autosomal dominants, 95/10(4); X-linked ones, 5/10(4); and autosomal recessives, 25/10(4). These estimates are conservative and take into account conditions which are very rare and for which prevalence estimates are unavailable. Most, although not all, of the recognized "common" dominants have onset in adult ages while most sex-linked and autosomal recessives have onset at birth or in childhood. Autosomal dominant and X-linked diseases (i.e., the responsible mutant alleles) presumed to be maintained in the population due to a balance between mutation and selection are the ones which may be expected to increase in frequency as a result of radiation exposures. Viewed from this standpoint, the above assumption seems safe only for a small proportion of such diseases; for the remainder, there is no easy way to discriminate between different mechanisms that may be responsible or to rigorously exclude some in favor of some others. Mutations in genes that code for enzymic proteins are more often recessive in contrast to those that code for non-enzymic proteins, which are more often dominant. At the molecular level, with recessives, a wide variety of changes is possible and these include specific types of point mutations, small and large intragenic deletions, multilocus deletions and rearrangements. In the case of dominants, however, the kinds of recoverable point mutations and deletion-type changes are less extensive because of functional constraints. The mutational potential of genes varies, depending on the gene, its size, sequence content and arrangement, location and its normal functions, and can be grouped into three groups: those in which only point mutations have been found to occur, those in which only deletions or other gross changes have been recovered and those in which both kinds of changes are known. Molecular data are available for about 75 Mendelian conditions and these suggest that in approximately 50% of them, the changes categorized to date are point mutations and in the remainder, intragenic deletions or other gross changes; there does not seem to be any fundamental difference between dominants and recessives with respect to the underlying molecular defect.(ABSTRACT TRUNCATED AT 400 WORDS)
Collapse
Affiliation(s)
- K Sankaranarayanan
- MGC Department of Radiation Genetics and Chemical Mutagenesis, Sylvius Laboratories, State University of Leiden, The Netherlands
| |
Collapse
|
18
|
Li Q, Zhou B, Powers P, Enver T, Stamatoyannopoulos G. Primary structure of the goat beta-globin locus control region. Genomics 1991; 9:488-99. [PMID: 2032720 DOI: 10.1016/0888-7543(91)90415-b] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The goat beta-globin cluster is composed of a triplicated four-gene set. A locus control region (LCR) containing elements homologous to 5'DNase I hypersensitive sites (HS) 1, 2, and 3 of the human beta-globin LCR has been identified at the 5' end of this locus. We determined 10.2 kb of nucleotide sequence from the goat beta-globin locus control region. Self-comparison of this sequence by dot matrix analysis revealed the presence of six complete and three incomplete artiodactyl repeats. A novel repeated element, termed D repeat, was also identified. Southern blotting analysis demonstrated that these elements exist in the goat genome as a low to medium frequency interspersed repeat family. The absence of any other large region of self-homology (direct or inverted) in the goat LCR suggests that 5'HSs 1, 2, and 3 did not arise through duplication, but rather evolved independently. By comparing goat 5'HS 1 to those of human, rabbit, and mouse, we show a greater than 80% conservation in sequence between the four species. This level of evolutionary conservation suggests that 5'HS 1 plays an important role in the regulation of beta-globin loci.
Collapse
Affiliation(s)
- Q Li
- Shanghai Institute of Biochemistry, Chinese Academy of Sciences
| | | | | | | | | |
Collapse
|
19
|
Enver T, Raich N, Ebens AJ, Papayannopoulou T, Costantini F, Stamatoyannopoulos G. Developmental regulation of human fetal-to-adult globin gene switching in transgenic mice. Nature 1990; 344:309-13. [PMID: 2314472 DOI: 10.1038/344309a0] [Citation(s) in RCA: 294] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Transgenic mice containing a human fetal (gamma-) or adult (beta-) globin gene linked to the beta-globin gene locus activation region (LAR) express the gene throughout development. By contrast, transgenic mice containing LAR linked to both a fetal and an adult globin gene display the normal developmental switch from fetal to adult gene expression. This suggests that the human fetal-to-adult globin gene switch is controlled through a mutually exclusive interaction between LAR and either the gamma- or beta-globin gene, resulting in the expression of only one gene at any given moment.
Collapse
Affiliation(s)
- T Enver
- Department of Medicine, University of Washington, Seattle 98195
| | | | | | | | | | | |
Collapse
|
20
|
Moyzis RK, Torney DC, Meyne J, Buckingham JM, Wu JR, Burks C, Sirotkin KM, Goad WB. The distribution of interspersed repetitive DNA sequences in the human genome. Genomics 1989; 4:273-89. [PMID: 2714792 DOI: 10.1016/0888-7543(89)90331-5] [Citation(s) in RCA: 147] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The distribution of interspersed repetitive DNA sequences in the human genome has been investigated, using a combination of biochemical, cytological, computational, and recombinant DNA approaches. "Low-resolution" biochemical experiments indicate that the general distribution of repetitive sequences in human DNA can be adequately described by models that assume a random spacing, with an average distance of 3 kb. A detailed "high-resolution" map of the repetitive sequence organization along 400 kb of cloned human DNA, including 150 kb of DNA fragments isolated for this study, is consistent with this general distribution pattern. However, a higher frequency of spacing distances greater than 9.5 kb was observed in this genomic DNA sample. While the overall repetitive sequence distribution is best described by models that assume a random distribution, an analysis of the distribution of Alu repetitive sequences appearing in the GenBank sequence database indicates that there are local domains with varying Alu placement densities. In situ hybridization to human metaphase chromosomes indicates that local density domains for Alu placement can be observed cytologically. Centric heterochromatin regions, in particular, are at least 50-fold underrepresented in Alu sequences. The observed distribution for repetitive sequences in human DNA is the expected result for sequences that transpose throughout the genome, with local regions of "preference" or "exclusion" for integration.
Collapse
Affiliation(s)
- R K Moyzis
- Genetics Group, Los Alamos National Laboratory, University of California, New Mexico 87545
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Ryan TM, Behringer RR, Martin NC, Townes TM, Palmiter RD, Brinster RL. A single erythroid-specific DNase I super-hypersensitive site activates high levels of human beta-globin gene expression in transgenic mice. Genes Dev 1989; 3:314-23. [PMID: 2721958 DOI: 10.1101/gad.3.3.314] [Citation(s) in RCA: 239] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Erythroid-specific DNase I super-hypersensitive (HS) sites that are normally located far upstream of the human beta-globin locus were inserted immediately upstream of a 4.1-kb fragment containing the human beta-globin gene. These constructs (HS beta) and a construct containing the beta-globin gene alone (beta) were microinjected into fertilized mouse eggs, and expression was analyzed in erythroid fetal liver and brain of day-16 embryos that developed. Only 7 of 23 animals that contained the beta gene alone expressed human beta-globin mRNA in erythroid tissue, and the average level of expression per gene copy was 0.3% of endogenous mouse beta-globin mRNA. In contrast, 50 of 51 transgenic mice that contained various HS beta constructs expressed the transgene specifically in erythroid tissue. The average level of expression per gene copy for constructs containing all five upstream HS sites was 109% of endogenous mouse beta-globin mRNA. Constructs that contained a single super-hypersensitive site (HS II beta) expressed 40% as much human beta-globin as mouse beta-globin mRNA per gene copy. These results demonstrate that the HS VI site, normally located downstream of the human beta-globin locus, is not required for high-level expression. Furthermore, the results demonstrate that high levels of human beta-globin gene expression can be obtained in transgenic mice even when a relatively small fragment of DNA (1.9 kb) containing erythroid-specific super-hypersensitive site II (HS II) is inserted upstream of the human beta-globin gene.
Collapse
Affiliation(s)
- T M Ryan
- Department of Biochemistry, School of Medicine, University of Alabama, Birmingham 35294
| | | | | | | | | | | |
Collapse
|
22
|
Margot JB, Demers GW, Hardison RC. Complete nucleotide sequence of the rabbit beta-like globin gene cluster. Analysis of intergenic sequences and comparison with the human beta-like globin gene cluster. J Mol Biol 1989; 205:15-40. [PMID: 2486295 DOI: 10.1016/0022-2836(89)90362-8] [Citation(s) in RCA: 63] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The nucleotide sequence of the entire beta-like globin gene cluster of rabbits has been determined. This sequence of a continuous stretch of 44.5 x 10(3) base-pairs (bp) starts about 6 x 10(3) bp upstream from epsilon (the 5'-most gene) and ends about 12 x 10(3) bp downstream from beta (the 3'-most gene). Analysis of the sequence reveals that: (1) the sequence is relatively A + T rich (about 60%); (2) regions with high G + C content are associated with OcC repeats, a short interspersed repeated DNA in rabbits; (3) the distribution of polypurines, polypyrimidines and alternating purine/pyrimidine tracts is not random within the cluster; (4) most open reading frames are associated with known globin coding regions, OcC repeats or long interspersed repeats (L1 repeats); (5) the most prominent open reading frames are found in the L1 repeats; (6) different strand asymmetries in base composition are associated with embyronic and adult genes as well as the tandem L1 repeats at the 3' end of the cluster; and (7) essentially all the repeats appear to have been inserted by a transposon mechanism. A comparison of the sequence with itself by a dot-plot analysis has revealed nine new members of the OcC family of repeats in addition to the six previously reported. The OcC repeats tend to be clustered, particularly in the epsilon-gamma and gamma-psi delta intergenic regions. Dot-plot comparisons between the rabbit and the human clusters have revealed extensive sequence matches. Homology starts about 6 x 10(3) bp 5' to epsilon or as far upstream as the rabbit sequence is available. It continues throughout the entire cluster and stops about 0.7 x 10(3) bp 3' to beta, at which point several repeats have inserted in both rabbits and humans. Throughout the gene cluster, the homology is interrupted mainly by insertions or deletions in either the rabbit or the human genome. Almost all of the insertions are of known short or long repeated DNAs. The positions of the insertions are different in the two gene clusters, which indicates that both short and long repeats have been transposing throughout the genome for the time since the mammalian radiation. An alignment of rabbit and human sequences allows the calculation of the substitution rate around epsilon. Sequences far removed from the gene are evolving at a rate equivalent to the pseudogene rate, although some short regions show an apparently higher rate.(ABSTRACT TRUNCATED AT 400 WORDS)
Collapse
Affiliation(s)
- J B Margot
- Department of Molecular and Cell Biology, Paul M. Althouse Laboratory, Pennsylvania State University, University Park 16802
| | | | | |
Collapse
|
23
|
Abstract
Reports of single base-pair mutations within gene coding regions causing human genetic disease were collated. Thirty-five per cent of mutations were found to have occurred within CpG dinucleotides. Over 90% of these mutations were C----T or G----A transitions, which thus occur within coding regions at a frequency 42-fold higher than that predicted from random mutations. These findings are consistent with methylation-induced deamination of 5-methyl cytosine and suggest that methylation of DNA within coding regions may contribute significantly to the incidence of human genetic disease.
Collapse
Affiliation(s)
- D N Cooper
- Haematology Department, King's College School of Medicine and Dentistry, London, UK
| | | |
Collapse
|
24
|
Scott AF, Schmeckpeper BJ, Abdelrazik M, Comey CT, O'Hara B, Rossiter JP, Cooley T, Heath P, Smith KD, Margolet L. Origin of the human L1 elements: proposed progenitor genes deduced from a consensus DNA sequence. Genomics 1987; 1:113-25. [PMID: 3692483 PMCID: PMC7135745 DOI: 10.1016/0888-7543(87)90003-6] [Citation(s) in RCA: 256] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
A consensus sequence for the human long interspersed repeated DNA element, L1Hs (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1Hs ORFs are separated by a small evolutionarily nonconserved region. The 5' end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5' ORF extends for a minimum of 723 bp (241 codons). The 3' ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3' end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1Hs consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1Hs sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination.
Collapse
Affiliation(s)
- A F Scott
- Department of Medicine, Johns Hopkins University School of Medicine, Johns Hopkins Hospital, Baltimore, Maryland 21205
| | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Gibbs PE, Zielinski R, Boyd C, Dugaiczyk A. Structure, polymorphism, and novel repeated DNA elements revealed by a complete sequence of the human alpha-fetoprotein gene. Biochemistry 1987; 26:1332-43. [PMID: 2436661 DOI: 10.1021/bi00379a020] [Citation(s) in RCA: 76] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The human alpha-fetoprotein gene spans 19,489 base pairs from the putative "Cap" site to the polyadenylation site. It is composed of 15 exons separated by 14 introns, which are symmetrically placed within the three domains of alpha-fetoprotein. In the 5' region, a putative TATAAA box is at position -21, and a variant sequence, CCAAC, of the common CAT box is at -65. Enhancer core sequences GTGGTTTAAAG are found in introns 3 and 4, and several copies of glucocorticoid response sequences AGATACAGTA are found on the template strand of the gene. There are six polymorphic sites within 4690 base pairs of contiguous DNA derived from two allelic alpha-fetoprotein genes. This amounts to a measured polymorphic frequency of 0.13%, or 6.4 X 10(-4)/site, which is about 5-10 times lower than values estimated from studies on polymorphic restriction sites in other regions of the human genome. There are four types of repetitive sequence elements in the introns and flanking regions of the human alpha-fetoprotein gene. At least one of these is apparently a novel structure (designated Xba) and is found as a pair of direct repeats, with one copy in intron 7 and the other in intron 8. It is conceivable that within the last 2 million years the copy in intron 8 gave rise to the repeat in intron 7. Their present location on both sides of exon 8 gives these sequences a potential for disrupting the functional integrity of the gene in the event of an unequal crossover between them. There are three Alu elements, one of which is in intron 4; the others are located in the 3' flanking region. A solitary Kpn repeat is found in intron 3. The Xba and Kpn repeats were only detected by complete sequencing of the introns. Neither X, Xba, nor Kpn elements are present in the related human albumin gene, whereas Alu's are present in different positions. From phylogenetic evidence, it appears that Alu elements were inserted into the alpha-fetoprotein gene at some time postdating the mammalian radiation 85 million years ago.
Collapse
|
26
|
Primate Sequences. Primates 1987. [DOI: 10.1016/b978-0-12-512511-6.50005-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
27
|
Abstract
We describe a versatile and simplified non-random strategy for nucleotide sequencing. This procedure avoids exposure of the vector DNA to endo- or exonucleases during construction of the subfragment library. The advantages resulting from this strategy are: (1) minimal manipulation in construction of the subfragment library, (2) no need for compatible restriction sites, (3) no insert size limitation, (4) can be used with both chemical and enzymatic sequencing methods. Hence, the procedure provides simplicity, universality and versatility for non-random nucleotide sequencing. We demonstrate the usefulness of this procedure by obtaining the nucleotide sequence of a goat 3.7-kb EcoRI fragment, which is 5' of the epsilon globin gene.
Collapse
Affiliation(s)
- Q L Li
- Shanghai Institute of Biochemistry, Chinese Academy of Sciences, China
| | | |
Collapse
|
28
|
Koop BF, Miyamoto MM, Embury JE, Goodman M, Czelusniak J, Slightom JL. Nucleotide sequence and evolution of the orangutan epsilon globin gene region and surrounding Alu repeats. J Mol Evol 1986; 24:94-102. [PMID: 3031321 DOI: 10.1007/bf02099956] [Citation(s) in RCA: 48] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
We have mapped and sequenced the epsilon globin gene and seven surrounding Alu repeat sequences in the orangutan beta globin gene cluster and have compared these and other orangutan sequences to orthologously related human sequences. Noncoding flanking and intron sequences, synonymous sites of alpha, gamma, and epsilon globin coding regions, and Alu sequences in human and orangutan diverge by 3.2%, 2.7%, and 3.7%, respectively. These values compare to 3.6% from DNA hybridizations and 3.4% from the psi eta globin gene region. If as suggested by fossil evidence and "molecular clock" calculations, human and orangutan lineages diverged about 10-15 MYA, the rate of noncoding DNA evolution in the two species is 1.0-1.5 X 10(-9) substitutions per site per year. We found no evidence for either the addition or deletion of Alu sequences from the beta globin gene cluster nor is there any evidence for recent concerted evolution among the Alu sequences examined. Both phylogenetic and phenetic distance analyses suggest that Alu sequences within the alpha and beta globin gene clusters arose close to the time of simian and prosimian primate divergence (about 50-60 MYA). We conclude that Alu sequences have been evolving at the rate typical of noncoding DNA for the majority of primate history.
Collapse
|