1
|
Hollox EJ, Barber JCK, Brookes AJ, Armour JAL. Defensins and the dynamic genome: what we can learn from structural variation at human chromosome band 8p23.1. Genome Res 2009; 18:1686-97. [PMID: 18974263 DOI: 10.1101/gr.080945.108] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Over the past four years, genome-wide studies have uncovered numerous examples of structural variation in the human genome. This includes structural variation that changes copy number, such as deletion and duplication, and structural variation that does not change copy number, such as orientation and positional polymorphism. One region that contains all these types of variation spans the chromosome band 8p23.1. This region has been studied in some depth, and the focus of this review is to examine our current understanding of the variation of this region. We also consider whether this region is a good model for other structurally variable regions in the genome and what the implications of this variation are for clinical studies. Finally, we discuss the bioinformatics challenges raised, discuss the evolution of the region, and suggest some future priorities for structural variation research.
Collapse
Affiliation(s)
- Edward J Hollox
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom.
| | | | | | | |
Collapse
|
2
|
Abstract
Population genetics is central to our understanding of human variation, and by linking medical and evolutionary themes, it enables us to understand the origins and impacts of our genomic differences. Despite current limitations in our knowledge of the locations, sizes and mutational origins of structural variants, our characterization of their population genetics is developing apace, bringing new insights into recent human adaptation, genome biology and disease. We summarize recent dramatic advances, describe the diverse mutational origins of chromosomal rearrangements and argue that their complexity necessitates a re-evaluation of existing population genetic methods.
Collapse
Affiliation(s)
- Donald F Conrad
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | | |
Collapse
|
3
|
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W, Cho EK, Dallaire S, Freeman JL, Gonzalez JR, Gratacos M, Huang J, Kalaitzopoulos D, Komura D, MacDonald JR, Marshall CR, Mei R, Montgomery L, Nishimura K, Okamura K, Shen F, Somerville MJ, Tchinda J, Valsesia A, Woodwark C, Yang F, Zhang J, Zerjal T, Zhang J, Armengol L, Conrad DF, Estivill X, Tyler-Smith C, Carter NP, Aburatani H, Lee C, Jones KW, Scherer SW, Hurles ME. Global variation in copy number in the human genome. Nature 2006; 444:444-54. [PMID: 17122850 PMCID: PMC2669898 DOI: 10.1038/nature05329] [Citation(s) in RCA: 3010] [Impact Index Per Article: 158.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2006] [Accepted: 10/10/2006] [Indexed: 01/08/2023]
Abstract
Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
Collapse
Affiliation(s)
- Richard Redon
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Shumpei Ishikawa
- Genome Science, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
| | | | - Lars Feuk
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
- Department of Molecular and Medical Genetics, University of Toronto
| | - George H. Perry
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
| | - T. Daniel Andrews
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Heike Fiegler
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | | | - Andrew R. Carson
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
- Department of Molecular and Medical Genetics, University of Toronto
| | | | - Eun Kyung Cho
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
| | - Stephanie Dallaire
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
| | - Jennifer L. Freeman
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
| | - Juan R. Gonzalez
- Genes and Disease Program, and Barcelona CeGen Unit, Center for Genomic Regulation, Barcelona, Catalonia, Spain
| | - Monica Gratacos
- Genes and Disease Program, and Barcelona CeGen Unit, Center for Genomic Regulation, Barcelona, Catalonia, Spain
| | | | - Dimitrios Kalaitzopoulos
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Daisuke Komura
- Dependable and High Performance Computing, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
| | - Jeffrey R. MacDonald
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
| | - Christian R. Marshall
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
- Department of Molecular and Medical Genetics, University of Toronto
| | - Rui Mei
- Affymetrix, Inc., Santa Clara, CA, USA
| | - Lyndal Montgomery
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Kunihiro Nishimura
- Genome Science, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
| | - Kohji Okamura
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
- Department of Molecular and Medical Genetics, University of Toronto
| | - Fan Shen
- Affymetrix, Inc., Santa Clara, CA, USA
| | - Martin J. Somerville
- Departments of Medical Genetics and Pediatrics, University of Alberta, Edmonton, Canada
| | - Joelle Tchinda
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
| | - Armand Valsesia
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Cara Woodwark
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Fengtang Yang
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Junjun Zhang
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
| | - Tatiana Zerjal
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | | | - Lluis Armengol
- Genes and Disease Program, and Barcelona CeGen Unit, Center for Genomic Regulation, Barcelona, Catalonia, Spain
| | - Donald F. Conrad
- Department of Human Genetics, University of Chicago, 920 East 58th Street, Chicago, Illinois, USA
| | - Xavier Estivill
- Genes and Disease Program, and Barcelona CeGen Unit, Center for Genomic Regulation, Barcelona, Catalonia, Spain
- Department of Life and Health Sciences, Pompeu Fabra University, Barcelona, Catalonia, Spain
| | - Chris Tyler-Smith
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Nigel P. Carter
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Hiroyuki Aburatani
- Genome Science, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
- Japan Science and Technology Agency Kawaguchi, Saitama, 332-0012, Japan
| | - Charles Lee
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
- Harvard Medical School, Boston, MA, USA 02115
| | | | - Stephen W. Scherer
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
- Department of Molecular and Medical Genetics, University of Toronto
| | - Matthew E. Hurles
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
4
|
Barber JCK, Zhang S, Friend N, Collins AL, Maloney VK, Hastings R, Farren B, Barnicoat A, Polityko AD, Rumyantseva NV, Starke H, Ye S. Duplications of proximal 16q flanked by heterochromatin are not euchromatic variants and show no evidence of heterochromatic position effect. Cytogenet Genome Res 2006; 114:351-8. [PMID: 16954678 DOI: 10.1159/000094225] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2006] [Accepted: 03/22/2006] [Indexed: 11/19/2022] Open
Abstract
Extra euchromatic material was found within the major heterochromatic block of chromosome 16 (16qh) in one de novo case and seven members of two families. In contrast to the euchromatic variants of chromosome 9 (9qh), which are derived from pericentromeric euchromatin, molecular cytogenetics confirmed that these duplications were of 16q11.2-->q12.2 in the de novo case, of 16q11.2-->q13 in three members of family 1 and 16q11.2-->q12.1 in four members of family 2. The duplication had arisen as a post-zygotic mitotic event in the mother of family 1 and been transmitted paternally in family 2. An insertional mechanism of origin is proposed for the duplications in case 1 and family 1. Expression at the 16q13 matrix metalloproteinase-2 (MMP2)locus in families 1 and 2 was proportional to genomic copy number and not therefore consistent with position effect silencing due to the flanking blocks of heterochromatin. We conclude that proximal 16q duplications within 16qh are not novel euchromatic variants but associated with a variable phenotype including developmental delay, speech delay, learning difficulties and behavioural problems. The behavioural problems in families ascertained through affected children are much less severe than those encountered in previous patients ascertained as adults.
Collapse
Affiliation(s)
- J C K Barber
- Wessex Regional Genetics Laboratory, Salisbury District Hospital, Salisbury, UK.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Linardopoulou EV, Williams EM, Fan Y, Friedman C, Young JM, Trask BJ. Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication. Nature 2005; 437:94-100. [PMID: 16136133 PMCID: PMC1368961 DOI: 10.1038/nature04029] [Citation(s) in RCA: 290] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2005] [Accepted: 07/05/2005] [Indexed: 01/09/2023]
Abstract
Human subtelomeres are polymorphic patchworks of interchromosomal segmental duplications at the ends of chromosomes. Here we provide evidence that these patchworks arose recently through repeated translocations between chromosome ends. We assess the relative contribution of the principal mechanisms of ectopic DNA repair to the formation of subtelomeric duplications and find that non-homologous end-joining predominates. Once subtelomeric duplications arise, they are prone to homology-based sequence transfers as shown by the incongruent phylogenetic relationships of neighbouring sections. Interchromosomal recombination of subtelomeres is a potent force for recent change. Cytogenetic and sequence analyses reveal that pieces of the subtelomeric patchwork have changed location and copy number with unprecedented frequency during primate evolution. Half of the known subtelomeric sequence has formed recently, through human-specific sequence transfers and duplications. Subtelomeric dynamics result in a gene duplication rate significantly higher than the genome average and could have both advantageous and pathological consequences in human biology. More generally, our analyses suggest an evolutionary cycle between segmental polymorphisms and genome rearrangements.
Collapse
Affiliation(s)
- Elena V. Linardopoulou
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
- Department of Bioengineering, University of Washington, Box 357962, Seattle, Washington 98195-7962, USA
| | - Eleanor M. Williams
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
| | - Yuxin Fan
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
| | - Cynthia Friedman
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
| | - Janet M. Young
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
| | - Barbara J. Trask
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North C3-168, Seattle, Washington 98109, USA
- Department of Bioengineering, University of Washington, Box 357962, Seattle, Washington 98195-7962, USA
- Department of Genome Sciences, University of Washington, Box 357730, Seattle, Washington 98195-7730, USA
- Correspondence and requests for materials should be addressed to B.J.T. (e-mail:
)
| |
Collapse
|
6
|
Bailey JA, Yavor AM, Massa HF, Trask BJ, Eichler EE. Segmental duplications: organization and impact within the current human genome project assembly. Genome Res 2001; 11:1005-17. [PMID: 11381028 PMCID: PMC311093 DOI: 10.1101/gr.gr-1871r] [Citation(s) in RCA: 471] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Segmental duplications play fundamental roles in both genomic disease and gene evolution. To understand their organization within the human genome, we have developed the computational tools and methods necessary to detect identity between long stretches of genomic sequence despite the presence of high copy repeats and large insertion-deletions. Here we present our analysis of the most recent genome assembly (January 2001) in which we focus on the global organization of these segments and the role they play in the whole-genome assembly process. Initially, we considered only large recent duplication events that fell well-below levels of draft sequencing error (alignments 90%-98% similar and > or =1 kb in length). Duplications (90%-98%; > or =1 kb) comprise 3.6% of all human sequence. These duplications show clustering and up to 10-fold enrichment within pericentromeric and subtelomeric regions. In terms of assembly, duplicated sequences were found to be over-represented in unordered and unassigned contigs indicating that duplicated sequences are difficult to assign to their proper position. To assess coverage of these regions within the genome, we selected BACs containing interchromosomal duplications and characterized their duplication pattern by FISH. Only 47% (106/224) of chromosomes positive by FISH had a corresponding chromosomal position by comparison. We present data that indicate that this is attributable to misassembly, misassignment, and/or decreased sequencing coverage within duplicated regions. Surprisingly, if we consider putative duplications >98% identity, we identify 10.6% (286 Mb) of the current assembly as paralogous. The majority of these alignments, we believe, represent unmerged overlaps within unique regions. Taken together the above data indicate that segmental duplications represent a significant impediment to accurate human genome assembly, requiring the development of specialized techniques to finish these exceptional regions of the genome. The identification and characterization of these highly duplicated regions represents an important step in the complete sequencing of a human reference genome.
Collapse
Affiliation(s)
- J A Bailey
- Department of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA
| | | | | | | | | |
Collapse
|
7
|
Bailey JA, Yavor AM, Massa HF, Trask BJ, Eichler EE. Segmental Duplications: Organization and Impact Within the Current Human Genome Project Assembly. Genome Res 2001. [DOI: 10.1101/gr.187101] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Segmental duplications play fundamental roles in both genomic disease and gene evolution. To understand their organization within the human genome, we have developed the computational tools and methods necessary to detect identity between long stretches of genomic sequence despite the presence of high copy repeats and large insertion-deletions. Here we present our analysis of the most recent genome assembly (January 2001) in which we focus on the global organization of these segments and the role they play in the whole-genome assembly process. Initially, we considered only large recent duplication events that fell well-below levels of draft sequencing error (alignments 90%–98% similar and ≥1 kb in length). Duplications (90%–98%; ≥1 kb) comprise 3.6% of all human sequence. These duplications show clustering and up to 10-fold enrichment within pericentromeric and subtelomeric regions. In terms of assembly, duplicated sequences were found to be over-represented in unordered and unassigned contigs indicating that duplicated sequences are difficult to assign to their proper position. To assess coverage of these regions within the genome, we selected BACs containing interchromosomal duplications and characterized their duplication pattern by FISH. Only 47% (106/224) of chromosomes positive by FISH had a corresponding chromosomal position by BLAST comparison. We present data that indicate that this is attributable to misassembly, misassignment, and/or decreased sequencing coverage within duplicated regions. Surprisingly, if we consider putative duplications >98% identity, we identify 10.6% (286 Mb) of the current assembly as paralogous. The majority of these alignments, we believe, represent unmerged overlaps within unique regions. Taken together the above data indicate that segmental duplications represent a significant impediment to accurate human genome assembly, requiring the development of specialized techniques to finish these exceptional regions of the genome. The identification and characterization of these highly duplicated regions represents an important step in the complete sequencing of a human reference genome.
Collapse
|
8
|
Varley H, Di S, Scherer SW, Royle NJ. Characterization of terminal deletions at 7q32 and 22q13.3 healed by De novo telomere addition. Am J Hum Genet 2000; 67:610-22. [PMID: 10924407 PMCID: PMC1287521 DOI: 10.1086/303050] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2000] [Accepted: 07/11/2000] [Indexed: 11/03/2022] Open
Abstract
We have developed a strategy for the isolation of terminal deletion breakpoints from any chromosome that has been healed by de novo addition of a telomere repeat array. Breakpoints at 7q32 and 22q13.3 have been isolated and characterized in two patients (patients FB336R and AJ). Both truncated chromosomes have been healed by the addition of a novel telomere, with such an addition possibly mediated by the enzyme telomerase. The breakpoint at 7q32 in patient FB336R shows a structure similar to that of breakpoints on other chromosomes that have been healed in this way. However, the breakpoint at 22q13.3 in patient AJ has 10 nucleotides of unknown origin inserted between the sequence unique to chromosome 22q and the start of the telomere repeat array. This unusual structure is suggestive of a multistep healing event resulting in de novo telomere addition at this breakpoint, and possible mechanisms are discussed.
Collapse
Affiliation(s)
- Helen Varley
- Department of Genetics, University of Leicester, Leicester, United Kingdom; Cancer Research Laboratory, Cell Works Incorporated, Baltimore; and Department of Genetics, The Hospital for Sick Children, Toronto
| | - Shaojie Di
- Department of Genetics, University of Leicester, Leicester, United Kingdom; Cancer Research Laboratory, Cell Works Incorporated, Baltimore; and Department of Genetics, The Hospital for Sick Children, Toronto
| | - Stephen W. Scherer
- Department of Genetics, University of Leicester, Leicester, United Kingdom; Cancer Research Laboratory, Cell Works Incorporated, Baltimore; and Department of Genetics, The Hospital for Sick Children, Toronto
| | - Nicola J. Royle
- Department of Genetics, University of Leicester, Leicester, United Kingdom; Cancer Research Laboratory, Cell Works Incorporated, Baltimore; and Department of Genetics, The Hospital for Sick Children, Toronto
| |
Collapse
|
9
|
Ji Y, Eichler EE, Schwartz S, Nicholls RD. Structure of chromosomal duplicons and their role in mediating human genomic disorders. Genome Res 2000; 10:597-610. [PMID: 10810082 DOI: 10.1101/gr.10.5.597] [Citation(s) in RCA: 168] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Chromosome-specific low-copy repeats, or duplicons, occur in multiple regions of the human genome. Homologous recombination between different duplicon copies leads to chromosomal rearrangements, such as deletions, duplications, inversions, and inverted duplications, depending on the orientation of the recombining duplicons. When such rearrangements cause dosage imbalance of a developmentally important gene(s), genetic diseases now termed genomic disorders result, at a frequency of 0.7-1/1000 births. Duplicons can have simple or very complex structures, with variation in copy number from 2 to >10 repeats, and each varying in size from a few kilobases in length to hundreds of kilobases. Analysis of the different duplicons involved in human genomic disorders identifies features that may predispose to recombination, including large size and high sequence identity between the recombining copies, putative recombination promoting features, and the presence of multiple genes/pseudogenes that may include genes expressed in germ cells. Most of the chromosome rearrangements involve duplicons near pericentromeric regions, which may relate to the propensity of such regions to accumulate duplicons. Detailed analyses of the structure, polymorphic variation, and mechanisms of recombination in genomic disorders, as well as the evolutionary origin of various duplicons will further our understanding of the structure, function, and fluidity of the human genome.
Collapse
Affiliation(s)
- Y Ji
- Department of Genetics, Case Western Reserve University School of Medicine, and Center for Human Genetics, University Hospitals of Cleveland, Cleveland, Ohio 44106 USA
| | | | | | | |
Collapse
|
10
|
Eichler EE, Archidiacono N, Rocchi M. CAGGG repeats and the pericentromeric duplication of the hominoid genome. Genome Res 1999; 9:1048-58. [PMID: 10568745 DOI: 10.1101/gr.9.11.1048] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Gene duplication is one of the primary forces of evolutionary change. We present data from three different pericentromeric regions of human chromosomes, which indicate that such regions of the genome have been sites of recent genomic duplication. This form of duplication has involved the evolutionary movement of segments of genomic material, including both intronic and exonic sequence, from diverse regions of the genome toward the pericentromeric regions. Sequence analyses of the target sites of duplication have identified a novel class of interspersed GC-rich repeats located precisely at the boundaries of duplication. Estimates of the evolutionary age of these duplications indicate that they have occurred between 10 and 25 mya. In contrast, comparative analyses confirm that the GC-rich pericentromeric repeats have existed within the pericentromeric regions of primate chromosomes before the divergence of the cercopithecoid and hominoid lineages ( approximately 30 mya). These data provide molecular evidence for considerable interchromosomal duplication of genic segments during the evolution of the hominoid genome and strongly implicate GC-rich repeat elements as playing a direct role in the pericentromeric localization of these events
Collapse
MESH Headings
- Animals
- Base Sequence
- Centromere
- Chromosomes, Human, Pair 1
- Chromosomes, Human, Pair 10
- Chromosomes, Human, Pair 14
- Chromosomes, Human, Pair 15
- Chromosomes, Human, Pair 16
- Chromosomes, Human, Pair 2
- Chromosomes, Human, Pair 22
- Chromosomes, Human, Pair 9
- Evolution, Molecular
- Gene Duplication
- Hominidae/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Microsatellite Repeats/genetics
- Molecular Sequence Data
- Sequence Alignment
- X Chromosome
Collapse
Affiliation(s)
- E E Eichler
- Department of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA.
| | | | | |
Collapse
|
11
|
Abstract
Comparative gene mapping and chromosome painting permit the tentative reconstruction of ancestral karyotypes. The modern human karyotype is proposed to differ from that of the most recent common ancestor of catarrhine primates by two major rearrangements. The first was the fission of an ancestral chromosome to produce the homologues of human chromosomes 14 and 15. This fission occurred before the divergence of gibbons from humans and other apes. The second was the fusion of two ancestral chromosomes to form human chromosome 2. This fusion occurred after the divergence of humans and chimpanzees. Moving further back in time, homologues of human chromosomes 3 and 21 were formed by the fission of an ancestral linkage group that combined loci of both human chromosomes, whereas homologues of human chromosomes 12 and 22 were formed by a reciprocal translocation between two ancestral chromosomes. Both events occurred at some time after our most recent common ancestor with lemurs. Less direct evidence suggests that the short and long arms of human chromosomes 8, 16 and 19 were unlinked in this ancestor. Finally, the most recent common ancestor of primates and artiodactyls is proposed to have possessed a chromosome that combined loci from human chromosomes 4 and 8p, a chromosome that combined loci from human chromosomes 16q and 19q, and a chromosome that combined loci from human chromosomes 2p and 20.
Collapse
Affiliation(s)
- D Haig
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
12
|
Amarger V, Gauguier D, Yerle M, Apiou F, Pinton P, Giraudeau F, Monfouilloux S, Lathrop M, Dutrillaux B, Buard J, Vergnaud G. Analysis of distribution in the human, pig, and rat genomes points toward a general subtelomeric origin of minisatellite structures. Genomics 1998; 52:62-71. [PMID: 9740672 DOI: 10.1006/geno.1998.5365] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
We have developed approaches for the cloning of minisatellites from total genomic libraries and applied these approaches to the human, rat, and pig genomes. The chromosomal distribution of minisatellites in the three genomes is strikingly different, with clustering at chromosome ends in human, a seemingly almost even distribution in rat, and an intermediate situation in pig. A closer analysis, however, reveals that interstitial sites in pig and rat often correspond to terminal cytogenetic bands in human. This observation suggests that minisatellites are created toward chromosome ends and their internalization represents secondary events resulting from rearrangements involving chromosome ends.
Collapse
Affiliation(s)
- V Amarger
- Laboratoire de Recherche en Génétique des Espèces, Institut de Biologie des Hôpitaux de Nantes, 9, Quai Moncousu, Nantes Cedex, 44035, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Affiliation(s)
- E E Eichler
- Department of Genetics, School of Medicine, Case Western Reserve University, Cleveland, Ohio 44106, USA.
| |
Collapse
|
14
|
Rozmahel R, Heng HH, Duncan AM, Shi XM, Rommens JM, Tsui LC. Amplification of CFTR exon 9 sequences to multiple locations in the human genome. Genomics 1997; 45:554-61. [PMID: 9367680 DOI: 10.1006/geno.1997.4968] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Cloning and characterization of the cystic fibrosis transmembrane conductance regulator (CFTR) gene led to the identification and isolation of cDNA and genomic sequences that cross-hybridized to the first nucleotide binding fold of CFTR. DNA sequence analysis of these clones showed that the cross-hybridizing sequences corresponded to CFTR exon 9 and its flanking introns, juxtapositioned with two segments of LINE1 sequences. The CFTR sequence appeared to have been transcribed from the opposite direction of the gene, reversely transcribed, and co-integrated with the L1 sequences into a chromosome location distinct from that of the CFTR locus. Based on hybridization intensity and complexity of the restriction fragments, it was estimated that there were at least 10 copies of the "amplified" CFTR exon 9 sequences in the human genome. Furthermore, when DNA segments adjacent to the insertion site were used in genomic DNA blot hybridization analysis, multiple copies were also detected. The overall similarity between these CFTR exon 9-related sequences suggested that they were derived from a single retrotransposition event and subsequent sequence amplification. The amplification unit appeared to be greater than 30 kb. Physical mapping studies including in situ hybridization to human metaphase chromosomes showed that multiple copies of these amplified sequences (with and without the CFTR exon 9 insertion) were dispersed throughout the genome. These findings provide insight into the structure and evolution of the human genome.
Collapse
Affiliation(s)
- R Rozmahel
- Department of Molecular and Medical Genetics, The University of Toronto, Toronto, Ontario, M5S 1A7
| | | | | | | | | | | |
Collapse
|
15
|
Armour JA, Crosier M, Malcolm S, Chan JC, Jeffreys AJ. Human minisatellite loci composed of interspersed GGA-GGT triplet repeats. Proc Biol Sci 1995; 261:345-9. [PMID: 8587876 DOI: 10.1098/rspb.1995.0157] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
We have isolated two tandemly repeated loci from human DNA which contain long blocks of GGA and GGT trinucleotide repeats. These two repeat unit types, together with other less common variants, are apparently irregularly interspersed along each repeat array. Genotyping methods have been developed for these highly polymorphic loci, including typing by polymerase chain reaction followed by Southern blot hybridization. Linkage analysis in Centre d'Etude du Polymorphisme Humain (CEPH) pedigrees has been used to map the loci to chromosomes 15 and 22. In normal individuals, alleles at these loci can contain thousands of repeats, greatly exceeding repeat copy number at most trinucleotide and other simple repeat loci. No evidence for longer, higher-order repeats was observed among the limited number of repeats sequenced. These loci may represent a transitional state between simple repeat loci and some minisatellites.
Collapse
Affiliation(s)
- J A Armour
- Department of Genetics, University of Leicester, U.K
| | | | | | | | | |
Collapse
|
16
|
Kusunoki Y, Hayashi T, Hirai Y, Kushiro J, Tatsumi K, Kurihara T, Zghal M, Kamoun MR, Takebe H, Jeffreys A. Increased rate of spontaneous mitotic recombination in T lymphocytes from a Bloom's syndrome patient using a flow-cytometric assay at HLA-A locus. Jpn J Cancer Res 1994; 85:610-8. [PMID: 8063614 PMCID: PMC5919530 DOI: 10.1111/j.1349-7006.1994.tb02403.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Bloom's syndrome (BS) is an autosomal recessive disorder conferring high propensity for cancer and displaying a high degree of genetic instability; the frequency of sister chromatid exchange is characteristically 10 times above background. The symmetrical four-armed chromatid interchanges are much more readily detected in peripheral blood lymphocytes of BS patients, suggesting that the frequency of somatic recombination is also increased. In the present study, the rate of spontaneous loss of HLA-A allele expression was estimated following fluctuation analysis in cultured T lymphocytes using a flow-cytometric assay. It was found to be 10 times or more higher than normal in lymphocytes from a BS patient. Molecular and chromosome analyses showed that all 13 independent variants from the patient were most likely derived from somatic recombinations. Further tests for loss of heterozygosity at a closely linked proximal locus, HLA-DQA1, showed that as many as half of the recombinants retained heterozygosity irrespective of the donor. The results suggest that the HLA region is hyperrecombinogenic in somatic cells and that the elevated recombination rate in BS cells results from the general increase at ordinary sites and not from random creation of unusual sites for recombination.
Collapse
Affiliation(s)
- Y Kusunoki
- Department of Radiobiology, Radiation Effects Research Foundation, Hiroshima
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Wauters JG, Bossuyt PJ, Roelen L, van Roy B, Dumon J. Application of fluorescence in situ hybridization for early prenatal diagnosis of partial trisomy 6p/monosomy 6q due to a familial pericentric inversion. Clin Genet 1993; 44:262-9. [PMID: 8313624 DOI: 10.1111/j.1399-0004.1993.tb03894.x] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
We report the prenatal diagnosis of a karyotype 46,XY,rec(6)dup p, inv(6) (p23q27) mat detected by fluorescence in situ hybridization using chromosome 6pter and 6qter specific DNA markers. This partial duplication-deletion (6p12-->pter; 6q27-->qter) emanated from a balanced pericentric inversion 46,XX inv(6) (p23q27)pat present in the mother. The phenotypes of two relatives with the same unbalanced anomaly are described. This report illustrates the sensitivity and specificity of fluorescence in situ hybridization (FISH) and its benefit in rapid and unequivocal prenatal diagnosis of subtle chromosomal rearrangements.
Collapse
Affiliation(s)
- J G Wauters
- Department of Medical Genetics, University of Antwerp-U.I.A., Belgium
| | | | | | | | | |
Collapse
|
18
|
Kushiro J, Hirai Y, Kusunoki Y, Kyoizumi S, Kodama Y, Wakisaka A, Jeffreys A, Cologne JB, Dohi K, Nakamura N. Development of a flow-cytometric HLA-A locus mutation assay for human peripheral blood lymphocytes. Mutat Res 1992; 272:17-29. [PMID: 1380116 DOI: 10.1016/0165-1161(92)90005-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
A flow-cytometric technique was developed to measure the frequency of variant lymphocytes lacking expression of HLA-A2 or A24 allele products among donors heterozygous for HLA-A2 or A24. It was found that the variant frequency of lymphocytes in peripheral blood was of the order of 10(-4) and increased with donor age. Molecular analyses of mutant clones revealed that about one-third were derived from somatic recombinations and that the remaining two-thirds did not show any alterations after Southern blotting analysis. In contrast, mutants obtained after in vitro X-ray mutagenesis study were found to be mostly derived from large chromosomal deletions. A small-scale study on atomic bomb survivors did not show a significant dose effect.
Collapse
Affiliation(s)
- J Kushiro
- Department of Radiobiology, Radiation Effects Research Foundation, Hiroshima, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Callen DF, Doggett NA, Stallings RL, Chen LZ, Whitmore SA, Lane SA, Nancarrow JK, Apostolou S, Thompson AD, Lapsys NM. High-resolution cytogenetic-based physical map of human chromosome 16. Genomics 1992; 13:1178-85. [PMID: 1505951 DOI: 10.1016/0888-7543(92)90035-q] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
A panel of 54 mouse/human somatic cell hybrids, each possessing various portions of chromosome 16, was constructed; 46 were constructed from naturally occurring rearrangements of this chromosome, which were ascertained in clinical cytogenetics laboratories, and a further 8 from rearrangements spontaneously arising during tissue culture. By mapping 235 DNA markers to this panel of hybrids, and in relation to four fragile sites and the centromere, a cytogenetic-based physical map of chromosome 16 with an average resolution of 1.6 Mb was generated. Included are 66 DNA markers that have been typed in the CEPH pedigrees, and these will allow the construction of a detailed correlation of the cytogenetic-based physical map and the genetic map of this chromosome. Cosmids from chromosome 16 that have been assembled into contigs by use of repetitive sequence fingerprinting have been mapped to the hybrid panel. Approximately 11% of the euchromatin is now both represented in such contigs and located on the cytogenetic-based physical map. This high-resolution cytogenetic-based physical map of chromosome 16 will provide the basis for the cloning of genetically mapped disease genes, genes disrupted in cytogenetic rearrangements that have produced abnormal phenotypes, and cancer breakpoints.
Collapse
Affiliation(s)
- D F Callen
- Department of Cytogenetics and Molecular Genetics, Adelaide Children's Hospital, Australia
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Fields CA, Grady DL, Moyzis RK. The human THE-LTR(O) and MstII interspersed repeats are subfamilies of a single widely distributed highly variable repeat family. Genomics 1992; 13:431-6. [PMID: 1319393 DOI: 10.1016/0888-7543(92)90264-s] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Fifteen examples of the transposon-like human element (THE) LTR and thirteen examples of the MstII interspersed repeat are aligned to generate new consensus sequences for these human repetitive elements. The consensus sequences of these elements are very similar, indicating that they compose subfamilies of a single human interspersed repetitive sequence family. Members of this highly polymorphic repeat family have been mapped to at least 11 chromosomes. Seven examples of the THE internal sequence are also aligned to generate a new consensus sequence for this element. Estimates of the abundance of this repetitive sequence family, derived from both hybridization analysis and frequency of occurrence in GenBank, indicate that THE-LTR/MstII sequences are present every 100-3000 kb in human DNA. The widespread occurrence of members of this family makes them useful landmarks, like Alu, L1, and (GT)n repeats, for physical and genetic mapping of human DNA.
Collapse
Affiliation(s)
- C A Fields
- Computing Research Laboratory, New Mexico State University, Las Cruces 88003-0001
| | | | | |
Collapse
|
21
|
Fedorov A, Suboch G, Bujakov M, Fedorova L. Analysis of nonuniformity in intron phase distribution. Nucleic Acids Res 1992; 20:2553-7. [PMID: 1598214 PMCID: PMC312392 DOI: 10.1093/nar/20.10.2553] [Citation(s) in RCA: 57] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling.
Collapse
Affiliation(s)
- A Fedorov
- Department of Molecular Basis of Human Genetics, Academy of Sciences of Russia, Moscow
| | | | | | | |
Collapse
|
22
|
Abstract
We present evidence that a proportion of alleles at two human minisatellite loci is undetected by standard Southern blot hybridization. In each case the missing allele(s) can be identified after PCR amplification and correspond to tandem arrays too short to detect by hybridization. At one locus, there is only one undetected allele (population frequency 0.3), which contains just three repeat units. At the second locus, there are at least five undetected alleles (total population frequency 0.9) containing 60-120 repeats; they are not detected because these tandem repeats give very poor signals when used as a probe in standard Southern blot hybridization, and also cross-hybridize with other sequences in the genome. Under these circumstances only signals from the longest tandemly repeated alleles are detectable above the nonspecific background. The structures of these loci have been compared in human and primate DNA, and at one locus the short human allele containing three repeat units is shown to be an intermediate state in the expansion of a monomeric precursor allele in primates to high copy number in the longer human arrays. We discuss the implications of such loci for studies of human populations, minisatellite isolation by cloning, and the evolution of highly variable tandem arrays.
Collapse
Affiliation(s)
- J A Armour
- Department of Genetics, University of Leicester, United Kingdom
| | | | | |
Collapse
|
23
|
Rogers J, Ruano G, Kidd KK. Variability in nuclear DNA among nonhuman primates: Application of molecular genetic techniques to intra- and inter-species genetic analyses. Am J Primatol 1992; 27:93-105. [DOI: 10.1002/ajp.1350270205] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/1990] [Revised: 03/29/1991] [Indexed: 11/06/2022]
|
24
|
Vergnaud G, Mariat D, Zoroastro M, Lauthier V. Detection of single and multiple polymorphic loci by synthetic tandem repeats of short oligonucleotides. Electrophoresis 1991; 12:134-40. [PMID: 2040260 DOI: 10.1002/elps.1150120207] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Loci containing tandem repeats of short sequences are sometimes associated with a high level of polymorphism due to variations in the number of repeats. The different variants can be easily characterized by Southern blotting when the repeats span a range from a few hundred bases to a few kilobases, and probes derived from such tandem repeats constitute convenient genetic markers. These structures, usually called minisatellites, are best documented in the human genome, where their number has been estimated to be at least 1500. However, their role and mode of evolution are poorly understood. We are developing tools to evaluate the number of such redundant sequences in a genome and to gain access to new polymorphic loci. Our strategy is based on the use of polymers of oligonucleotides as DNA probes for hybridization on Southern blots. In a previous report, we made polymers with random units of 14 bp and showed that they detect multiple polymorphic loci on human genomic DNA. At present, we are testing the effect of an increase in the complexity of the polymer, as obtained by the use of a longer random unit, and the effect of slight sequence modifications to a particular tandem repeat sequence. In addition, some of these synthetic probes can detect a single polymorphic locus and directly provide new genetic markers.
Collapse
Affiliation(s)
- G Vergnaud
- Centre d'Etudes du Bouchet, Vert le Petit, France
| | | | | | | |
Collapse
|
25
|
Decorte R, Cassiman JJ. Detection of amplified VNTR alleles by direct chemiluminescence: application to the genetic identification of biological samples in forensic cases. EXS 1991; 58:371-90. [PMID: 1831164 DOI: 10.1007/978-3-0348-7312-3_28] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Minisatellite or variable number of tandem repeat (VNTR) regions contain such a high degree of polymorphism that they allow one to construct an individual-specific DNA "fingerprint". Analysis of these sequences by Southern blot however, consumes much DNA and is not applicable to degraded DNA samples often recovered from body-fluid stains found at crime scenes. The polymerase chain reaction (PCR) technique may overcome these problems. With oligonucleotide primers flanking the repeat region, amplification of the VNTR alleles followed by direct visualization on ethidium bromide-stained agarose gels is possible. In those cases were the PCR yield is too low for direct visualization, the product can be blotted to a nylon membrane and hybridized with a labelled internal probe. Alternatively, the PCR product can be biotinylated during amplification and visualized by direct chemiluminescence after Southern transfer. The remarkable sensitivity of the PCR technique has allowed the detection of genetic polymorphisms in single cells, hair roots and single sperm. A drawback of this very high sensitivity however is that special precautions have to be taken to prevent accidental contamination resulting in erroneous interpretation of the results.
Collapse
Affiliation(s)
- R Decorte
- Center for Human Genetics, University of Leuven, Belgium
| | | |
Collapse
|
26
|
Jeffreys AJ, Royle NJ, Patel I, Armour JA, MacLeod A, Collick A, Gray IC, Neumann R, Gibbs M, Crosier M. Principles and recent advances in human DNA fingerprinting. EXS 1991; 58:1-19. [PMID: 1831152 DOI: 10.1007/978-3-0348-7312-3_1] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Since 1985, DNA typing systems have played an increasingly important role in many aspects of human genetics, most notably in forensic and legal medicine. This article reviews the development of multilocus and single locus minisatellite DNA probes, and more recently the use of PCR to amplify hypervariable DNA loci, as well as discussing the biological properties of the unstable regions of DNA which form the basis of almost all DNA fingerprinting systems.
Collapse
Affiliation(s)
- A J Jeffreys
- Department of Genetics, University of Leicester, Great Britain
| | | | | | | | | | | | | | | | | | | |
Collapse
|