1
|
Dawkins RL, Lloyd SS. Commentary: Conserved polymorphic sequences protect themselves for future challenges. Front Genet 2022; 13:993944. [DOI: 10.3389/fgene.2022.993944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 10/31/2022] [Indexed: 11/13/2022] Open
|
2
|
Kulski JK, Suzuki S, Shiina T. SNP-Density Crossover Maps of Polymorphic Transposable Elements and HLA Genes Within MHC Class I Haplotype Blocks and Junction. Front Genet 2021; 11:594318. [PMID: 33537058 PMCID: PMC7848197 DOI: 10.3389/fgene.2020.594318] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022] Open
Abstract
The genomic region (~4 Mb) of the human major histocompatibility complex (MHC) on chromosome 6p21 is a prime model for the study and understanding of conserved polymorphic sequences (CPSs) and structural diversity of ancestral haplotypes (AHs)/conserved extended haplotypes (CEHs). The aim of this study was to use a set of 95 MHC genomic sequences downloaded from a publicly available BioProject database at NCBI to identify and characterise polymorphic human leukocyte antigen (HLA) class I genes and pseudogenes, MICA and MICB, and retroelement indels as haplotypic lineage markers, and single-nucleotide polymorphism (SNP) crossover loci in DNA sequence alignments of different haplotypes across the Olfactory Receptor (OR) gene region (~1.2 Mb) and the MHC class I region (~1.8 Mb) from the GPX5 to the MICB gene. Our comparative sequence analyses confirmed the identity of 12 haplotypic retroelement markers and revealed that they partitioned the HLA-A/B/C haplotypes into distinct evolutionary lineages. Crossovers between SNP-poor and SNP-rich regions defined the sequence range of haplotype blocks, and many of these crossover junctions occurred within particular transposable elements, lncRNA, OR12D2, MUC21, MUC22, PSORS1A3, HLA-C, HLA-B, and MICA. In a comparison of more than 250 paired sequence alignments, at least 38 SNP-density crossover sites were mapped across various regions from GPX5 to MICB. In a homology comparison of 16 different haplotypes, seven CEH/AH (7.1, 8.1, 18.2, 51.x, 57.1, 62.x, and 62.1) had no detectable SNP-density crossover junctions and were SNP poor across the entire ~2.8 Mb of sequence alignments. Of the analyses between different recombinant haplotypes, more than half of them had SNP crossovers within 10 kb of LTR16B/ERV3-16A3_I, MLT1, Charlie, and/or THE1 sequences and were in close vicinity to structurally polymorphic Alu and SVA insertion sites. These studies demonstrate that (1) SNP-density crossovers are associated with putative ancestral recombination sites that are widely spread across the MHC class I genomic region from at least the telomeric OR12D2 gene to the centromeric MICB gene and (2) the genomic sequences of MHC homozygous cell lines are useful for analysing haplotype blocks, ancestral haplotypic landscapes and markers, CPSs, and SNP-density crossover junctions.
Collapse
Affiliation(s)
- Jerzy K. Kulski
- Faculty of Health and Medical Sciences, Medical School, The University of Western Australia, Crawley, WA, Australia
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
3
|
Genomic evolution and polymorphism: Segmental duplications and haplotypes at 108 regions on 21 chromosomes. Genomics 2013; 102:15-26. [DOI: 10.1016/j.ygeno.2013.02.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2012] [Revised: 02/13/2013] [Accepted: 02/27/2013] [Indexed: 11/21/2022]
|
4
|
ADDISON JASONA. TECHNICAL ARTICLE: Simultaneous cloning of multiple nuclear genes by pooling PCR products of variable size: a cost-effective method of improving efficiency in large-scale genetic analyses. ACTA ACUST UNITED AC 2006. [DOI: 10.1111/j.1471-8286.2006.01660.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
5
|
Torres AR, Sweeten TL, Cutler A, Bedke BJ, Fillmore M, Stubbs EG, Odell D. The association and linkage of the HLA-A2 class I allele with autism. Hum Immunol 2006; 67:346-51. [PMID: 16720216 DOI: 10.1016/j.humimm.2006.01.001] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2005] [Revised: 01/02/2006] [Accepted: 01/04/2006] [Indexed: 11/21/2022]
Abstract
Previous research has revealed associations between autism and immune genes located in the human leukocyte antigen (HLA). To better understand which HLA genetic loci may be associated with autism, we compared the class I HLA-A and -B alleles in autistic probands with case control subjects from Caucasian families. The frequency of HLA-A2 alleles was significantly increased in autistic subjects compared with normal allelic frequencies from the National Marrow Donors Program (NMDP) (p = 0.0043 after allelic correction). The transmission disequilibrium test for the A2 allele revealed an increased frequency of inheritance for autistic children (p = 0.033). There were no significant associations of autism with HLA-B alleles; however, the A2-B44 and A2-B51 haplotypes were two times more frequent in autistic subjects. The association and linkage of the class I HLA-A2 allele with autism suggests its involvement in the etiology of autism. Possible roles are discussed for the HLA-A2 association in the presentation of microbial antigen within the central nervous system and/or in the establishment of synaptic and neuronal circuits in the developing brain.
Collapse
Affiliation(s)
- Anthony R Torres
- Center for Persons with Disabilities, Utah State University, Logan, UT 84322-6895, USA.
| | | | | | | | | | | | | |
Collapse
|
6
|
Kitcharoen K, Witt CS, Romphruk AV, Christiansen FT, Leelayuwat C. MICA, MICB, and MHC Beta Block Matching in Bone Marrow Transplantation: Relevance to Transplantation Outcome. Hum Immunol 2006; 67:238-46. [PMID: 16698448 DOI: 10.1016/j.humimm.2006.02.012] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2005] [Indexed: 11/19/2022]
Abstract
Genetic testing of the MHC is required for selection of donors for bone marrow transplantation. The outcome of related bone marrow transplantation is usually superior to that of unrelated bone marrow transplantation. This may be the result of inaccurate or incomplete genetic testing employed for selection of donor for transplantation. Based on MHC haplotype matching, MHC block matching has been developed for selection of potential unrelated donor. Block matching has been shown previously to improve outcome when added to the conventional method of human leukocyte antigen (HLA) typing for selection of donors. In this study, we have retrospectively analyzed 44 donor recipient pairs from the Australian Bone Marrow Donor Registry Repository with respect to matching of HLA-B and HLA-Cw by sequence-based typing and MICA and MICB by polymerase chain reaction-sequence specific primer and MHC beta block matching and correlated these results with survival. Beta block matching was correlated with MIC matching (p < 0.005) and with HLA-B and HLA-Cw matching. Patients who were HLA-B and -Cw matched had significantly improved survival when they were additionally matched for MHC beta block (p(c) = 0.04) or MIC (p(c) = 0.05).
Collapse
Affiliation(s)
- K Kitcharoen
- Medical Technology Program, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | | | | | | | | |
Collapse
|
7
|
Traherne JA, Horton R, Roberts AN, Miretti MM, Hurles ME, Stewart CA, Ashurst JL, Atrazhev AM, Coggill P, Palmer S, Almeida J, Sims S, Wilming LG, Rogers J, de Jong PJ, Carrington M, Elliott JF, Sawcer S, Todd JA, Trowsdale J, Beck S. Genetic analysis of completely sequenced disease-associated MHC haplotypes identifies shuffling of segments in recent human history. PLoS Genet 2006; 2:e9. [PMID: 16440057 PMCID: PMC1331980 DOI: 10.1371/journal.pgen.0020009] [Citation(s) in RCA: 145] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2005] [Accepted: 12/13/2005] [Indexed: 11/23/2022] Open
Abstract
The major histocompatibility complex (MHC) is recognised as one of the most important genetic regions in relation to common human disease. Advancement in identification of MHC genes that confer susceptibility to disease requires greater knowledge of sequence variation across the complex. Highly duplicated and polymorphic regions of the human genome such as the MHC are, however, somewhat refractory to some whole-genome analysis methods. To address this issue, we are employing a bacterial artificial chromosome (BAC) cloning strategy to sequence entire MHC haplotypes from consanguineous cell lines as part of the MHC Haplotype Project. Here we present 4.25 Mb of the human haplotype QBL (HLA-A26-B18-Cw5-DR3-DQ2) and compare it with the MHC reference haplotype and with a second haplotype, COX (HLA-A1-B8-Cw7-DR3-DQ2), that shares the same HLA-DRB1, -DQA1, and -DQB1 alleles. We have defined the complete gene, splice variant, and sequence variation contents of all three haplotypes, comprising over 259 annotated loci and over 20,000 single nucleotide polymorphisms (SNPs). Certain coding sequences vary significantly between different haplotypes, making them candidates for functional and disease-association studies. Analysis of the two DR3 haplotypes allowed delineation of the shared sequence between two HLA class II-related haplotypes differing in disease associations and the identification of at least one of the sites that mediated the original recombination event. The levels of variation across the MHC were similar to those seen for other HLA-disparate haplotypes, except for a 158-kb segment that contained the HLA-DRB1, -DQA1, and -DQB1 genes and showed very limited polymorphism compatible with identity-by-descent and relatively recent common ancestry (<3,400 generations). These results indicate that the differential disease associations of these two DR3 haplotypes are due to sequence variation outside this central 158-kb segment, and that shuffling of ancestral blocks via recombination is a potential mechanism whereby certain DR-DQ allelic combinations, which presumably have favoured immunological functions, can spread across haplotypes and populations.
Collapse
Affiliation(s)
- James A Traherne
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Roger Horton
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Anne N Roberts
- Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - Marcos M Miretti
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Matthew E Hurles
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - C. Andrew Stewart
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Jennifer L Ashurst
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Alexey M Atrazhev
- Alberta Diabetes Institute (ADI), Department of Medical Microbiology and Immunology, Division of Dermatology and Cutaneous Sciences, University of Alberta, Edmonton, Canada
| | - Penny Coggill
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Sophie Palmer
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Jeff Almeida
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Sarah Sims
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Laurens G Wilming
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Jane Rogers
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Pieter J. de Jong
- Children's Hospital Oakland Research Institute, Oakland, California, United States of America
| | - Mary Carrington
- Basic Research Program, SAIC-Frederick, Inc., Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland, United States of America
| | - John F Elliott
- Alberta Diabetes Institute (ADI), Department of Medical Microbiology and Immunology, Division of Dermatology and Cutaneous Sciences, University of Alberta, Edmonton, Canada
| | - Stephen Sawcer
- Department of Clinical Neurosciences, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - John A Todd
- Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - John Trowsdale
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Stephan Beck
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
8
|
Kulski JK, Anzai T, Inoko H. ERVK9, transposons and the evolution of MHC class I duplicons within the alpha-block of the human and chimpanzee. Cytogenet Genome Res 2005; 110:181-92. [PMID: 16093671 DOI: 10.1159/000084951] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2003] [Accepted: 10/21/2003] [Indexed: 11/19/2022] Open
Abstract
The genomic sequences within the alpha-block (approximately 288-310 kb) of the human and chimpanzee MHC class I region contains ten MHC class I genes and three MIC gene fragments grouped together within alternating duplicated genomic segments or duplicons. In this study, the chimpanzee and human genomic sequences were analyzed in order to determine whether the remnants of the ERVK9 and other retrotransposon sequences are useful genomic markers for reconstructing the evolutionary history of the duplicated MHC gene families within the alpha-block. A variety of genes, pseudogenes, autologous DNA transposons and retrotransposons such as Alu and ERVK9 were used to categorize the ten duplicons into four distinct structural groups. The phylogenetic relationship of the ten duplicons was examined by using the neighbour joining method to analyze transposon sequence topologies of selected Alu members, LTR16B and Charlie9. On the basis of these structural groups and the phylogeny of the duplicated transposon sequences, a duplication model was reconstructed involving four multipartite tandem duplication steps to explain the organization and evolution of the ten duplicons within the alpha-block of the chimpanzee and human. The phylogenetic analysis and inferred duplication history suggests that the Patr/HLA-F was the first MHC class I gene to have been fixed and not required as a precursor for further duplication within the alpha-block of the ancestral species.
Collapse
Affiliation(s)
- J K Kulski
- Centre for Bioinformatics and Biological Computing, School of Information Technology, Murdoch University, Murdoch, Western Australia.
| | | | | |
Collapse
|
9
|
Kulski JK, Dunn DS. Polymorphic Alu insertions within the Major Histocompatibility Complex class I genomic region: a brief review. Cytogenet Genome Res 2005; 110:193-202. [PMID: 16093672 DOI: 10.1159/000084952] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2003] [Accepted: 10/21/2003] [Indexed: 11/19/2022] Open
Abstract
Most polymorphic Alu insertions (POALINs) belong to a subgroup of the Alu multicopy retrotransposon family of short interspersed nucleotide elements (SINEs) that are categorized as AluYb8 and AluYa5. The number of AluYb8/AluYa5 members (approximately 4,492 copies) is significantly less than the approximately one million fixed Alu copies per human genome. We have studied the presence of POALINs within the Major Histocompatibility Complex (MHC) class I region on the short arm of chromosome 6 (6p21.3) because this region has a high gene density, many genes with immune system functions, large sequence variations and diversity, duplications and redundancy, and a strong association with more than 100 different diseases. Since little is known about POALINs within the MHC genomic region, we undertook to identify some of the members of the AluYb8/AluYa5 subfamily and to study their frequency of distribution and genetic characteristics in different populations. As a result of our comparative genomic analyses, we identified the insertion sites for five POALINs distributed within the MHC class I region. This brief review outlines the locations of the insertions and sequence features of the five MHC POALINs, their single site and haplotype frequencies in different geographic populations, and their association with different HLA class I genes and disease. We show that the MHC POALINs have a potential value as lineage and linkage markers for the study of human population genetics, disease associations, genomic diversity and evolution.
Collapse
Affiliation(s)
- J K Kulski
- Centre for Bioinformatics and Biological Computing, School of Information Technology, Murdoch University, Murdoch, Western Australia.
| | | |
Collapse
|
10
|
Li S, Kawata H, Katsuyama Y, Ota M, Morishima Y, Mano S, Kulski JK, Naruse T, Inoko H. Association of polymorphic MHC microsatellites with GVHD, survival, and leukemia relapse in unrelated hematopoietic stem cell transplant donor/recipient pairs matched at five HLA loci. ACTA ACUST UNITED AC 2004; 63:362-8. [PMID: 15009808 DOI: 10.1111/j.0001-2815.2004.00200.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
In order to determine whether matching/mismatching for microsatellite polymorphism provides useful information on acute graft-vs-host disease (GVHD), survival, and leukemia relapse in hematopoietic stem cell (HSC) transplantation, we genotyped for polymorphisms at 13 microsatellite loci within the major histocompatibility complex (MHC) of 100 unrelated HSC transplant donor-recipient pairs who were matched at five classical human leukocyte antigen (HLA) loci. A high percentage of allele matching was obtained for five microsatellite loci, DQCARII (96%), MICA (93%), MIB (89%), C1-3-1 (93%), and D6S510 (97%), that are localized within 100 kb of the HLA-DR, HLA-DQ, HLA-B, HLA-C, or HLA-A locus. In contrast, the other eight microsatellites are located farther away from the HLA classical loci and have much lower percentages of allele matching [e.g. tumor necrosis factor a (TNFa) (73%), TNFd (74%), D6S273 (64%), C3-2-11 (46%), C5-3-1 (50%), C5-4-5 (63%), C5-2-7 (68%), and D6S265 (81%)]. Therefore, there were at least eight microsatellite markers with relatively high percentages of mismatches in the donor/recipient pairs with acute or chronic GVHD, poor graft survival, and leukemia relapse. However, there were no statistically significant associations between mismatched donor-recipient pairs at the 13 microsatellite loci and acute or chronic GVHD, graft survival, and leukemia relapse. Nevertheless, allele matching at the microsatellite TNFd locus near the TNFa gene was found by the Fisher's exact double-sided test to be significantly associated with decreased survival in the grade III/IV acute GVHD group. Overall, these results suggest that the matching of microsatellite polymorphisms within the HLA region, especially the ones farthest from the classical HLA loci, was not useful indicator for the outcome of HSC transplantation from unrelated donors. In this regard, the future determination of the genome-wide microsatellite genotypes in HLA-matched donor-recipient pairs, outside the MHC, may be a better possibility for identifying minor histocompatibility genes in linkage disequilibria with microsatellites as potential predictive markers for the occurrence of acute GVHD and survival rate in HSC transplantation.
Collapse
Affiliation(s)
- S Li
- Department of Molecular Life Science, Division of Basic Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Kanagawa 259-1193, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
McLure CA, Williamson JF, Stewart BJ, Keating PJ, Dawkins RL. Genomic analysis reveals a duplication of eight rather than seven short consensus repeats in primate CR1 and CR1L: evidence for an additional set shared between CR1 and CR2. Immunogenetics 2004; 56:631-8. [PMID: 15526096 DOI: 10.1007/s00251-004-0731-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2004] [Revised: 10/04/2004] [Indexed: 11/24/2022]
Abstract
We report the discovery of previously unrecognised short consensus repeats (SCRs) within human and chimpanzee CR1 and CR1L. Analysis of available genomic, protein and expression databases suggests that these are actually genomic remnants of SCRs previously reported in other complement control proteins (CCPs). Comparison with the nucleotide motifs of the 11 defined subfamilies of SCRs justifies the designation g-like because of the close similarity to the g subfamily found in CR2 and MCP. To date, we have identified five such SCRs in human and chimpanzee CR1, one in human and chimpanzee CR1L, but none in either rat or mouse Crry in keeping with the number of internal duplications of the long homologous repeat (LHR) found in CR1 and CR1L. In fact, at the genomic level, the ancestral LHR must have contained eight SCRs rather than seven as previously thought. Since g-like SCRs are found immediately downstream of d SCRs, we suggest that there must have been a functional dg set which has been retained by CR2 and MCP but which is degenerate in CR1 or CR1L. Interestingly, dg is also present in the CR2 component of mouse CR1. The degeneration of the g SCR must have occurred prior to the formation of primate CR1L and prior to the duplication events which resulted in primate CR1. In this context, the apparent conservation of g-like SCRs may be surprising and may suggest the existence of mechanisms unrelated to protein coding. These results provide examples of the many processes which have contributed to the evolution of the extensive repertoire of CCPs.
Collapse
Affiliation(s)
- C A McLure
- Centre for Molecular Immunology and Instrumentation, University of Western Australia, Canning Vale South, P.O. Box 5100, 6907 Nedlands, Western Australia, Australia
| | | | | | | | | |
Collapse
|
12
|
Martin AM, Kulski JK, Gaudieri S, Witt CS, Freitas EM, Trowsdale J, Christiansen FT. Comparative genomic analysis, diversity and evolution of two KIR haplotypes A and B. Gene 2004; 335:121-31. [PMID: 15194195 DOI: 10.1016/j.gene.2004.03.018] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2003] [Revised: 02/05/2004] [Accepted: 03/18/2004] [Indexed: 11/29/2022]
Abstract
Members of the killer immunoglobulin (Ig)-like receptor (KIR) gene family are tightly clustered on human chromosome 19q13.4. Despite considerable variation in KIR gene content and allelic polymorphism, most KIR haplotypes belong to one of two broad groups termed A and B. The availability of contiguous genomic sequences for these haplotypes has allowed us to compare their genomic organization, nucleotide (nt) diversity and reconstruct their evolutionary history. The haplotypes have a framework of three conserved blocks containing (i) KIR3DL3, (ii) KIR3DP1, 2DL4, and (iii) KIR3DL2 that are interrupted by two variable segments that differ in the number and type of KIR genes. Low (0.05%) nucleotide diversity was detected across the centromeric and telomeric boundaries of the KIR gene cluster while higher SNP density (0.2%) occurred within the central region containing the KIR2DL4 gene. Phylogenetic and genomic analyses have permitted the reconstruction of a hypothetical ancestral haplotype that has revealed common groupings and differences between the KIR genes of the two haplotypes. The present phylogenetic and genomic comparison of the two sequenced KIR haplotypes provides a framework for a more thorough examination of KIR haplotype variations, diversity and evolution in human populations and between humans and non-human primates.
Collapse
Affiliation(s)
- Annalise M Martin
- Centre for Clinical Immunology and Biomedical Statistics, Murdoch University, Western Australia 6150, Australia
| | | | | | | | | | | | | |
Collapse
|
13
|
Stewart CA, Horton R, Allcock RJN, Ashurst JL, Atrazhev AM, Coggill P, Dunham I, Forbes S, Halls K, Howson JMM, Humphray SJ, Hunt S, Mungall AJ, Osoegawa K, Palmer S, Roberts AN, Rogers J, Sims S, Wang Y, Wilming LG, Elliott JF, de Jong PJ, Sawcer S, Todd JA, Trowsdale J, Beck S. Complete MHC haplotype sequencing for common disease gene mapping. Genome Res 2004; 14:1176-87. [PMID: 15140828 PMCID: PMC419796 DOI: 10.1101/gr.2188104] [Citation(s) in RCA: 247] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2003] [Accepted: 02/13/2004] [Indexed: 11/24/2022]
Abstract
The future systematic mapping of variants that confer susceptibility to common diseases requires the construction of a fully informative polymorphism map. Ideally, every base pair of the genome would be sequenced in many individuals. Here, we report 4.75 Mb of contiguous sequence for each of two common haplotypes of the major histocompatibility complex (MHC), to which susceptibility to >100 diseases has been mapped. The autoimmune disease-associated-haplotypes HLA-A3-B7-Cw7-DR15 and HLA-A1-B8-Cw7-DR3 were sequenced in their entirety through a bacterial artificial chromosome (BAC) cloning strategy using the consanguineous cell lines PGF and COX, respectively. The two sequences were annotated to encompass all described splice variants of expressed genes. We defined the complete variation content of the two haplotypes, revealing >18,000 variations between them. Average SNP densities ranged from less than one SNP per kilobase to >60. Acquisition of complete and accurate sequence data over polymorphic regions such as the MHC from large-insert cloned DNA provides a definitive resource for the construction of informative genetic maps, and avoids the limitation of chromosome regions that are refractory to PCR amplification.
Collapse
Affiliation(s)
- C Andrew Stewart
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Amadou C, Younger RM, Sims S, Matthews LH, Rogers J, Kumanovics A, Ziegler A, Beck S, Lindahl KF. Co-duplication of olfactory receptor and MHC class I genes in the mouse major histocompatibility complex. Hum Mol Genet 2003; 12:3025-40. [PMID: 14506126 DOI: 10.1093/hmg/ddg317] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We report the 897 kb sequence of a cluster of olfactory receptor (OR) genes located at the distal end of the major histocompatibility complex (MHC) class I region on mouse chromosome 17 of strain 129/SvJ (H2bc). With additional information from the mouse genome draft sequence, we identified 59 OR loci (approximately 20% pseudogenes) in contrast to only 25 OR loci (approximately 50% pseudogenes) in the corresponding centromeric OR cluster that is part of the 'extended MHC class I region' on human chromosome 6. Comparative analysis leads to three major observations: (i) most of the OR subfamilies have evolved independently in the two species, expanding more in the mouse, and resulting in co-orthologs--subfamilies of highly similar paralogs that keep orthologous relationships with their human counterparts; (ii) three of the mouse OR subfamilies have no orthologs in humans; and (iii) MHC class I loci are interspersed in the OR cluster in mouse but not in human, and were subjected to co-duplication with OR genes. Screening of our sequence against the available sequences of other strains/haplotypes revealed that most of the OR loci are polymorphic and that the number of OR loci may vary among strains/haplotypes. Our findings that MHC-linked OR loci share duplication with MHC class I loci, have duplicated extensively and are polymorphic revives questions about potential reciprocal influences acting on the dynamics and evolution of the H2 region and the H2-linked OR loci.
Collapse
MESH Headings
- Alleles
- Amino Acid Sequence
- Animals
- Chromosome Mapping
- Chromosomes, Human, Pair 6
- Chromosomes, Mammalian
- Consensus Sequence
- Evolution, Molecular
- Gene Duplication
- Genes, MHC Class I
- Haplotypes
- Histocompatibility Antigens Class II/genetics
- Humans
- Major Histocompatibility Complex/genetics
- Mice
- Mice, Inbred Strains
- Molecular Sequence Data
- Mutation
- Phylogeny
- Polymorphism, Genetic
- Protein Structure, Tertiary
- Receptors, Odorant/chemistry
- Receptors, Odorant/genetics
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Species Specificity
Collapse
Affiliation(s)
- Claire Amadou
- Howard Hughes Medical Institute and Center for Immunology, University of Texas Southwestern Medical Center, Dallas, 75390-9050, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Longman-Jacobsen N, Williamson JF, Dawkins RL, Gaudieri S. In polymorphic genomic regions indels cluster with nucleotide polymorphism: Quantum Genomics. Gene 2003; 312:257-61. [PMID: 12909362 DOI: 10.1016/s0378-1119(03)00621-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Previously, we have described polymorphic frozen blocks (PFBs) within the Major Histocompatibility Complex (MHC) as regions of several hundred kilobases characterised by high nucleotide diversity, little or no recombination, duplicated segments, disease susceptibility, and human endogenous retroviruses. The nucleotide diversity profile within these PFBs shows peaks and troughs outside of the Class I genes, reflecting other important genes (or sequences) in the region. Here we show that indel density is also clustered with similar peaks and troughs. In fact, SNPs and indels are co-located within PFBs.
Collapse
Affiliation(s)
- Natalie Longman-Jacobsen
- Centre for Molecular Immunology and Instrumentation, University of Western Australia, P.O. Box 5100, Canning Vale South, Western Australia 6155, Australia
| | | | | | | |
Collapse
|
16
|
Anzai T, Shiina T, Kimura N, Yanagiya K, Kohara S, Shigenari A, Yamagata T, Kulski JK, Naruse TK, Fujimori Y, Fukuzumi Y, Yamazaki M, Tashiro H, Iwamoto C, Umehara Y, Imanishi T, Meyer A, Ikeo K, Gojobori T, Bahram S, Inoko H. Comparative sequencing of human and chimpanzee MHC class I regions unveils insertions/deletions as the major path to genomic divergence. Proc Natl Acad Sci U S A 2003; 100:7708-13. [PMID: 12799463 PMCID: PMC164652 DOI: 10.1073/pnas.1230533100] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Despite their high degree of genomic similarity, reminiscent of their relatively recent separation from each other ( approximately 6 million years ago), the molecular basis of traits unique to humans vs. their closest relative, the chimpanzee, is largely unknown. This report describes a large-scale single-contig comparison between human and chimpanzee genomes via the sequence analysis of almost one-half of the immunologically critical MHC. This 1,750,601-bp stretch of DNA, which encompasses the entire class I along with the telomeric part of the MHC class III regions, corresponds to an orthologous 1,870,955 bp of the human HLA region. Sequence analysis confirms the existence of a high degree of sequence similarity between the two species. However, and importantly, this 98.6% sequence identity drops to only 86.7% taking into account the multiple insertions/deletions (indels) dispersed throughout the region. This is functionally exemplified by a large deletion of 95 kb between the virtual locations of human MICA and MICB genes, which results in a single hybrid chimpanzee MIC gene, in a segment of the MHC genetically linked to species-specific handling of several viral infections (HIV/SIV, hepatitis B and C) as well as susceptibility to various autoimmune diseases. Finally, if generalized, these data suggest that evolution may have used the mechanistically more drastic indels instead of the more subtle single-nucleotide substitutions for shaping the recently emerged primate species.
Collapse
Affiliation(s)
- Tatsuya Anzai
- Department of Genetic Information, Division of Molecular Life Science, Tokai University School of Medicine, Bohseidai, Isehara, Kanagawa 259-1193, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Saunders MA, Hammer MF, Nachman MW. Nucleotide variability at G6pd and the signature of malarial selection in humans. Genetics 2002; 162:1849-61. [PMID: 12524354 PMCID: PMC1462360 DOI: 10.1093/genetics/162.4.1849] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is the most common enzymopathy in humans. Deficiency alleles for this X-linked disorder are geographically correlated with historical patterns of malaria, and the most common deficiency allele in Africa (G6PD A-) has been shown to confer some resistance to malaria in both hemizygous males and heterozygous females. We studied DNA sequence variation in 5.1 kb of G6pd from 47 individuals representing a worldwide sample to examine the impact of selection on patterns of human nucleotide diversity and to infer the evolutionary history of the G6PD A- allele. We also sequenced 3.7 kb of a neighboring locus, L1cam, from the same set of individuals to study the effect of selection on patterns of linkage disequilibrium. Despite strong clinical evidence for malarial selection maintaining G6PD deficiency alleles in human populations, the overall level of nucleotide heterozygosity at G6pd is typical of other genes on the X chromosome. However, the signature of selection is evident in the absence of genetic variation among A- alleles from different parts of Africa and in the unusually high levels of linkage disequilibrium over a considerable distance of the X chromosome. In spite of a long-term association between Plasmodium falciparum and the ancestors of modern humans, patterns of nucleotide variability and linkage disequilibrium suggest that the A- allele arose in Africa only within the last 10,000 years and spread due to selection.
Collapse
Affiliation(s)
- Matthew A Saunders
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | |
Collapse
|
18
|
Kulski JK, Shiina T, Anzai T, Kohara S, Inoko H. Comparative genomic analysis of the MHC: the evolution of class I duplication blocks, diversity and complexity from shark to man. Immunol Rev 2002; 190:95-122. [PMID: 12493009 DOI: 10.1034/j.1600-065x.2002.19008.x] [Citation(s) in RCA: 175] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The major histocompatibility complex (MHC) genomic region is composed of a group of linked genes involved functionally with the adaptive and innate immune systems. The class I and class II genes are intrinsic features of the MHC and have been found in all the jawed vertebrates studied so far. The MHC genomic regions of the human and the chicken (B locus) have been fully sequenced and mapped, and the mouse MHC sequence is almost finished. Information on the MHC genomic structures (size, complexity, genic and intergenic composition and organization, gene order and number) of other vertebrates is largely limited or nonexistent. Therefore, we are mapping, sequencing and analyzing the MHC genomic regions of different human haplotypes and at least eight nonhuman species. Here, we review our progress with these sequences and compare the human MHC structure with that of the nonhuman primates (chimpanzee and rhesus macaque), other mammals (pigs, mice and rats) and nonmammalian vertebrates such as birds (chicken and quail), bony fish (medaka, pufferfish and zebrafish) and cartilaginous fish (nurse shark). This comparison reveals a complex MHC structure for mammals and a relatively simpler design for nonmammalian animals with a hypothetical prototypic structure for the shark. In the mammalian MHC, there are two to five different class I duplication blocks embedded within a framework of conserved nonclass I and/or nonclass II genes. With a few exceptions, the class I framework genes are absent from the MHC of birds, bony fish and sharks. Comparative genomics of the MHC reveal a highly plastic region with major structural differences between the mammalian and nonmammalian vertebrates. Additional genomic data are needed on animals of the reptilia, crocodilia and marsupial classes to find the origins of the class I framework genes and examples of structures that may be intermediate between the simple and complex MHC organizations of birds and mammals, respectively.
Collapse
Affiliation(s)
- Jerzy K Kulski
- Department of Molecular Life Science, Tokai University School of Medicine, Bohseidai, Isehara, Kanagawa, Japan
| | | | | | | | | |
Collapse
|
19
|
Abstract
We studied the effect of multilocus balancing selection on neutral nucleotide variability at linked sites by simulating a model where diallelic polymorphisms are maintained at an arbitrary number of selected loci by means of symmetric overdominance. Different combinations of alleles define different genetic backgrounds that subdivide the population and strongly affect variability. Several multilocus fitness regimes with different degrees of epistasis and gametic disequilibrium are allowed. Analytical results based on a multilocus extension of the structured coalescent predict that the expected linked neutral diversity increases exponentially with the number of selected loci and can become extremely large. Our simulation results show that although variability increases with the number of genetic backgrounds that are maintained in the population, it is reduced by random fluctuations in the frequencies of those backgrounds and does not reach high levels even in very large populations. We also show that previous results on balancing selection in single-locus systems do not extend to the multilocus scenario in a straightforward way. Different patterns of linkage disequilibrium and of the frequency spectrum of neutral mutations are expected under different degrees of epistasis. Interestingly, the power to detect balancing selection using deviations from a neutral distribution of allele frequencies seems to be diminished under the fitness regime that leads to the largest increase of variability over the neutral case. This and other results are discussed in the light of data from the Mhc.
Collapse
Affiliation(s)
- Arcadio Navarro
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
20
|
Charlesworth D, Charlesworth B, McVean GA. Genome sequences and evolutionary biology, a two-way interaction. Trends Ecol Evol 2001; 16:235-242. [PMID: 11301152 DOI: 10.1016/s0169-5347(01)02126-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Complete genome sequences are accumulating rapidly, culminating with the announcement of the human genome sequence in February 2001. In addition to cataloguing the diversity of genes and other sequences, genome sequences will provide the first detailed and complete data on gene families and genome organization, including data on evolutionary changes. Reciprocally, evolutionary biology will make important contributions to the efforts to understand functions of genes and other sequences in genomes. Large-scale, detailed and unbiased comparisons between species will illuminate the evolution of genes and genomes, and population genetics methods will enable detection of functionally important genes or sequences, including sequences that have been involved in adaptive changes.
Collapse
|
21
|
Gaudieri S, Longman-Jacobsen N, Tay GK, Dawkins RL. Sequence analysis of the MHC class I region reveals the basis of the genomic matching technique. Hum Immunol 2001; 62:279-85. [PMID: 11250045 DOI: 10.1016/s0198-8859(01)00210-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The genomic matching technique (GMT) improves survival following bone marrow transplantation (BMT) between unrelated donor and recipient pairs correlating with a decrease in incidence and severity of graft-versus-host disease (GvHD). The principles of this technique are based on the duplication and polymorphic characteristics of the major histocompatibility complex (MHC). Specifically, the beta block GMT matches for a 300 kb region that contains the human leukocyte antigen (HLA-B and -C) genes as well as other non-HLA genes such as the natural killer cell receptor ligand PERB11 (MIC). The block contains two large segmental duplications. One results in two PERB11 genes (11.1 and 11.2), the other in two class I genes (HLA-B and -C). With the complete sequencing of the class I region of the MHC in different haplotypes, we can now show that the beta block GMT profiles reflect amplification of the duplicated PERB11 segments and not the duplicated segments containing HLA-B and -C, and yet provide a signature that characterizes the entire block rather than individual loci.
Collapse
Affiliation(s)
- S Gaudieri
- Centre for Molecular Immunology and Instrumentation, University of Western Australia, Western, Nedlands, Australia
| | | | | | | |
Collapse
|
22
|
Gaudieri S, Dawkins RL, Habara K, Kulski JK, Gojobori T. SNP profile within the human major histocompatibility complex reveals an extreme and interrupted level of nucleotide diversity. Genome Res 2000; 10:1579-86. [PMID: 11042155 PMCID: PMC310975 DOI: 10.1101/gr.127200] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The human major histocompatibility complex (MHC) is characterized by polymorphic multicopy gene families, such as HLA and MIC (PERB11); duplications; insertions and deletions (indels); and uneven rates of recombination. Polymorphisms at the antigen recognition sites of the HLA class I and II genes and at associated neutral sites have been attributed to balancing selection and a hitchhiking effect, respectively. We, and others, have previously shown that nucleotide diversity between MHC haplotypes at non-HLA sites is unusually high (>10%) and up to several times greater than elsewhere in the genome (0.08%-0.2%). We report here the most extensive analysis of nucleotide diversity within a continuous sequence in the genome. We constructed a single nucleotide polymorphism (SNP) profile that reveals a pattern of extreme but interrupted levels of nucleotide diversity by comparing a continuous sequence within haplotypes in three genomic subregions of the MHC. A comparison of several haplotypes within one of the genomic subregions containing the HLA-B and -C loci suggests that positive selection is operating over the whole subgenomic region, including HLA and non-HLA genes. [The sequence data for the multiple haplotype comparisons within the class I region have been submitted to DDBJ/EMBL/GenBank under accession nos. AF029061, AF029062, and AB031005-AB031010. Additional sequence data have been submitted to the DDBJ data library under accession nos. AB031005-AB03101 and AF029061-AF029062.]
Collapse
Affiliation(s)
- S Gaudieri
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka-ken 411-8540, Japan
| | | | | | | | | |
Collapse
|