1
Kulski JK, Suzuki S, Shiina T. SNP-Density Crossover Maps of Polymorphic Transposable Elements and HLA Genes Within MHC Class I Haplotype Blocks and Junction. Front Genet 2021; 11:594318. [PMID: 33537058 PMCID: PMC7848197 DOI: 10.3389/fgene.2020.594318] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 08/13/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022]
Abstract
The genomic region (~4 Mb) of the human major histocompatibility complex (MHC) on chromosome 6p21 is a prime model for the study and understanding of conserved polymorphic sequences (CPSs) and structural diversity of ancestral haplotypes (AHs)/conserved extended haplotypes (CEHs). The aim of this study was to use a set of 95 MHC genomic sequences downloaded from a publicly available BioProject database at NCBI to identify and characterise polymorphic human leukocyte antigen (HLA) class I genes and pseudogenes, MICA and MICB, and retroelement indels as haplotypic lineage markers, and single-nucleotide polymorphism (SNP) crossover loci in DNA sequence alignments of different haplotypes across the Olfactory Receptor (OR) gene region (~1.2 Mb) and the MHC class I region (~1.8 Mb) from the GPX5 to the MICB gene. Our comparative sequence analyses confirmed the identity of 12 haplotypic retroelement markers and revealed that they partitioned the HLA-A/B/C haplotypes into distinct evolutionary lineages. Crossovers between SNP-poor and SNP-rich regions defined the sequence range of haplotype blocks, and many of these crossover junctions occurred within particular transposable elements, lncRNA, OR12D2, MUC21, MUC22, PSORS1A3, HLA-C, HLA-B, and MICA. In a comparison of more than 250 paired sequence alignments, at least 38 SNP-density crossover sites were mapped across various regions from GPX5 to MICB. In a homology comparison of 16 different haplotypes, seven CEH/AH (7.1, 8.1, 18.2, 51.x, 57.1, 62.x, and 62.1) had no detectable SNP-density crossover junctions and were SNP poor across the entire ~2.8 Mb of sequence alignments. Of the analyses between different recombinant haplotypes, more than half of them had SNP crossovers within 10 kb of LTR16B/ERV3-16A3_I, MLT1, Charlie, and/or THE1 sequences and were in close vicinity to structurally polymorphic Alu and SVA insertion sites. 
These studies demonstrate that (1) SNP-density crossovers are associated with putative ancestral recombination sites that are widely spread across the MHC class I genomic region from at least the telomeric OR12D2 gene to the centromeric MICB gene and (2) the genomic sequences of MHC homozygous cell lines are useful for analysing haplotype blocks, ancestral haplotypic landscapes and markers, CPSs, and SNP-density crossover junctions.
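The idea of an SNP-density crossover between SNP-poor and SNP-rich regions can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the non-overlapping window scheme, and the `threshold` parameter are all hypothetical choices made only for illustration, and the input is assumed to be a pair of already aligned sequences of equal length.

```python
def snp_density(seq_a, seq_b, window):
    """Count mismatches per non-overlapping window of an aligned sequence pair."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    counts = []
    for start in range(0, len(seq_a), window):
        pairs = zip(seq_a[start:start + window], seq_b[start:start + window])
        # Skip gap columns so that indels are not counted as SNPs.
        counts.append(sum(1 for a, b in pairs if a != b and a != "-" and b != "-"))
    return counts


def crossover_windows(counts, threshold):
    """Return each window index where density switches between
    SNP-poor (<= threshold) and SNP-rich (> threshold) relative to the previous window."""
    rich = [c > threshold for c in counts]
    return [i for i in range(1, len(rich)) if rich[i] != rich[i - 1]]


# Toy alignment: identical first half, fully divergent second half.
counts = snp_density("A" * 20, "A" * 10 + "C" * 10, window=5)
junctions = crossover_windows(counts, threshold=1)
```

In this toy case the switch is reported at the third window (index 2), where the pair changes from SNP-poor to SNP-rich. Real data would need a window size and threshold tuned to the alignment length and background diversity.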
Affiliation(s)
- Jerzy K. Kulski
  - Faculty of Health and Medical Sciences, Medical School, The University of Western Australia, Crawley, WA, Australia
  - Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
- Shingo Suzuki
  - Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
- Takashi Shiina
  - Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
2
Kulski JK. Long Noncoding RNA HCP5, a Hybrid HLA Class I Endogenous Retroviral Gene: Structure, Expression, and Disease Associations. Cells 2019; 8:480. [PMID: 31137555 PMCID: PMC6562477 DOI: 10.3390/cells8050480] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Received: 05/05/2019] [Revised: 05/16/2019] [Accepted: 05/17/2019] [Indexed: 02/06/2023]
Abstract
The HCP5 RNA gene (NCBI ID: 10866) is located centromeric of the HLA-B gene and between the MICA and MICB genes within the major histocompatibility complex (MHC) class I region. It is a human species-specific gene that codes for a long noncoding RNA (lncRNA), composed mostly of an ancient ancestral endogenous antisense 3′ long terminal repeat (LTR) and part of the internal pol antisense sequence of endogenous retrovirus (ERV) type 16, linked to a human leukocyte antigen (HLA) class I promoter and leader sequence at the 5′-end. Since its discovery in 1993, many disease association and gene expression studies have shown that HCP5 is a regulatory lncRNA involved in adaptive and innate immune responses and associated with the promotion of some autoimmune diseases and cancers. The gene sequence acts as a genomic anchor point for binding transcription factors, enhancers, and chromatin remodeling enzymes in the regulation of transcription and chromatin folding. The HCP5 antisense retroviral transcript also interacts with regulatory microRNA and immune and cellular checkpoints in cancers, suggesting its potential as a drug target for novel antitumor therapeutics.
Affiliation(s)
- Jerzy K Kulski
  - Faculty of Health and Medical Sciences, UWA Medical School, The University of Western Australia, Crawley, WA 6009, Australia
  - Department of Molecular Life Science, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara 259-1193, Japan
3
Wroblewski EE, Parham P, Guethlein LA. Two to Tango: Co-evolution of Hominid Natural Killer Cell Receptors and MHC. Front Immunol 2019; 10:177. [PMID: 30837985 PMCID: PMC6389700 DOI: 10.3389/fimmu.2019.00177] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Received: 11/30/2018] [Accepted: 01/21/2019] [Indexed: 12/16/2022]
Abstract
Natural killer (NK) cells have diverse roles in hominid immunity and reproduction. These functions are modulated by the interactions of major histocompatibility complex (MHC) class I molecules with the two NK cell surface receptor types for which they are ligands. Diverse killer cell immunoglobulin-like receptors (KIR) bind specific motifs encoded within the polymorphic MHC class I cell surface glycoproteins, while, in more conserved interactions, CD94:NKG2A receptors recognize MHC-E with bound peptides derived from MHC class I leader sequences. The hominid lineage presents a choreographed co-evolution of KIR with their MHC class I ligands. MHC-A, -B, and -C are present in all great apes with species-specific haplotypic variation in gene content. The Bw4 epitope recognized by lineage II KIR is restricted to MHC-B but also present on some gorilla and human MHC-A. Common to great apes, but rare in humans, are MHC-B possessing a C1 epitope recognized by lineage III KIR. MHC-C arose from duplication of MHC-B and is fixed in all great apes except orangutan, where it exists on approximately 50% of haplotypes and all allotypes are C1-bearing. A recent study showed that gorillas possess yet another intermediate MHC organization compared to humans. As in orangutans, but unlike in the Pan-Homo species, duplication of MHC-B occurred in gorillas. However, MHC-C is fixed, and the MHC-C C2 epitope (absent in orangutans) emerges. The evolution of MHC-C drove expansion of its cognate lineage III KIR. Recently, position −21 of the MHC-B leader sequence has been shown to be critical in determining NK cell educational outcome. In humans, methionine (−21M) results in CD94:NKG2A-focused education whereas threonine (−21T) produces KIR-focused education. This is another dynamic position among hominids. Orangutans have exclusively −21M, consistent with their intermediate stage in lineage III KIR-focused evolution.
Gorillas have both −21M and −21T, like humans, but they are unequally encoded by their duplicated B genes. Chimpanzees have near-fixed −21T, indicative of KIR-focused NK education. Harmonious with this observation, chimpanzee KIR exhibit strong binding and, compared to humans, smaller differences between binding levels of activating and inhibitory KIR. Consistent between these MHC-NK cell receptor systems over the course of hominid evolution is the evolution of polymorphism favoring the more novel and dynamic KIR system.
Affiliation(s)
- Emily E Wroblewski
  - Department of Anthropology, Washington University, St. Louis, MO, United States
- Peter Parham
  - Departments of Structural Biology and Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA, United States
- Lisbeth A Guethlein
  - Departments of Structural Biology and Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA, United States
4
Abstract
In comparison to humans and chimpanzees, gorillas show low diversity at MHC class I genes (Gogo), as reflected by an overall reduced level of allelic variation as well as the absence of a functionally important sequence motif that interacts with killer cell immunoglobulin-like receptors (KIR). Here, we use recently generated large-scale genomic sequence data for a reassessment of allelic diversity at Gogo-C, the gorilla orthologue of HLA-C. Through the combination of long-range amplifications and long-read sequencing technology, we obtained, among the 35 gorillas reanalyzed, three novel full-length genomic sequences including a coding region sequence that has not been previously described. The newly identified Gogo-C*03:01 allele has a divergent recombinant structure that sets it apart from other Gogo-C alleles. Domain-by-domain phylogenetic analysis shows that Gogo-C*03:01 has segments in common with Gogo-B*07, the additional B-like gene that is present on some gorilla MHC haplotypes. Identified in ~50% of the gorillas analyzed, the Gogo-C*03:01 allele is the only Gogo-C allotype that encodes the C1 epitope, indicating its important function in controlling natural killer cell (NK cell) responses via KIR. We further explored the hypothesis that gorillas experienced a selective sweep, which may have resulted in a general reduction of the gorilla MHC class I repertoire. Our results provide little support for a selective sweep but rather suggest that the overall low Gogo class I diversity can be best explained by drastic demographic changes gorillas experienced in the ancient and recent past.
5
Katsura Y, Stanley CE, Kumar S, Nei M. The Reliability and Stability of an Inferred Phylogenetic Tree from Empirical Data. Mol Biol Evol 2017; 34:718-723. [PMID: 28100791 PMCID: PMC5850655 DOI: 10.1093/molbev/msw272] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 11/24/2022]
Abstract
The reliability of a phylogenetic tree obtained from empirical data is usually measured by the bootstrap probability (Pb) of interior branches of the tree. If the bootstrap probability is high for most branches, the tree is considered to be reliable. If some interior branches show relatively low bootstrap probabilities, we cannot be sure that the inferred tree is reliable. Here, we propose another quantity for measuring the reliability of the tree, called the stability of a subtree. This quantity refers to the probability (Ps) of obtaining a subtree of an inferred tree. We then show that if the tree is to be reliable, both Pb and Ps must be high. We also show that Ps is given by a bootstrap probability of the subtree with the closest outgroup sequence, and a computer program, RESTA, for computing the Pb and Ps values will be presented.
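As a rough illustration of the bootstrap probability Pb mentioned in this abstract, the sketch below resamples alignment columns with replacement and then scores a clade by the fraction of replicate trees that contain it. It is not the RESTA program and does not compute Ps. Tree inference itself is left out, and each replicate tree is assumed to be supplied as a set of clades (frozensets of taxon names). All function names are hypothetical.

```python
import random


def resample_columns(alignment, rng):
    """Build one bootstrap replicate by drawing alignment columns with replacement."""
    n_sites = len(alignment[0])
    cols = [rng.randrange(n_sites) for _ in range(n_sites)]
    return ["".join(seq[c] for c in cols) for seq in alignment]


def bootstrap_probability(clade, replicate_clades):
    """Fraction of replicate trees (each a set of frozenset clades) containing the clade."""
    clade = frozenset(clade)
    hits = sum(1 for clades in replicate_clades if clade in clades)
    return hits / len(replicate_clades)


# One replicate alignment; a tree would be inferred from each replicate
# with whatever method was used for the original tree.
replicate = resample_columns(["ACGT", "ACGA", "TCGA"], random.Random(0))

# Hypothetical clade sets from four replicate trees.
replicate_trees = [
    {frozenset({"A", "B"})},
    {frozenset({"A", "B"})},
    {frozenset({"A", "C"})},
    {frozenset({"A", "B"})},
]
pb = bootstrap_probability({"A", "B"}, replicate_trees)
```

Here the clade {A, B} appears in three of the four replicate trees, so its bootstrap probability in this toy example is 0.75.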
Affiliation(s)
- Yukako Katsura
  - Department of Biology and Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
  - Department of Biology and Institute of Molecular Evolutionary Genetics, Pennsylvania State University, State College, PA
- Craig E Stanley
  - Department of Biology and Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
- Sudhir Kumar
  - Department of Biology and Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
- Masatoshi Nei
  - Department of Biology and Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
  - Department of Biology and Institute of Molecular Evolutionary Genetics, Pennsylvania State University, State College, PA
6
Signatures of Crested Ibis MHC Revealed by Recombination Screening and Short-Reads Assembly Strategy. PLoS One 2016; 11:e0168744. [PMID: 27997612 PMCID: PMC5173252 DOI: 10.1371/journal.pone.0168744] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 01/08/2016] [Accepted: 12/06/2016] [Indexed: 02/04/2023]
Abstract
Whole-genome shotgun (WGS) sequencing has become a routine method in genome research over the past decade. However, the assembly of highly polymorphic regions in WGS projects remains a challenge, especially for large genomes. The traditional strategy, which relies on BAC library construction, PCR screening, and Sanger sequencing, is laborious and expensive, and this hampers research on polymorphic genomic regions. As one of the most highly polymorphic regions, the major histocompatibility complex (MHC) plays a central role in the adaptive immunity of all jawed vertebrates. In this study, we introduced an efficient procedure based on recombination screening and short-reads assembly. With this procedure, we constructed a high quality 488-kb region of crested ibis MHC that consists of 3 superscaffolds and contains 50 genes. Our sequence showed comparable quality (97.29% identity) to traditional Sanger assembly, while the workload was reduced almost sevenfold. Comparative study revealed distinctive features of crested ibis by exhibiting the COL11A2-BLA-BLB-BRD2 cluster and presenting both ADPRH and odorant receptor (OR) gene in the MHC region. Furthermore, the conservation of the BF-TAP1-TAP2 structure in crested ibis and other vertebrate lineages is interesting in light of the hypothesis that coevolution of functionally related genes in the primordial MHC is responsible for the appearance of the antigen presentation pathways at the birth of the adaptive immune system.
7
Su Z, Zhang J, Kumar C, Molony C, Lu H, Chen R, Stone DJ, Ling F, Liu X. Species specific exome probes reveal new insights in positively selected genes in nonhuman primates. Sci Rep 2016; 6:33876. [PMID: 27659771 PMCID: PMC5034232 DOI: 10.1038/srep33876] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 03/15/2016] [Accepted: 09/05/2016] [Indexed: 01/20/2023]
Abstract
Nonhuman primates (NHP) are important biomedical animal models for the study of human disease. Of these, the most widely used models in biomedical research currently are from the genus Macaca. However, evolutionary genetic divergence between human and NHP species makes human-based probes inefficient for the capture of genomic regions of NHP for sequencing and study. Here we introduce a new method to resequence the exome of NHP species by a designed capture approach specifically targeted to the NHP, and demonstrate its superior performance on four NHP species or subspecies. Detailed investigation on biomedically relevant genes demonstrated superior capture by the new approach. We identified 28 genes that appeared to be pseudogenized and inactivated in macaque. Finally, we identified 187 genes showing strong evidence for positive selection across all branches of the primate phylogeny including many novel findings.
Affiliation(s)
- Zheng Su
  - BGI-Shenzhen, Shenzhen, Guangdong 518083, China
- Junjie Zhang
  - School of Bioscience & Bioengineering, South China University of Technology, Guangzhou, Guangdong 510006, China
- Chanchal Kumar
  - Translational Medicine Research Centre, Merck Research Laboratories, MSD, 8 Biomedical Grove, Neuros #04-01, Singapore 138665, Singapore
- Cliona Molony
  - Merck Research Laboratories, Merck & Co. Inc., 33 Avenue Louis Pasteur, Boston, MA 02115, USA
- Hongchao Lu
  - Informatics IT, MSD R&D (China) Co., Ltd., Beijing, China
- Ronghua Chen
  - Informatics IT, MSD R&D (China) Co., Ltd., Beijing, China
  - Informatics IT, Merck & Co., Inc., Boston, MA, USA
- David J Stone
  - Merck Research Laboratories, Merck & Co. Inc., 770 Sumneytown Pike, WP53B-120 West Point, PA 19486, USA
- Fei Ling
  - School of Bioscience & Bioengineering, South China University of Technology, Guangzhou, Guangdong 510006, China
- Xiao Liu
  - BGI-Shenzhen, Shenzhen, Guangdong 518083, China
  - Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
8
Abstract
NKG2D ligands (NKG2DLs) are a group of stress-inducible major histocompatibility complex (MHC) class I-like molecules that act as a danger signal alerting the immune system to the presence of abnormal cells. In mammals, two families of NKG2DL genes have been identified: the MIC gene family encoded in the MHC region and the ULBP gene family encoded outside the MHC region in most species. Some mammals have a third family of NKG2DL-like class I genes which we named MILL (MHC class I-like located near the leukocyte receptor complex). Despite the fact that MILL genes are more closely related to MIC genes than ULBP genes are to MIC genes, MILL molecules do not function as NKG2DLs, and their function remains unknown. With the progress of mammalian genome projects, information on the MIC, ULBP, and MILL gene families became available in many mammalian species. Here, we summarize such information and discuss the origin and evolution of the NKG2DL gene family from the viewpoint of host-pathogen coevolution.
Affiliation(s)
- Masanori Kasahara
  - Department of Pathology, Hokkaido University Graduate School of Medicine, Sapporo, Japan
- Yoichi Sutoh
  - Emory Vaccine Center and Department of Pathology and Laboratory Medicine, Emory University, Atlanta, GA, USA
9
Jaratlerdsiri W, Deakin J, Godinez RM, Shan X, Peterson DG, Marthey S, Lyons E, McCarthy FM, Isberg SR, Higgins DP, Chong AY, John JS, Glenn TC, Ray DA, Gongora J. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC. PLoS One 2014; 9:e114631. [PMID: 25503521 PMCID: PMC4263668 DOI: 10.1371/journal.pone.0114631] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Received: 06/30/2014] [Accepted: 11/11/2014] [Indexed: 12/22/2022]
Abstract
The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionarily distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with the birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled that in the MHC of the eutherians compared but was absent in avian MHC, suggesting that the saltwater crocodile MHC has a gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and that of other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.
Affiliation(s)
- Weerachai Jaratlerdsiri
  - Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
- Janine Deakin
  - Evolution Ecology and Genetics, Research School of Biology, Australian National University, Canberra, Australian Capital Territory 2601, Australia
  - Institute for Applied Ecology, University of Canberra, Canberra, Australian Capital Territory 2601, Australia
- Ricardo M. Godinez
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, United States of America
  - Department of Genetics, Harvard Medical School, 77 Louis Pasteur Ave., Boston, Massachusetts 02115, United States of America
- Xueyan Shan
  - Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi 39762, United States of America
- Daniel G. Peterson
  - Institute for Genomics, Biocomputing and Biotechnology (IGBB), Mississippi State University, Mississippi State, Mississippi 39762, United States of America
- Sylvain Marthey
  - Animal Genetics and Integrative Biology, INRA, UMR 1313 Jouy-en-Josas 78352, France
- Eric Lyons
  - School of Plant Science, University of Arizona, Tucson, Arizona 85721, United States of America
- Fiona M. McCarthy
  - School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, Arizona 85721, United States of America
- Sally R. Isberg
  - Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
  - Center for Crocodile Research, P.O. Box 329, Noonamah, Northern Territory 0837, Australia
- Damien P. Higgins
  - Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
- Amanda Y. Chong
  - Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
- John St John
  - Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, United States of America
- Travis C. Glenn
  - Department of Environmental Health Science, University of Georgia, Athens, Georgia 30602, United States of America
- David A. Ray
  - Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi 39762, United States of America
  - Institute for Genomics, Biocomputing and Biotechnology (IGBB), Mississippi State University, Mississippi State, Mississippi 39762, United States of America
- Jaime Gongora
  - Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
10
11
Harrow JL, Steward CA, Frankish A, Gilbert JG, Gonzalez JM, Loveland JE, Mudge J, Sheppard D, Thomas M, Trevanion S, Wilming LG. The Vertebrate Genome Annotation browser 10 years on. Nucleic Acids Res 2013; 42:D771-9. [PMID: 24316575 PMCID: PMC3964964 DOI: 10.1093/nar/gkt1241] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Indexed: 01/07/2023]
Abstract
The Vertebrate Genome Annotation (VEGA) database (http://vega.sanger.ac.uk), initially designed as a community resource for browsing manual annotation of the human genome project, now contains five reference genomes (human, mouse, zebrafish, pig and rat). Its introduction pages have been redesigned to enable the user to easily navigate between whole genomes and smaller multi-species haplotypic regions of interest such as the major histocompatibility complex. The VEGA browser is unique in that annotation is updated via the Human And Vertebrate Analysis aNd Annotation (HAVANA) update track every 2 weeks, allowing single gene updates to be made publicly available to the research community quickly. The user can now access different haplotypic subregions more easily, such as those from the non-obese diabetic mouse, and display them in a more intuitive way using the comparative tools. We also highlight how the user can browse manually annotated updated patches from the Genome Reference Consortium (GRC).
Affiliation(s)
- Jennifer L Harrow
  - Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1HH, UK
12
Both positive and negative selection pressures contribute to the polymorphism pattern of the duplicated human CYP21A2 gene. PLoS One 2013; 8:e81977. [PMID: 24312389 PMCID: PMC3843699 DOI: 10.1371/journal.pone.0081977] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 06/23/2013] [Accepted: 10/20/2013] [Indexed: 11/19/2022]
Abstract
The human steroid 21-hydroxylase gene (CYP21A2) participates in cortisol and aldosterone biosynthesis, and resides together with its paralogous (duplicated) pseudogene in a multiallelic copy number variation (CNV), called RCCX CNV. Concerted evolution caused by non-allelic gene conversion has been described in great ape CYP21 genes, and the same conversion activity is responsible for a serious genetic disorder of CYP21A2, congenital adrenal hyperplasia (CAH). In the current study, 33 CYP21A2 haplotype variants encoding 6 protein variants were determined from a European population. CYP21A2 was shown to be one of the most diverse human genes (HHe=0.949), but the diversity of intron 2 was greater still. Contrary to previous findings, the evolution of intron 2 did not follow concerted evolution, although the remaining part of the gene did. Fixed sites (different fixed alleles of sites in human CYP21 paralogues) significantly accumulated in intron 2, indicating that the excess of fixed sites was connected to the lack of effective non-allelic conversion and concerted evolution. Furthermore, positive selection was presumably focused on intron 2, and possibly associated with the previous genetic features. However, the positive selection detected by several neutrality tests was discerned along the whole gene. In addition, the clear signature of negative selection was observed in the coding sequence. The maintenance of the CYP21 enzyme function is critical, and could lead to negative selection, whereas the presumed gene regulation altering steroid hormone levels via intron 2 might help fast adaptation, which broadly characterizes the genes of human CNVs responding to the environment.