1
|
Sirupurapu V, Safonova Y, Pevzner P. Gene prediction in the immunoglobulin loci. Genome Res 2022; 32:1152-1169. [PMID: 35545447 DOI: 10.1101/gr.276676.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 05/06/2022] [Indexed: 11/25/2022]
Abstract
The V(D)J recombination process rearranges the variable (V), diversity (D), and joining (J) genes in the immunoglobulin loci to generate antibody repertoires. Annotation of these loci across various species and predicting the V, D, and J genes (IG genes) is critical for studies of the adaptive immune system. However, since the standard gene finding algorithms are not suitable for predicting IG genes, they have been semi-manually annotated in very few species. We developed the IGDetective algorithm for predicting IG genes and applied it to species with the assembled IG loci. IGDetective generated the first large collection of IG genes across many species and enabled their evolutionary analysis, including the analysis of the "bat IG diversity" hypothesis. This analysis revealed extremely conserved V genes in evolutionary distant species indicating that these genes may be subjected to the same selective pressure, e.g., pressure driven by common pathogens. IGDetective also revealed extremely diverged V genes and a new family of evolutionary conserved V genes in bats with unusual noncanonical cysteines. Moreover, in difference from all other previously reported antibodies, these cysteines are located within complementarity-determining regions. Since cysteines form disulfide bonds, we hypothesize that these cysteine-rich V genes might generate antibodies with noncanonical conformations and could potentially form a unique part of the immune repertoire in bats. We also analyzed the diversity landscape of the recombination signal sequences and revealed their features that trigger the high/low usage of the IG genes.
Collapse
|
2
|
Di Y, Cai S, Zheng S, Huang J, Du L, Song Y, Zhang M, Wang Z, Yu G, Ren L, Han H, Zhao Y. Reshaping the murine immunoglobulin heavy chain repertoire with bovine DH genes. Immunology 2021; 165:74-87. [PMID: 34428313 DOI: 10.1111/imm.13407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 08/04/2021] [Accepted: 08/19/2021] [Indexed: 11/30/2022] Open
Abstract
Having a limited number of VH segments, cattle rely on uniquely long DH gene segments to generate CDRH3 length variation (3-70 aa) far greater than that in humans or mice. Bovine antibodies with ultralong CDRH3s (>50 aa) possess unusual structures and abilities to bind to special antigens. In this study, we replaced most murine endogenous DH segments with bovine DH genes, generating a mouse line termed B-DH. The use of bovine DH genes significantly increased the length variation of CDRH3 and consequently the Ig heavy chain repertoire in B-DH mice. However, no ultralong CDRH3 was observed in B-DH mice, suggesting that other factors, in addition to long DH genes, are also involved in the formation of ultralong CDRH3. The B-DH mice mounted a normal humoral immune response to various antigens, although the B-cell developmental paradigm was obviously altered compared with wild-type mice. Additionally, B-DH mice are not predisposed to the generation of autoantibodies despite the interspecies DH gene replacement. The B-DH mice reported in this study provide a unique model to answer basic questions regarding the synergistic evolution of DH and VH genes, VDJ recombination and BCR selection in B-cell development.
Collapse
Affiliation(s)
- Yu Di
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Shuyi Cai
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Shunan Zheng
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Jinwei Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Lijuan Du
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Yu Song
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Ming Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Zhao Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Guotao Yu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Liming Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Haitang Han
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| | - Yaofeng Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| |
Collapse
|
3
|
Luo S, Yu JA, Li H, Song YS. Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans. Life Sci Alliance 2019; 2:2/2/e201800221. [PMID: 30808649 PMCID: PMC6391684 DOI: 10.26508/lsa.201800221] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 02/14/2019] [Accepted: 02/14/2019] [Indexed: 12/31/2022] Open
Abstract
This article presents a comprehensive study of the IGHV and TRBV gene families in a globally diverse sample of humans and shows that the two gene families exhibit starkly different patterns of variation. The immunoglobulin heavy variable (IGHV) and T cell beta variable (TRBV) loci are among the most complex and variable regions in the human genome. Generated through a process of gene duplication/deletion and diversification, these loci can vary extensively between individuals in copy number and contain genes that are highly similar, making their analysis technically challenging. Here, we present a comprehensive study of the functional gene segments in the IGHV and TRBV loci, quantifying their copy number and single-nucleotide variation in a globally diverse sample of 109 (IGHV) and 286 (TRBV) humans from over a 100 populations. We find that the IGHV and TRBV gene families exhibit starkly different patterns of variation. In addition to providing insight into the different evolutionary paths of the IGHV and TRBV loci, our results are also important to the adaptive immune repertoire sequencing community, where the lack of frequencies of common alleles and copy number variants is hampering existing analytical pipelines.
Collapse
Affiliation(s)
- Shishi Luo
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA.,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
| | - Jane A Yu
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA
| | - Heng Li
- Department of Biostatistics, Harvard Medical School, Boston, MA, USA
| | - Yun S Song
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA .,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
4
|
Watson CT, Glanville J, Marasco WA. The Individual and Population Genetics of Antibody Immunity. Trends Immunol 2017; 38:459-470. [PMID: 28539189 PMCID: PMC5656258 DOI: 10.1016/j.it.2017.04.003] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Revised: 04/06/2017] [Accepted: 04/10/2017] [Indexed: 12/12/2022]
Abstract
Antibodies (Abs) produced by immunoglobulin (IG) genes are the most diverse proteins expressed in humans. While part of this diversity is generated by recombination during B-cell development and mutations during affinity maturation, the germ-line IG loci are also diverse across human populations and ethnicities. Recently, proof-of-concept studies have demonstrated genotype–phenotype correlations between specific IG germ-line variants and the quality of Ab responses during vaccination and disease. However, the functional consequences of IG genetic variation in Ab function and immunological outcomes remain underexplored. In this opinion article, we outline interconnections between IG genomic diversity and Ab-expressed repertoires and structure. We further propose a strategy for integrating IG genotyping with functional Ab profiling data as a means to better predict and optimize humoral responses in genetically diverse human populations, with immediate implications for personalized medicine. Genetic variation in human populations affects how individuals are able to mount functional antibody responses. Different alleles can encode convergent binding motifs that result in successful Ab responses against specific infections and vaccinations. Given the complexity of the IG loci and the diversity of the antibody repertoire, links between IG polymorphism and antibody repertoire variability have not been thoroughly explored. We present a strategy to mine genotype–repertoire–disease associations.
Collapse
Affiliation(s)
- Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA.
| | - Jacob Glanville
- Institute for Immunity, Transplantation and Infection, and Computational and Systems Immunology, Stanford University School of Medicine, Stanford, CA, USA.
| | - Wayne A Marasco
- Department of Cancer Immunology and Virology, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
5
|
Watson CT, Matsen FA, Jackson KJL, Bashir A, Smith ML, Glanville J, Breden F, Kleinstein SH, Collins AM, Busse CE. Comment on “A Database of Human Immune Receptor Alleles Recovered from Population Sequencing Data”. THE JOURNAL OF IMMUNOLOGY 2017; 198:3371-3373. [DOI: 10.4049/jimmunol.1700306] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
6
|
Luo S, Yu JA, Song YS. Estimating Copy Number and Allelic Variation at the Immunoglobulin Heavy Chain Locus Using Short Reads. PLoS Comput Biol 2016; 12:e1005117. [PMID: 27632220 PMCID: PMC5025152 DOI: 10.1371/journal.pcbi.1005117] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 08/23/2016] [Indexed: 11/28/2022] Open
Abstract
The study of genomic regions that contain gene copies and structural variation is a major challenge in modern genomics. Unlike variation involving single nucleotide changes, data on the variation of copy number is difficult to collect and few tools exist for analyzing the variation between individuals. The immunoglobulin heavy variable (IGHV) locus, which plays an integral role in the adaptive immune response, is an example of a complex genomic region that varies in gene copy number. Lack of standard methods to genotype this region prevents it from being included in association studies and is holding back the growing field of antibody repertoire analysis. Here we develop a method that takes short reads from high-throughput sequencing and outputs a genetic profile of the IGHV locus with the read coverage depth and a putative nucleotide sequence for each operationally defined gene cluster. Our operationally defined gene clusters aim to address a major challenge in studying the IGHV locus: the high sequence similarity between gene segments in different genomic locations. Tests on simulated data demonstrate that our approach can accurately determine the presence or absence of a gene cluster from reads as short as 70 bp. More detailed resolution on the copy number of gene clusters can be obtained from read coverage depth using longer reads (e.g., ≥ 100 bp). Detail at the nucleotide resolution of single copy genes (genes present in one copy per haplotype) can be determined with 250 bp reads. For IGHV genes with more than one copy, accurate nucleotide-resolution reconstruction is currently beyond the means of our approach. When applied to a family of European ancestry, our pipeline outputs genotypes that are consistent with the family pedigree, confirms existing multigene variants and suggests new copy number variants. This study paves the way for analyzing population-level patterns of variation in IGHV gene clusters in larger diverse datasets and for quantitatively handling regions of copy number variation in other structurally varying and complex loci.
Collapse
Affiliation(s)
- Shishi Luo
- Computer Science Division, University of California, Berkeley, Berkeley, California, United States of America
- Department of Statistics, University of California, Berkeley, Berkeley, California, United States of America
| | - Jane A. Yu
- Computer Science Division, University of California, Berkeley, Berkeley, California, United States of America
| | - Yun S. Song
- Computer Science Division, University of California, Berkeley, Berkeley, California, United States of America
- Department of Statistics, University of California, Berkeley, Berkeley, California, United States of America
- Departments of Mathematics and Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
7
|
Steele EJ, Lloyd SS. Soma-to-germline feedback is implied by the extreme polymorphism at IGHV relative to MHC: The manifest polymorphism of the MHC appears greatly exceeded at Immunoglobulin loci, suggesting antigen-selected somatic V mutants penetrate Weismann's Barrier. Bioessays 2015; 37:557-69. [PMID: 25810320 DOI: 10.1002/bies.201400213] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Revised: 02/15/2015] [Accepted: 02/24/2015] [Indexed: 01/22/2023]
Abstract
Soma-to-germline feedback is forbidden under the neo-Darwinian paradigm. Nevertheless, there is a growing realization it occurs frequently in immunoglobulin (Ig) variable (V) region genes. This is a surprising development. It arises from a most unlikely source in light of the exposure of co-author EJS to the haplotype data of RL Dawkins and others on the polymorphism of the Major Histocompatibility Complex, which is generally assumed to be the most polymorphic region in the genome (spanning ∼4 Mb). The comparison between the magnitude of MHC polymorphism with estimates for the human heavy chain immunoglobulin V locus (spanning ∼1 Mb), suggests IGHV could be many orders of magnitude more polymorphic than the MHC. This conclusion needs airing in the literature as it implies generational churn and soma-to-germline gene feedback. Pedigree-based experimental strategies to resolve the IGHV issue are outlined.
Collapse
Affiliation(s)
- Edward J Steele
- C.Y. O'Connor ERADE Village Foundation, Piara Waters, WA, Australia
| | | |
Collapse
|
8
|
Paciello G, Acquaviva A, Pighi C, Ferrarini A, Macii E, Zamo’ A, Ficarra E. VDJSeq-Solver: in silico V(D)J recombination detection tool. PLoS One 2015; 10:e0118192. [PMID: 25799103 PMCID: PMC4370828 DOI: 10.1371/journal.pone.0118192] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 01/05/2015] [Indexed: 11/29/2022] Open
Abstract
In this paper we present VDJSeq-Solver, a methodology and tool to identify clonal lymphocyte populations from paired-end RNA Sequencing reads derived from the sequencing of mRNA neoplastic cells. The tool detects the main clone that characterises the tissue of interest by recognizing the most abundant V(D)J rearrangement among the existing ones in the sample under study. The exact sequence of the clone identified is capable of accounting for the modifications introduced by the enzymatic processes. The proposed tool overcomes limitations of currently available lymphocyte rearrangements recognition methods, working on a single sequence at a time, that are not applicable to high-throughput sequencing data. In this work, VDJSeq-Solver has been applied to correctly detect the main clone and identify its sequence on five Mantle Cell Lymphoma samples; then the tool has been tested on twelve Diffuse Large B-Cell Lymphoma samples. In order to comply with the privacy, ethics and intellectual property policies of the University Hospital and the University of Verona, data is available upon request to supporto.utenti@ateneo.univr.it after signing a mandatory Materials Transfer Agreement. VDJSeq-Solver JAVA/Perl/Bash software implementation is free and available at http://eda.polito.it/VDJSeq-Solver/.
Collapse
Affiliation(s)
- Giulia Paciello
- Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy
- * E-mail:
| | - Andrea Acquaviva
- Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy
| | - Chiara Pighi
- Department of Pathology and Diagnostics, University of Verona, Verona, Italy
- Department of Pathology, Children Hospital Boston, Harvard Medical School, Boston, USA
| | | | - Enrico Macii
- Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy
| | - Alberto Zamo’
- Department of Pathology and Diagnostics, University of Verona, Verona, Italy
| | - Elisa Ficarra
- Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy
| |
Collapse
|
9
|
Abstract
Pseudogenes are very common in the genomes of a wide range of organisms and, although they were originally considered as genetic junk, now several functions have been attributed to them. One important function of pseudogenes, as discussed in this chapter, is to provide material for genetic diversity. This is most prominent in the case of immunological recognition molecules such as immunoglobulins and B- and T-cell receptors, as well as in the case of antigenic variation in intracellular pathogens. Other examples discussed are olfactory receptors, ribosomal proteins, cytochrome P450s, and pseudokinases.
Collapse
|
10
|
Niku M, Liljavirta J, Durkin K, Schroderus E, Iivanainen A. The bovine genomic DNA sequence data reveal three IGHV subgroups, only one of which is functionally expressed. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2012; 37:457-61. [PMID: 22369780 DOI: 10.1016/j.dci.2012.02.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2011] [Revised: 02/13/2012] [Accepted: 02/16/2012] [Indexed: 05/05/2023]
Abstract
A comprehensive analysis of cattle shotgun sequencing data reveals 36 immunoglobulin heavy chain variable genes. The previously described bovine subgroup IGHV1 contains 10 functional genes with a conserved promoter including the consensus octamer and several other transcription factor binding sites, intact exons and matching cDNA sequences. Subgroups IGHV2 and IGHV3 consist entirely of pseudogenes. Thus, the bovine germline IGHV repertoire is very limited. The IGHV genes are distributed in mammalian clans I and II, while no clan III genes were detected. Clan-specific PCR of genomic DNA from cattle, sheep, Eurasian elk, white-tailed deer, pig and dolphin indicates highly dynamic evolution of IGHV gene usage within Cetartiodactyla. The bovine germline IGHV repertoire was probably generated by recent duplications of an IGHV1-IGHV2 homology unit. Immunoglobulin heavy chain genes are largely incorrectly assembled in the current cattle genome versions Btau_4.2 and UMD_3.1. FISH experiments confirm an IGHV locus close to terminus of BTA21.
Collapse
Affiliation(s)
- Mikael Niku
- Department of Veterinary Biosciences, University of Helsinki, PO Box 66, FI-00014 University of Helsinki, Finland
| | | | | | | | | |
Collapse
|
11
|
Differential reactivity of germ line allelic variants of a broadly neutralizing HIV-1 antibody to a gp41 fusion intermediate conformation. J Virol 2011; 85:11725-31. [PMID: 21917975 DOI: 10.1128/jvi.05680-11] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Genetic factors, as well as antigenic stimuli, can influence antibody repertoire formation. Moreover, the affinity of antigen for unmutated naïve B cell receptors determines the threshold for activation of germinal center antibody responses. The gp41 2F5 broadly neutralizing antibody (bNAb) uses the V(H)2-5 gene, which has 10 distinct alleles that use either a heavy-chain complementarity-determining region 2 (HCDR2) aspartic acid (D(H54)) or an HCDR2 asparagine (N(H54)) residue. The 2F5 HCDR2 D(H54) residue has been shown to form a salt bridge with gp41 (665)K; the V(H)2-5 germ line allele variant containing N(H54) cannot do so and thus should bind less avidly to gp41. Thus, the induction of 2F5 bNAb is dependent on both genetic and structural factors that could affect antigen affinity of unmutated naïve B cell receptors. Here, we studied allelic variants of the V(H)2-5 inferred germ line forms of the HIV-1 gp41 bNAb 2F5 for their antigen binding affinities to gp41 linear peptide and conformational protein antigens. Both V(H)2-5 2F5 inferred germ line variants bound to gp41 peptides and protein, including the fusion intermediate protein mimic, although more weakly than the mature 2F5 antibody. As predicted, the affinity of the N(H54) variant for fusion-intermediate conformation was an order of magnitude lower than that of the D(H54) V(H)2-5 germ line antibody, demonstrating that allelic variants of 2F5 germ line antibodies differentially bind to gp41. Thus, these data demonstrate a genetically determined trait that may affect host responses to HIV-1 envelope epitopes recognized by broadly neutralizing antibodies and has implications for unmutated ancestor-based immunogen design.
Collapse
|
12
|
Genomic organization, molecular diversification, and evolution of antimicrobial peptide myticin-C genes in the mussel (Mytilus galloprovincialis). PLoS One 2011; 6:e24041. [PMID: 21904604 PMCID: PMC3164099 DOI: 10.1371/journal.pone.0024041] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2011] [Accepted: 08/02/2011] [Indexed: 12/21/2022] Open
Abstract
Myticin-C is a highly variable antimicrobial peptide associated to immune response in Mediterranean mussel (Mytilus galloprovincialis). In this study, we tried to ascertain the genetic organization and the mechanisms underlying myticin-C variation and evolution of this gene family. We took advantage of the large intron size variation to find out the number of myticin-C genes. Using fragment analysis a maximum of four alleles was detected per individual at both introns in a large mussel sample suggesting a minimum of two myticin-C genes. The transmission pattern of size variants in two full-sib families was also used to ascertain the number of myticin-C genes underlying the variability observed. Results in both families were in accordance with two myticin-C genes organized in tandem. A more detailed analysis of myticin-C variation was carried out by sequencing a large sample of complementary (cDNA) and genomic DNA (gDNA) in 10 individuals. Two basic sequences were detected at most individuals and several sequences were constituted by combination of two different basic sequences, strongly suggesting somatic recombination or gene conversion. Slight within-basic sequence variation detected in all individuals was attributed to somatic mutation. Such mutations were more frequently at the C-terminal domain and mostly determined non-synonymous substitutions. The mature peptide domain showed the highest variation both in the whole cDNA and in the basic-sequence samples, which is in accordance with the pathogen recognition function associated to this domain. Although most tests suggested neutrality for myticin-C variation, evidence indicated positive selection in the mature peptide and C-terminal region. Three main highly supported clusters were observed when reconstructing phylogeny on basic sequences, meiotic recombination playing a relevant role on myticin-C evolution. This study demonstrates that mechanisms to generate molecular variation similar to that observed in vertebrates are also operating in molluscs.
Collapse
|
13
|
Jackson KJL, Wang Y, Gaeta BA, Pomat W, Siba P, Rimmer J, Sewell WA, Collins AM. Divergent human populations show extensive shared IGK rearrangements in peripheral blood B cells. Immunogenetics 2011; 64:3-14. [PMID: 21789596 DOI: 10.1007/s00251-011-0559-z] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Accepted: 07/12/2011] [Indexed: 11/28/2022]
Abstract
We have analysed the transcribed immunoglobulin kappa (IGK) repertoire of peripheral blood B cells from four individuals from two genetically distinct populations, Papua New Guinean and Australian, using high-throughput DNA sequencing. The depth of sequencing data for each individual averaged 5,548 high-quality IGK reads, and permitted genotyping of the inferred IGKV and IGKJ germline gene segments for each individual. All individuals were homozygous at each IGKJ locus and had highly similar inferred IGKV genotypes. Preferential gene usage was seen at both the IGKV and IGKJ loci, but only IGKV segment usage varied significantly between individuals. Despite the differences in IGKV gene utilisation, the rearranged IGK repertoires showed extensive identity at the amino acid level. Public rearrangements (those shared by two or more individuals) made up 60.2% of the total sequenced IGK rearrangements. The total diversity of IGK rearrangements of each individual was estimated to range from just 340 to 549 unique amino acid sequences. Thus, the repertoire of unique expressed IGK rearrangements is dramatically less than previous theoretical estimates of IGK diversity, and the majority of expressed IGK rearrangements are likely to be extensively shared in individual human beings.
Collapse
|
14
|
Pramanik S, Cui X, Wang HY, Chimge NO, Hu G, Shen L, Gao R, Li H. Segmental duplication as one of the driving forces underlying the diversity of the human immunoglobulin heavy chain variable gene region. BMC Genomics 2011; 12:78. [PMID: 21272357 PMCID: PMC3042411 DOI: 10.1186/1471-2164-12-78] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2010] [Accepted: 01/27/2011] [Indexed: 11/10/2022] Open
Abstract
Background Segmental duplication and deletion were implicated for a region containing the human immunoglobulin heavy chain variable (IGHV) gene segments, 1.9III/hv3005 (possible allelic variants of IGHV3-30) and hv3019b9 (a possible allelic variant of IGHV3-33). However, very little is known about the ranges of the duplication and the polymorphic region. This is mainly because of the difficulty associated with distinguishing between allelic and paralogous sequences in the IGHV region containing extensive repetitive sequences. Inability to separate the two parental haploid genomes in the subjects is another serious barrier. To address these issues, unique DNA sequence tags evenly distributed within and flanking the duplicated region implicated by the previous studies were selected. The selected tags in single sperm from six unrelated healthy donors were amplified by multiplex PCR followed by microarray detection. In this way, individual haplotypes of different parental origins in the sperm donors could be analyzed separately and precisely. The identified polymorphic region was further analyzed at the nucleotide sequence level using sequences from the three human genomic sequence assemblies in the database. Results A large polymorphic region was identified using the selected sequence tags. Four of the 12 haplotypes were shown to contain consecutively undetectable tags spanning in a variable range. Detailed analysis of sequences from the genomic sequence assemblies revealed two large duplicate sequence blocks of 24,696 bp and 24,387 bp, respectively, and an incomplete copy of 961 bp in this region. It contains up to 13 IGHV gene segments depending on haplotypes. A polymorphic region was found to be located within the duplicated blocks. The variants of this polymorphism unusually diverged at the nucleotide sequence level and in IGHV gene segment number, composition and organization, indicating a limited selection pressure in general. However, the divergence level within the gene segments is significantly different from that in the intergenic regions indicating that these regions may have been subject to different selection pressures and that the IGHV gene segments in this region are functionally important. Conclusions Non-reciprocal genetic rearrangements associated with large duplicate sequence blocks could substantially contribute to the IGHV region diversity. Since the resulting polymorphisms may affect the number, composition and organization of the gene segments in this region, it may have significant impact on the function of the IGHV gene segment repertoire, antibody diversity, and therefore, the immune system. Because one of the gene segments, 3-30 (1.9III), is associated with autoimmune diseases, it could be of diagnostic significance to learn about the variants in the haplotypes by using the multiplex haplotype analysis system used in the present study with DNA sequence tags specific for the variants of all gene segments in this region.
Collapse
Affiliation(s)
- Sreemanta Pramanik
- Department of Molecular Genetics, Microbiology, and Immunology, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, NJ 08854, USA
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Boyd SD, Gaëta BA, Jackson KJ, Fire AZ, Marshall EL, Merker JD, Maniar JM, Zhang LN, Sahaf B, Jones CD, Simen BB, Hanczaruk B, Nguyen KD, Nadeau KC, Egholm M, Miklos DB, Zehnder JL, Collins AM. Individual variation in the germline Ig gene repertoire inferred from variable region gene rearrangements. THE JOURNAL OF IMMUNOLOGY 2010; 184:6986-92. [PMID: 20495067 DOI: 10.4049/jimmunol.1000445] [Citation(s) in RCA: 195] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Individual variation in the Ig germline gene repertoire leads to individual differences in the combinatorial diversity of the Ab repertoire, but the study of such variation has been problematic. The application of high-throughput DNA sequencing to the study of rearranged Ig genes now makes this possible. The sequencing of thousands of VDJ rearrangements from an individual, either from genomic DNA or expressed mRNA, should allow their germline IGHV, IGHD, and IGHJ repertoires to be inferred. In addition, where previously mere glimpses of diversity could be gained from sequencing studies, new large data sets should allow the rearrangement frequency of different genes and alleles to be seen with clarity. We analyzed the DNA of 108,210 human IgH chain rearrangements from 12 individuals and determined their individual IGH genotypes. The number of reportedly functional IGHV genes and allelic variants ranged from 45 to 60, principally because of variable levels of gene heterozygosity, and included 14 previously unreported IGHV polymorphisms. New polymorphisms of the IGHD3-16 and IGHJ6 genes were also seen. At heterozygous loci, remarkably different rearrangement frequencies were seen for the various IGHV alleles, and these frequencies were consistent between individuals. The specific alleles that make up an individual's Ig genotype may therefore be critical in shaping the combinatorial repertoire. The extent of genotypic variation between individuals is highlighted by an individual with aplastic anemia who appears to lack six contiguous IGHD genes on both chromosomes. These deletions significantly alter the potential expressed IGH repertoire, and possibly immune function, in this individual.
Collapse
Affiliation(s)
- Scott D Boyd
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Gaëta BA, Malming HR, Jackson KJL, Bain ME, Wilson P, Collins AM. iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences. Bioinformatics 2007; 23:1580-7. [PMID: 17463026 DOI: 10.1093/bioinformatics/btm147] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Immunoglobulin heavy chain (IGH) genes in mature B lymphocytes are the result of recombination of IGHV, IGHD and IGHJ germline genes, followed by somatic mutation. The correct identification of the germline genes that make up a variable VH domain is essential to our understanding of the process of antibody diversity generation as well as to clinical investigations of some leukaemias and lymphomas. RESULTS We have developed iHMMune-align, an alignment program that uses a hidden Markov model (HMM) to model the processes involved in human IGH gene rearrangement and maturation. The performance of iHMMune-align was compared to that of other immunoglobulin gene alignment utilities using both clonally related and randomly selected IGH sequences. This evaluation suggests that iHMMune-align provides a more accurate identification of component germline genes than other currently available IGH gene characterization programs. AVAILABILITY iHMMune-align cross-platform Java executable and web interface are freely available to academic users and can be accessed at http://www.emi.unsw.edu.au/~ihmmune/.
Collapse
Affiliation(s)
- Bruno A Gaëta
- School of Biotechnology and Biomolecular Sciences, The University of New South Wales, Sydney, NSW 2052, Australia.
| | | | | | | | | | | |
Collapse
|
17
|
Chimge NO, Pramanik S, Hu G, Lin Y, Gao R, Shen L, Li H. Determination of gene organization in the human IGHV region on single chromosomes. Genes Immun 2005; 6:186-93. [PMID: 15744329 DOI: 10.1038/sj.gene.6364176] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Organization of the IGHV genes (n=108) on single human chromosomes has been determined by detecting these sequences in single sperm using multiplex PCR amplification followed by microarray detection. A total of 374 single sperm samples from five Caucasian males were studied. Three deletion/insertion polymorphisms (Del I-Del III) with deletion allele frequencies ranging from 0.1 to 0.3 were identified. Del I is a previously reported polymorphism affecting three IGHV genes (IGHV1-8, IGHV3-9, and IGHV2-10). Del II affects a region 2-18 kb containing two pseudogenes IGHV(II)-28.1 and IGHV3-29, and Del III spans approximately 21-53 kb involving genes IGHV4-39, IGHV7-40, IGHV(II)-40-1, and IGHV3-41. Deletion alleles of both Dels II and III were found in a heterozygous state, and therefore, could not be easily detected if haploid samples were not used in the study. Results of the present study indicate that deletions/insertions together with other possible chromosomal rearrangements may play an important role in forming the genetic structure of the IGHV region, and may significantly contribute to antibody diversity. Since these three polymorphisms are located within or next to the 3' half of the IGHV region, they may have an important role in the expressed IGHV gene repertoire during immune response.
Collapse
Affiliation(s)
- N-O Chimge
- Department of Molecular Genetics, Microbiology and Immunology, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, New Brunswick, NJ, USA
| | | | | | | | | | | | | |
Collapse
|
18
|
Romo-González T, Morales-Montor J, Rodríguez-Dorantes M, Vargas-Madrazo E. Novel Substitution Polymorphisms of Human Immunoglobulin VH Genes in Mexicans. Hum Immunol 2005; 66:732-40. [PMID: 15993719 DOI: 10.1016/j.humimm.2005.03.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2005] [Revised: 02/26/2005] [Accepted: 03/01/2005] [Indexed: 11/18/2022]
Abstract
It has been proposed that the defense and recognition functions of the immune system, especially those mediated by antibodies, require a great diversity of receptors. Nonetheless, functional and structural evidence has demonstrated the presence of restrictions, both in the use of the repertoire and in the recognition of antigens. Fifty-one functional genes have been described in the IghV locus; however, there is a variety of evidences indicating that only a small fraction of the immunoglobulin genes plays a central role in determining the fundamental properties of the antibody repertoire of the immune system. On the basis of this functional and structural information, we selected four IghV genes and characterized their polymorphism in a sample of Mexican individuals. We also analyzed the implications for the recognition mechanism of the substitutions found in the sequenced alleles. We found that diversification through allelism varies from segment to segment, both in the amount of alleles encountered and in the nature and distribution of mutations in the codifying zone, which might depend on its importance for the repertoire. Such functional characteristics may be useful in the interpretation of differential gene usage in certain physiological, ontological, and/or pathological conditions.
Collapse
Affiliation(s)
- Tania Romo-González
- Departamento de Biología Sistémica, Instituto de Investigaciones Biológicas, Universidad Veracruzana, Xalapa, Veracruz, México.
| | | | | | | |
Collapse
|
19
|
McLean GR, Olsen OA, Watt IN, Rathanaswami P, Leslie KB, Babcook JS, Schrader JW. Recognition of human cytomegalovirus by human primary immunoglobulins identifies an innate foundation to an adaptive immune response. THE JOURNAL OF IMMUNOLOGY 2005; 174:4768-78. [PMID: 15814702 DOI: 10.4049/jimmunol.174.8.4768] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Most primates, including humans, are chronically infected with cospecifically evolved, potentially pathogenic CMV. Abs that bind a 10-aa linear epitope (antigenic determinant 2 site 1) within the extracellular domain of human CMV glycoprotein B neutralize viral infectivity. In this study, we show that genes generated by recombinations involving two well-conserved human germline V elements (IGHV3-30 and IGKV3-11), and IGHJ4, encode primary Ig molecules that bind glycoprotein B at this key epitope. These particular V(H), J(H), and V(kappa) genes enable humans to generate through recombination and N nucleotide addition, a useful frequency of primary Igs that efficiently target this critical site on human CMV and thus confer an innate foundation for a specific adaptive response to this pathogen.
Collapse
Affiliation(s)
- Gary R McLean
- The Biomedical Research Centre, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | | | | | | | | | |
Collapse
|
20
|
Romo-González T, Vargas-Madrazo E. Structural analysis of substitution patterns in alleles of human immunoglobulin VH genes. Mol Immunol 2004; 42:1085-97. [PMID: 15829298 DOI: 10.1016/j.molimm.2004.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Indexed: 10/26/2022]
Abstract
The diversity in repertoires of antibodies (Abs) needed in response to the antigen challenge is produced by evolutionary and somatic processes. The mechanisms operating at a somatic level have been studied in great detail. In contrast, neither the mechanisms nor the strategies of diversification at an evolutionary level have yet been understood in similar detail. Particularly, the substitution patterns in alleles of immunoglobulin genes (Igs) have not been systematically studied. Furthermore, there is a scarcity of studies which link the analysis at a genetic level of the diversification of repertoires with the structural consequences at the protein level of the changes in DNA information. For the purpose of systematically characterizing the strategies of evolutionary diversification through sequence variation at alleles, in this work, we built a database for all the alleles of the IGHV locus in humans reported until now. Based on these data, we performed diverse analyses of substitution patterns and linked these results with studies at the protein level. We found that the sequence diversification in different alleles does not operate with equal intensity for all V genes. Our studies, both of the number of substitutions and of the type of amino acid change per sub-segment of the V-REGION evidenced differences in the selective pressure to which these regions are exposed. The implications of these results for understanding the evolutionary diversification strategies, as well as for the somatic generation of antibody repertoires are discussed.
Collapse
Affiliation(s)
- Tania Romo-González
- Departamento Biología Sistémica, Instituto de Investigaciones Biológicas, Universidad Veracruzana, Morelos 9 La Pitaya, Zoncuantla, 91500 Coatepec, Xalapa, Veracruz, Mexico
| | | |
Collapse
|
21
|
Collins AM, Sewell WA, Edwards MR. Immunoglobulin gene rearrangement, repertoire diversity, and the allergic response. Pharmacol Ther 2003; 100:157-70. [PMID: 14609718 DOI: 10.1016/j.pharmthera.2003.07.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The immunoglobulin repertoire arises as a consequence of combinatorial diversity, junctional diversity, and the process of somatic point mutation. Each of these processes involves biases that limit and shape the available immunoglobulin repertoire. The expressed repertoire is further shaped by selection, to the extent that biased gene usage can become apparent in many disease states. The study of rearranged immunoglobulin genes therefore may not only provide insights into the molecular processes involved in the generation of antibody diversity but also inform us of pathogenic processes and perhaps identify particular lymphocyte clones as therapeutic targets. Partly as a consequence of the low numbers of circulating IgE-committed B-cells, studies of rearranged IgE genes in allergic individuals have commenced relatively recently. In this review, recent advances in our understanding of the processes of immunoglobulin gene rearrangement and somatic point mutation are described, and biases inherent to these processes are discussed. The evidence that some diseases may be associated with particular gene rearrangements is then considered, with a particular focus on allergic disease. Reviewed data suggest that an important contribution to the IgE response may come from cells that use relatively rare heavy chain V (V(H)) segment genes, which display little somatic point mutation. Some IgE antibodies also seem to display polyreactive binding. In other contexts, these 3 characteristics have been associated with antibodies of the B-1 B-cell subset, and the possibility that B-1 B-cells contribute to the allergic response is therefore considered.
Collapse
Affiliation(s)
- A M Collins
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, New South Wales, Sydney, Australia.
| | | | | |
Collapse
|
22
|
Liu L, Lucas AH. IGH V3-23*01 and its allele V3-23*03 differ in their capacity to form the canonical human antibody combining site specific for the capsular polysaccharide of Haemophilus influenzae type b. Immunogenetics 2003; 55:336-8. [PMID: 12845501 DOI: 10.1007/s00251-003-0583-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2003] [Revised: 05/28/2003] [Indexed: 10/26/2022]
Abstract
The IGH V3-23*01 gene is used in the formation of the canonical combining site which dominates the human antibody repertoire to the Haemophilus influenzae type b (Hib) polysaccharide (PS). An allele of the human IGH V3-23*01 gene, known as V3-23*03, differs from V3-23*01 in nine bases, eight of which are located in the second complementarity determining region. These eight differences encode five amino acid substitutions. In this study we investigated whether the V3-23*03 sequence polymorphism affected Hib PS binding. We constructed two Fab fragments that had the canonical Hib PS combining site VH-VL configuration but that had either V3-23*01 or V3-23*03. Radioantigen binding assay showed that on a concentration basis the V3-23*03 Fab was 20-fold more effective in binding Hib PS than the V3-23*01 Fab. The V3-23*03 Fab was 4-fold more effective than the V3-23*01 Fab in mediating facilitated bactericidal activity against Hib organisms. These findings identify a functional consequence of V3-23 allelism, and suggest that utilization of the V3-23*03 gene in the human Hib PS repertoire would generate canonical antibodies with higher affinity and protective efficacy than canonical antibodies utilizing V3-23*01. Thus, IGH V gene allelic variation has the potential to impact the generation of protective immunity to Hib.
Collapse
Affiliation(s)
- Leyu Liu
- Children's Hospital Oakland Research Institute, 5700 Martin Luther King Jr. Way, Oakland, CA 94609, USA
| | | |
Collapse
|