1
|
Li H, Xu S, Liu Y, Lu Y, Ning Y. Efficient De Novo Assembly of 100 kb-Scale Human Functional Immunoglobulin Heavy Variable (IGHV) Gene Fragments In Vitro. ACS Synth Biol 2025. [PMID: 40135783 DOI: 10.1021/acssynbio.5c00011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]
Abstract
Synthetic biology provides a powerful approach to functional studies of viral and microbial genomes. However, in vitro, efficient and scarless DNA manipulation on large and complex human genomes remains an inevitable challenge. Here, we de novo design and successfully assemble human functional immunoglobulin heavy variable (IGHV) gene fragments up to hundred-kilobase (kb)-sized, using an iterative in vitro assembly via Escherichia coli (E. coli) based on Gibson isothermal assembly. We describe an efficient method for "scarless" (without leaving any non-native sequences) engineering of the assembled ordered functional IGHV gene fragments, which contain complex and highly repetitive regions. Our method provides a suitable way to construct bacterial artificial chromosomes (BACs) (30-100 kb) with common materials, easy manipulations, and low cost. The construction of ordered functional IGHV gene BACs expands the synthetic biologist's chassis repertoire. It is essential for the adaptive immune response and constructing immunity humanized animal models.
Collapse
Affiliation(s)
- Haiqiong Li
- School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Immune Regulation and Immunotherapy, Guangzhou 510515, China
| | - Shuyao Xu
- School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Immune Regulation and Immunotherapy, Guangzhou 510515, China
| | - Yurui Liu
- School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Immune Regulation and Immunotherapy, Guangzhou 510515, China
| | - Yongqi Lu
- School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Immune Regulation and Immunotherapy, Guangzhou 510515, China
| | - Yunshan Ning
- School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Immune Regulation and Immunotherapy, Guangzhou 510515, China
| |
Collapse
|
2
|
Marsden AA, Corcoran M, Karlsson Hedestam G, Garrett N, Karim SSA, Moore PL, Kitchin D, Morris L, Scheepers C. Novel polymorphic and copy number diversity in the antibody IGH locus of South African individuals. Immunogenetics 2024; 77:6. [PMID: 39627383 PMCID: PMC11615098 DOI: 10.1007/s00251-024-01363-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 11/19/2024] [Indexed: 12/06/2024]
Abstract
The heavy chain of an antibody is crucial for mediating antigen binding. IGHV genes, which partially encode the heavy chain of antibodies, exhibit vast genetic diversity largely through polymorphism and copy number variation (CNV). These genetic variations impact population-level expression levels. In this study, we analyzed expressed antibody transcriptomes and matched germline IGHV genes from donors from KwaZulu-Natal, South Africa. Amplicon NGS targeting germline IGHV sequences was performed on genomic DNA from 70 participants, eight of whom had matched datasets of expressed antibody transcriptomes. Germline IGHV sequencing identified 161 unique IGHV alleles, of which 32 were novel. A further 21 novel IGHV alleles were detected in the expressed transcriptomes of these donors. We also examined the datasets for CNV, uncovering gene duplications of 10 IGHV genes from germline sequencing and 33 genes in the expressed transcriptomes. Many of the IGHV gene duplications have not been described in other populations. This study expands our understanding of genetic differences in distinct populations and suggests the potential impact of genetic diversity on immune responses.
Collapse
Affiliation(s)
- Alaine A Marsden
- SA MRC Antibody Immunity Research Unit (AIRU), University of the Witwatersrand, Johannesburg, South Africa
- Centre for HIV and STIs, HIV Virology Section, National Institute for Communicable Diseases (NICD), a Division of the National Health Laboratory Service (NHLS), Johannesburg, South Africa
| | - Martin Corcoran
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | | | - Nigel Garrett
- Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu-Natal, Durban, South Africa
- Discipline of Public Health Medicine, School of Nursing and Public Health, University of KwaZulu-Natal, Durban, South Africa
| | - Salim S Abdool Karim
- Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu-Natal, Durban, South Africa
- Department of Epidemiology, Mailman School of Public Health, Columbia University, Columbia, NY, USA
| | - Penny L Moore
- SA MRC Antibody Immunity Research Unit (AIRU), University of the Witwatersrand, Johannesburg, South Africa
- Centre for HIV and STIs, HIV Virology Section, National Institute for Communicable Diseases (NICD), a Division of the National Health Laboratory Service (NHLS), Johannesburg, South Africa
- Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu-Natal, Durban, South Africa
| | - Dale Kitchin
- SA MRC Antibody Immunity Research Unit (AIRU), University of the Witwatersrand, Johannesburg, South Africa
- Centre for HIV and STIs, HIV Virology Section, National Institute for Communicable Diseases (NICD), a Division of the National Health Laboratory Service (NHLS), Johannesburg, South Africa
| | - Lynn Morris
- SA MRC Antibody Immunity Research Unit (AIRU), University of the Witwatersrand, Johannesburg, South Africa
- Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu-Natal, Durban, South Africa
| | - Cathrine Scheepers
- SA MRC Antibody Immunity Research Unit (AIRU), University of the Witwatersrand, Johannesburg, South Africa.
| |
Collapse
|
3
|
Konstantinovsky T, Peres A, Polak P, Yaari G. An unbiased comparison of immunoglobulin sequence aligners. Brief Bioinform 2024; 25:bbae556. [PMID: 39489605 PMCID: PMC11531861 DOI: 10.1093/bib/bbae556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 09/11/2024] [Accepted: 10/19/2024] [Indexed: 11/05/2024] Open
Abstract
Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is critical for our understanding of the adaptive immune system's dynamics in health and disease. Reliable analysis of AIRR-seq data depends on accurate rearranged immunoglobulin (Ig) sequence alignment. Various Ig sequence aligners exist, but there is no unified benchmarking standard representing the complexities of AIRR-seq data, obscuring objective comparisons of aligners across tasks. Here, we introduce GenAIRR, a modular simulation framework for generating Ig sequences alongside their ground truths. GenAIRR realistically simulates the intricacies of V(D)J recombination, somatic hypermutation, and an array of sequence corruptions. We comprehensively assessed prominent Ig sequence aligners across various metrics, unveiling unique performance characteristics for each aligner. The GenAIRR-produced datasets, combined with the proposed rigorous evaluation criteria, establish a solid basis for unbiased benchmarking of immunogenetics computational tools. It sets up the ground for further improving the crucial task of Ig sequence alignment, ultimately enhancing our understanding of adaptive immunity.
Collapse
Affiliation(s)
- Thomas Konstantinovsky
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - Ayelet Peres
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - Pazit Polak
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - Gur Yaari
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA
| |
Collapse
|
4
|
Lees WD, Saha S, Yaari G, Watson CT. Digger: directed annotation of immunoglobulin and T cell receptor V, D, and J gene sequences and assemblies. Bioinformatics 2024; 40:btae144. [PMID: 38478393 PMCID: PMC10957512 DOI: 10.1093/bioinformatics/btae144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/11/2024] [Accepted: 03/11/2024] [Indexed: 03/23/2024] Open
Abstract
SUMMARY Knowledge of immunoglobulin and T cell receptor encoding genes is derived from high-quality genomic sequencing. High-throughput sequencing is delivering large volumes of data, and precise, high-throughput approaches to annotation are needed. Digger is an automated tool that identifies coding and regulatory regions of these genes, with results comparable to those obtained by current expert curational methods. AVAILABILITY AND IMPLEMENTATION Digger is published under open source license at https://github.com/williamdlees/Digger and is available as a Python package and a Docker container.
Collapse
Affiliation(s)
- William D Lees
- Bioengineering Program, Faculty of Engineering, Bar-Ilan University, Ramat Gan, 5290002, Israel
| | - Swati Saha
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, Kentucky 40292, United States
| | - Gur Yaari
- Bioengineering Program, Faculty of Engineering, Bar-Ilan University, Ramat Gan, 5290002, Israel
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, Kentucky 40292, United States
| |
Collapse
|
5
|
Collins AM, Ohlin M, Corcoran M, Heather JM, Ralph D, Law M, Martínez-Barnetche J, Ye J, Richardson E, Gibson WS, Rodriguez OL, Peres A, Yaari G, Watson CT, Lees WD. AIRR-C IG Reference Sets: curated sets of immunoglobulin heavy and light chain germline genes. Front Immunol 2024; 14:1330153. [PMID: 38406579 PMCID: PMC10884231 DOI: 10.3389/fimmu.2023.1330153] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 12/27/2023] [Indexed: 02/27/2024] Open
Abstract
Introduction Analysis of an individual's immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire is therefore facilitated. Methods The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources. To further improve AIRR-seq analysis, some alleles have been extended to deal with short 3' or 5' truncations that can lead them to be overlooked by alignment utilities. To avoid other challenges for analysis programs, exact paralogs (e.g. IGHV1-69*01 and IGHV1-69D*01) are only represented once in each set, though alternative sequence names are noted in accompanying metadata. Results and discussion The Reference Sets include less than half the previously recognised IG alleles (e.g. just 198 IGHV sequences), and also include a number of novel alleles: 8 IGHV alleles, 2 IGKV alleles and 5 IGLV alleles. Despite their smaller sizes, erroneous calls were eliminated, and excellent coverage was achieved when a set of repertoires comprising over 4 million V(D)J rearrangements from 99 individuals were analyzed using the Sets. The version-tracked AIRR-C IG Reference Sets are freely available at the OGRDB website (https://ogrdb.airr-community.org/germline_sets/Human) and will be regularly updated to include newly observed and previously reported sequences that can be confirmed by new high-quality data.
Collapse
Affiliation(s)
- Andrew M. Collins
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Mats Ohlin
- Department of Immunotechnology, and SciLifeLab, Lund University, Lund, Sweden
| | - Martin Corcoran
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
| | - James M. Heather
- Mass General Cancer Center, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Duncan Ralph
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Mansun Law
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States
| | - Jesus Martínez-Barnetche
- Centro de Investigación Sobre Enfermedades Infecciosas, Instituto Nacional de Salud Pública, Cuernavaca, Morelos, Mexico
| | - Jian Ye
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Eve Richardson
- La Jolla Institute for Immunology, San Diego, CA, United States
| | - William S. Gibson
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, United States
| | - Oscar L. Rodriguez
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, United States
| | - Ayelet Peres
- Bioengineering Program, Faculty of Engineering, Bar-Ilan University, Ramat Gan, Israel
| | - Gur Yaari
- Bioengineering Program, Faculty of Engineering, Bar-Ilan University, Ramat Gan, Israel
| | - Corey T. Watson
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, United States
| | - William D. Lees
- Institute of Structural and Molecular Biology, Birkbeck College, London, United Kingdom
- Human-Centered Computing and Information Science, Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal
| |
Collapse
|
6
|
Peres A, Lees WD, Rodriguez OL, Lee NY, Polak P, Hope R, Kedmi M, Collins AM, Ohlin M, Kleinstein S, Watson C, Yaari G. IGHV allele similarity clustering improves genotype inference from adaptive immune receptor repertoire sequencing data. Nucleic Acids Res 2023; 51:e86. [PMID: 37548401 PMCID: PMC10484671 DOI: 10.1093/nar/gkad603] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 06/26/2023] [Accepted: 08/03/2023] [Indexed: 08/08/2023] Open
Abstract
In adaptive immune receptor repertoire analysis, determining the germline variable (V) allele associated with each T- and B-cell receptor sequence is a crucial step. This process is highly impacted by allele annotations. Aligning sequences, assigning them to specific germline alleles, and inferring individual genotypes are challenging when the repertoire is highly mutated, or sequence reads do not cover the whole V region. Here, we propose an alternative naming scheme for the V alleles, as well as a novel method to infer individual genotypes. We demonstrate the strengths of the two by comparing their outcomes to other genotype inference methods. We validate the genotype approach with independent genomic long-read data. The naming scheme is compatible with current annotation tools and pipelines. Analysis results can be converted from the proposed naming scheme to the nomenclature determined by the International Union of Immunological Societies (IUIS). Both the naming scheme and the genotype procedure are implemented in a freely available R package (PIgLET https://bitbucket.org/yaarilab/piglet). To allow researchers to further explore the approach on real data and to adapt it for their uses, we also created an interactive website (https://yaarilab.github.io/IGHV_reference_book).
Collapse
Affiliation(s)
- Ayelet Peres
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - William D Lees
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, WC1E 7JE, UK
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, 40202, USA
| | - Noah Y Lee
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06511, USA
- Department of Pathology, Yale School of Medicine, New Haven, CT, 06520, USA
| | - Pazit Polak
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - Ronen Hope
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
| | - Meirav Kedmi
- Department of Pathology, Yale School of Medicine, New Haven, CT, 06520, USA
- Division of Hematology and Bone Marrow Transplantation, Chaim Sheba Medical Center, Tel-Hashomer, 5262000, Israel
- Sackler School of Medicine, Tel-Aviv University, Tel-Aviv, 69978, Israel
| | - Andrew M Collins
- School of Biotechnology and Biomedical Sciences, University of New South Wales, Sydney, NSW 2052, Australia
| | - Mats Ohlin
- Department of Immunotechnology Lund University, Lund, 221 00, Sweden
| | - Steven H Kleinstein
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06511, USA
- Department of Pathology, Yale School of Medicine, New Haven, CT, 06520, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, 40202, USA
| | - Gur Yaari
- Faculty of Engineering, Bar Ilan University, 5290002 Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnology and Advanced Materials, Bar Ilan University, 5290002 Ramat Gan, Israel
| |
Collapse
|
7
|
Dhande IS, Zhu Y, Joshi AS, Hicks MJ, Braun MC, Doris PA. Polygenic genetic variation affecting antibody formation underlies hypertensive renal injury in the stroke-prone spontaneously hypertensive rat. Am J Physiol Renal Physiol 2023; 325:F317-F327. [PMID: 37439198 PMCID: PMC10511163 DOI: 10.1152/ajprenal.00058.2023] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 07/07/2023] [Accepted: 07/07/2023] [Indexed: 07/14/2023] Open
Abstract
During development of the spontaneously hypertensive rat (SHR), several distinct but closely related lines were generated. Most lines are resistant to hypertensive renal disease. However, the SHR-A3 line (stroke-prone SHR) experiences end-organ injury (EOI) and provides a model of injury susceptibility that can be used to uncover genetic causation. In the present study, we generated a congenic line in which three distinct disease loci in SHR-A3 are concurrently replaced with homologous loci from an injury-resistant SHR line (SHR-B2). Verification that all three loci were homozygously replaced in this triple congenic line [SHR-A3(Trip B2)] while the genetic background of SHR-A3 was fully retained was obtained by whole genome sequencing. Congenic genome substitution was without effect on systolic blood pressure [198.9 ± 3.34 mmHg, mean ± SE, SHR-A3(Trip B2) = 194.7 ± 2.55 mmHg]. Measures of renal injury (albuminuria, histological injury scores, and urinary biomarker levels) were reduced in SHR-A3(Trip B2) animals, even though only 4.5 Mbases of the 2.8 Gbases of the SHR-B2 genome (0.16% of the genome) was transferred into the congenic line. The gene content of the three congenic loci and the functional effects of gene polymorphism within suggest a role of immunoglobulin in EOI pathogenesis. To prove the role of antibodies in EOI in SHR-A3, we generated an SHR-A3 line in which expression from the immunoglobulin heavy chain gene was knocked out (SHR-A3-IGHKO). Animals in the SHR-A3-IGHKO line lack B cells and immunoglobulin, but the hypertensive phenotype is not affected. Renal injury, however, was reduced in this line, confirming a pathogenic role for immunoglobulin in hypertensive EOI in this model of heritable risk.NEW & NOTEWORTHY Here, we used a polygenic animal model of hypertensive renal disease to show that genetic variation affecting antibody formation underlies hypertensive renal disease. We proved the genetic thesis by generating an immunoglobulin knockout in the susceptible animal model.
Collapse
Affiliation(s)
- Isha S Dhande
- Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, Texas, United States
| | - Yaming Zhu
- Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, Texas, United States
| | - Aniket S Joshi
- Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, Texas, United States
| | - M John Hicks
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas, United States
| | - Michael C Braun
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas, United States
- Department of Obstetrics and Gynecology, Baylor College of Medicine, Houston, Texas, United States
| | - Peter A Doris
- Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, Texas, United States
| |
Collapse
|
8
|
Zhang Y, Li Q, Luo L, Duan C, Shen J, Wang Z. Application of germline antibody features to vaccine development, antibody discovery, antibody optimization and disease diagnosis. Biotechnol Adv 2023; 65:108143. [PMID: 37023966 DOI: 10.1016/j.biotechadv.2023.108143] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 03/26/2023] [Accepted: 03/29/2023] [Indexed: 04/08/2023]
Abstract
Although the efficacy and commercial success of vaccines and therapeutic antibodies have been tremendous, designing and discovering new drug candidates remains a labor-, time- and cost-intensive endeavor with high risks. The main challenges of vaccine development are inducing a strong immune response in broad populations and providing effective prevention against a group of highly variable pathogens. Meanwhile, antibody discovery faces several great obstacles, especially the blindness in antibody screening and the unpredictability of the developability and druggability of antibody drugs. These challenges are largely due to poorly understanding of germline antibodies and the antibody responses to pathogen invasions. Thanks to the recent developments in high-throughput sequencing and structural biology, we have gained insight into the germline immunoglobulin (Ig) genes and germline antibodies and then the germline antibody features associated with antigens and disease manifestation. In this review, we firstly outline the broad associations between germline antibodies and antigens. Moreover, we comprehensively review the recent applications of antigen-specific germline antibody features, physicochemical properties-associated germline antibody features, and disease manifestation-associated germline antibody features on vaccine development, antibody discovery, antibody optimization, and disease diagnosis. Lastly, we discuss the bottlenecks and perspectives of current and potential applications of germline antibody features in the biotechnology field.
Collapse
Affiliation(s)
- Yingjie Zhang
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China
| | - Qing Li
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China
| | - Liang Luo
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China
| | - Changfei Duan
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China
| | - Jianzhong Shen
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China
| | - Zhanhui Wang
- National Key Laboratory of Veterinary Public Health Security, Beijing Key Laboratory of Detection Technology for Animal-Derived Food, College of Veterinary Medicine, China Agricultural University, 100193 Beijing, People's Republic of China.
| |
Collapse
|
9
|
Hardt U, Corcoran MM, Narang S, Malmström V, Padyukov L, Karlsson Hedestam GB. Analysis of IGH allele content in a sample group of rheumatoid arthritis patients demonstrates unrevealed population heterogeneity. Front Immunol 2023; 14:1073414. [PMID: 36798124 PMCID: PMC9927645 DOI: 10.3389/fimmu.2023.1073414] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 01/09/2023] [Indexed: 02/04/2023] Open
Abstract
Immunoglobulin heavy chain (IGH) germline gene variations influence the B cell receptor repertoire, with resulting biological consequences such as shaping our response to infections and altering disease susceptibilities. However, the lack of information on polymorphism frequencies in the IGH loci at the population level makes association studies challenging. Here, we genotyped a pilot group of 30 individuals with rheumatoid arthritis (RA) to examine IGH allele content and frequencies in this group. Eight novel IGHV alleles and one novel IGHJ allele were identified in the study. 15 cases were haplotypable using heterozygous IGHJ6 or IGHD anchors. One variant, IGHV4-34*01_S0742, was found in three out of 30 cases and included a single nucleotide change resulting in a non-canonical recombination signal sequence (RSS) heptamer. This variant allele, shown by haplotype analysis to be non-expressed, was also found in three out of 30 healthy controls and matched a single nucleotide polymorphism (SNP) described in the 1000 Genomes Project (1KGP) collection with frequencies that varied between population groups. Our finding of previously unreported alleles in a relatively small group of individuals with RA illustrates the need for baseline information about IG allelic frequencies in targeted study groups in preparation for future analysis of these genes in disease association studies.
Collapse
Affiliation(s)
- Uta Hardt
- Division of Rheumatology, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden and Karolinska University Hospital, Stockholm, Sweden
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Martin M. Corcoran
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Sanjana Narang
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Vivianne Malmström
- Division of Rheumatology, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden and Karolinska University Hospital, Stockholm, Sweden
| | - Leonid Padyukov
- Division of Rheumatology, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden and Karolinska University Hospital, Stockholm, Sweden
| | | |
Collapse
|
10
|
Pennell M, Rodriguez OL, Watson CT, Greiff V. The evolutionary and functional significance of germline immunoglobulin gene variation. Trends Immunol 2023; 44:7-21. [PMID: 36470826 DOI: 10.1016/j.it.2022.11.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 11/07/2022] [Indexed: 12/04/2022]
Abstract
The recombination between immunoglobulin (IG) gene segments determines an individual's naïve antibody repertoire and, consequently, (auto)antigen recognition. Emerging evidence suggests that mammalian IG germline variation impacts humoral immune responses associated with vaccination, infection, and autoimmunity - from the molecular level of epitope specificity, up to profound changes in the architecture of antibody repertoires. These links between IG germline variants and immunophenotype raise the question on the evolutionary causes and consequences of diversity within IG loci. We discuss why the extreme diversity in IG loci remains a mystery, why resolving this is important for the design of more effective vaccines and therapeutics, and how recent evidence from multiple lines of inquiry may help us do so.
Collapse
Affiliation(s)
- Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA.
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
11
|
Kaduk M, Corcoran M, Karlsson Hedestam GB. Addressing IGHV Gene Structural Diversity Enhances Immunoglobulin Repertoire Analysis: Lessons From Rhesus Macaque. Front Immunol 2022; 13:818440. [PMID: 35419009 PMCID: PMC8995469 DOI: 10.3389/fimmu.2022.818440] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 03/01/2022] [Indexed: 11/13/2022] Open
Abstract
The accurate germline gene assignment and assessment of somatic hypermutation in antibodies induced by immunization or infection are important in immunological studies. Here, we illustrate issues specific to the construction of comprehensive immunoglobulin (IG) germline gene reference databases for outbred animal species using rhesus macaques, a frequently used non-human primate model, as a model test case. We demonstrate that the genotypic variation found in macaque germline inference studies is reflected in similar levels of gene diversity in genomic assemblies. We show that the high frequency of IG heavy chain V (IGHV) region structural and gene copy number variation between subjects means that individual animals lack genes that are present in other animals. Therefore, gene databases compiled from a single or too few animals will inevitably result in inaccurate gene assignment and erroneous SHM level assessment for those genes it lacks. We demonstrate this by assigning a test macaque IgG library to the KIMDB, a database compiled of germline IGHV sequences from 27 rhesus macaques, and, alternatively, to the IMGT rhesus macaque database, based on IGHV genes inferred primarily from the genomic sequence of the rheMac10 reference assembly, supplemented with 10 genes from the Mmul_051212 assembly. We found that the use of a gene-restricted database led to overestimations of SHM by up to 5% due to misassignments. The principles described in the current study provide a model for the creation of comprehensive immunoglobulin reference databases from outbred species to ensure accurate gene assignment, lineage tracing and SHM calculations.
Collapse
Affiliation(s)
- Mateusz Kaduk
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Martin Corcoran
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | | |
Collapse
|
12
|
IMGT® Biocuration and Analysis of the Rhesus Monkey IG Loci. Vaccines (Basel) 2022; 10:vaccines10030394. [PMID: 35335026 PMCID: PMC8950363 DOI: 10.3390/vaccines10030394] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 02/25/2022] [Accepted: 02/25/2022] [Indexed: 11/29/2022] Open
Abstract
The adaptive immune system, along with the innate immune system, are the two main biological processes that protect an organism from pathogens. The adaptive immune system is characterized by the specificity and extreme diversity of its antigen receptors. These antigen receptors are the immunoglobulins (IG) or antibodies of the B cells and the T cell receptors (TR) of the T cells. The IG are proteins that have a dual role in immunity: they recognize antigens and trigger elimination mechanisms, to rid the body of foreign cells. The synthesis of the immunoglobulin heavy and light chains requires gene rearrangements at the DNA level in the IGH, IGK, and IGL loci. The rhesus monkey (Macaca mulatta) is one of the most widely used nonhuman primate species in biomedical research. In this manuscript, we provide a thorough analysis of the three IG loci of the Mmul_10 assembly of rhesus monkey, integrating IMGT previously existing data. Detailed characterization of IG genes includes their localization and position in the loci, the determination of the allele functionality, and the description of the regulatory elements of their promoters as well as the sequences of the conventional recombination signals (RS). This complete annotation of the genomic IG loci of Mmul_10 assembly and the highly detailed IG gene characterization could be used as a model, in additional rhesus monkey assemblies, for the analysis of the IG allelic polymorphism and structural variation, which have been described in rhesus monkeys.
Collapse
|
13
|
Omer A, Peres A, Rodriguez OL, Watson CT, Lees W, Polak P, Collins AM, Yaari G. T cell receptor beta germline variability is revealed by inference from repertoire data. Genome Med 2022; 14:2. [PMID: 34991709 PMCID: PMC8740489 DOI: 10.1186/s13073-021-01008-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 12/08/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND T and B cell receptor (TCR, BCR) repertoires constitute the foundation of adaptive immunity. Adaptive immune receptor repertoire sequencing (AIRR-seq) is a common approach to study immune system dynamics. Understanding the genetic factors influencing the composition and dynamics of these repertoires is of major scientific and clinical importance. The chromosomal loci encoding for the variable regions of TCRs and BCRs are challenging to decipher due to repetitive elements and undocumented structural variants. METHODS To confront this challenge, AIRR-seq-based methods have recently been developed for B cells, enabling genotype and haplotype inference and discovery of undocumented alleles. However, this approach relies on complete coverage of the receptors' variable regions, whereas most T cell studies sequence a small fraction of that region. Here, we adapted a B cell pipeline for undocumented alleles, genotype, and haplotype inference for full and partial AIRR-seq TCR data sets. The pipeline also deals with gene assignment ambiguities, which is especially important in the analysis of data sets of partial sequences. RESULTS From the full and partial AIRR-seq TCR data sets, we identified 39 undocumented polymorphisms in T cell receptor Beta V (TRBV) and 31 undocumented 5 ' UTR sequences. A subset of these inferences was also observed using independent genomic approaches. We found that a single nucleotide polymorphism differentiating between the two documented T cell receptor Beta D2 (TRBD2) alleles is strongly associated with dramatic changes in the expressed repertoire. CONCLUSIONS We reveal a rich picture of germline variability and demonstrate how a single nucleotide polymorphism dramatically affects the composition of the whole repertoire. Our findings provide a basis for annotation of TCR repertoires for future basic and clinical studies.
Collapse
Affiliation(s)
- Aviv Omer
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Ayelet Peres
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - William Lees
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, UK
| | - Pazit Polak
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Andrew M Collins
- School of Biotechnology and Biomedical Sciences, University of New South Wales, Sydney, Australia
| | - Gur Yaari
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel.
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel.
| |
Collapse
|
14
|
Dong H, Zhang Y, Wang J, Xiang H, Lv T, Wei L, Yang S, Liu X, Ren B, Zhang X, Liu L, Cao J, Wang M, Shi J, Yang N. Cas9-Based Local Enrichment and Genomics Sequence Revision of Megabase-Sized Shark IgNAR Loci. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2022; 208:181-189. [PMID: 34880108 DOI: 10.4049/jimmunol.2100844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 10/21/2021] [Indexed: 06/13/2023]
Abstract
The 0.8-Mb Ig new Ag receptor (IgNAR) region of the whitespotted bamboo shark (Chiloscyllium plagiosum) is incompletely assembled in Chr_44 of the reference genome. Here we used Cas9-assisted targeting of chromosome segments (CATCH) to enrich the 2 Mb region of the Chr_44 IgNAR loci and sequenced it by PacBio and next-generation sequencing. A fragment >3.13 Mb was isolated intact from the RBCs of sharks. The target was enriched 245.531-fold, and sequences had up to 94% coverage with a 255× mean depth. Compared with the previously published sequences, 20 holes were filled, with a total length of 3508 bp. In addition, we report five potential germline V alleles of IgNAR1 from six sharks that may belong to two clusters of the IgNAR. Our results provide a new method to research the germline of large Ig gene segments, as well as provide the enhanced bamboo shark IgNAR gene loci with fewer gaps.
Collapse
Affiliation(s)
- Hongming Dong
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Yaolei Zhang
- Beijing Genomics Institution-Qingdao, Beijing Genomics Institution-Shenzhen, Qingdao, China
| | - Jiahao Wang
- Beijing Genomics Institution-Qingdao, Beijing Genomics Institution-Shenzhen, Qingdao, China
| | - Haitao Xiang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Tianhang Lv
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Likun Wei
- Department of Biomedical Sciences, City University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Shaosen Yang
- Beijing Genomics Institution Marine, Beijing Genomics Institution, Shenzhen, China
| | - Xiaopan Liu
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Bingzhao Ren
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Xiuqing Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Lirong Liu
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Jun Cao
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
| | - Meiniang Wang
- Beijing Genomics Institution-Shenzhen, Shenzhen, China;
| | - Jiahai Shi
- Synthetic Biology Translational Research Programmes, Yong Loo Lin School of Medicine, National University of Singapore, Singapore;
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; and
| | - Naibo Yang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China;
- Beijing Genomics Institution-Shenzhen, Shenzhen, China
- Complete Genomics Inc., San Jose, CA
| |
Collapse
|
15
|
Slabodkin A, Chernigovskaya M, Mikocziova I, Akbar R, Scheffer L, Pavlović M, Bashour H, Snapkov I, Mehta BB, Weber CR, Gutierrez-Marcos J, Sollid LM, Haff IH, Sandve GK, Robert PA, Greiff V. Individualized VDJ recombination predisposes the available Ig sequence space. Genome Res 2021; 31:2209-2224. [PMID: 34815307 PMCID: PMC8647828 DOI: 10.1101/gr.275373.121] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 10/20/2021] [Indexed: 11/25/2022]
Abstract
The process of recombination between variable (V), diversity (D), and joining (J) immunoglobulin (Ig) gene segments determines an individual's naive Ig repertoire and, consequently, (auto)antigen recognition. VDJ recombination follows probabilistic rules that can be modeled statistically. So far, it remains unknown whether VDJ recombination rules differ between individuals. If these rules differed, identical (auto)antigen-specific Ig sequences would be generated with individual-specific probabilities, signifying that the available Ig sequence space is individual specific. We devised a sensitivity-tested distance measure that enables inter-individual comparison of VDJ recombination models. We discovered, accounting for several sources of noise as well as allelic variation in Ig sequencing data, that not only unrelated individuals but also human monozygotic twins and even inbred mice possess statistically distinguishable immunoglobulin recombination models. This suggests that, in addition to genetic, there is also nongenetic modulation of VDJ recombination. We demonstrate that population-wide individualized VDJ recombination can result in orders of magnitude of difference in the probability to generate (auto)antigen-specific Ig sequences. Our findings have implications for immune receptor-based individualized medicine approaches relevant to vaccination, infection, and autoimmunity.
Collapse
Affiliation(s)
- Andrei Slabodkin
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Maria Chernigovskaya
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Ivana Mikocziova
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Rahmad Akbar
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Lonneke Scheffer
- Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Milena Pavlović
- Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Habib Bashour
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Igor Snapkov
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Cédric R Weber
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | | | - Ludvig M Sollid
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | | | | | - Philippe A Robert
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Victor Greiff
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| |
Collapse
|
16
|
Yang X, Zhu Y, Chen S, Zeng H, Guan J, Wang Q, Lan C, Sun D, Yu X, Zhang Z. Novel Allele Detection Tool Benchmark and Application With Antibody Repertoire Sequencing Dataset. Front Immunol 2021; 12:739179. [PMID: 34764956 PMCID: PMC8576399 DOI: 10.3389/fimmu.2021.739179] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 10/11/2021] [Indexed: 11/29/2022] Open
Abstract
Detailed knowledge of the diverse immunoglobulin germline genes is critical for the study of humoral immunity. Hundreds of alleles have been discovered by analyzing antibody repertoire sequencing (Rep-seq or Ig-seq) data via multiple novel allele detection tools (NADTs). However, the performance of these NADTs through antibody sequences with intrinsic somatic hypermutations (SHMs) is unclear. Here, we developed a tool to simulate repertoires by integrating the full spectrum features of an antibody repertoire such as germline gene usage, junctional modification, position-specific SHM and clonal expansion based on 2152 high-quality datasets. We then systematically evaluated these NADTs using both simulated and genuine Ig-seq datasets. Finally, we applied these NADTs to 687 Ig-seq datasets and identified 43 novel allele candidates (NACs) using defined criteria. Twenty-five alleles were validated through findings of other sources. In addition to the NACs detected, our simulation tool, the results of our comparison, and the streamline of this process may benefit further humoral immunity studies via Ig-seq.
Collapse
Affiliation(s)
- Xiujia Yang
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China.,Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Yan Zhu
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Sen Chen
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China.,Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Huikun Zeng
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Junjie Guan
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China.,Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Qilong Wang
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Chunhong Lan
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Deqiang Sun
- Department of Center Laboratory, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Xueqing Yu
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Division of Nephrology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Zhenhai Zhang
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China.,Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China.,Key Laboratory of Mental Health of the Ministry of Education, Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou, China
| |
Collapse
|
17
|
Mikocziova I, Peres A, Gidoni M, Greiff V, Yaari G, Sollid LM. Germline polymorphisms and alternative splicing of human immunoglobulin light chain genes. iScience 2021; 24:103192. [PMID: 34693229 PMCID: PMC8517844 DOI: 10.1016/j.isci.2021.103192] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 07/17/2021] [Accepted: 09/27/2021] [Indexed: 10/25/2022] Open
Abstract
Inference of germline polymorphisms in immunoglobulin genes from B cell receptor repertoires is complicated by somatic hypermutations, sequencing/PCR errors, and by varying length of reference alleles. The light chain inference is particularly challenging owing to large gene duplications and absence of D genes. We analyzed the light chain cDNA sequences from naïve B cell receptor repertoires from 100 individuals. We optimized light chain allele inference by tweaking parameters of the TIgGER functions, extending the germline reference sequences, and establishing mismatch frequency patterns at polymorphic positions to filter out false-positive candidates. We identified 48 previously unreported variants of light chain variable genes. We selected 14 variants for validation and successfully validated 11 by Sanger sequencing. Clustering of light chain 5'UTR, L-PART1, and L-PART2 revealed partial intron retention in 11 kappa and 9 lambda V alleles. Our results provide insight into germline variation in human light chain immunoglobulin loci.
Collapse
Affiliation(s)
- Ivana Mikocziova
- K.G. Jebsen Centre for Coeliac Disease Research, Institute of Clinical Medicine, University of Oslo, 0372 Oslo, Norway
- Department of Immunology, Oslo University Hospital, 0372 Oslo, Norway
| | - Ayelet Peres
- Faculty of Engineering, Bar Ilan University, Ramat Gan 5290002, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan 5290002, Israel
| | - Moriah Gidoni
- Faculty of Engineering, Bar Ilan University, Ramat Gan 5290002, Israel
| | - Victor Greiff
- Department of Immunology, Oslo University Hospital, 0372 Oslo, Norway
| | - Gur Yaari
- Faculty of Engineering, Bar Ilan University, Ramat Gan 5290002, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan 5290002, Israel
| | - Ludvig M. Sollid
- K.G. Jebsen Centre for Coeliac Disease Research, Institute of Clinical Medicine, University of Oslo, 0372 Oslo, Norway
- Department of Immunology, Oslo University Hospital, 0372 Oslo, Norway
| |
Collapse
|
18
|
Zhu Y, Yang X, Ma C, Tang H, Wang Q, Guan J, Xie W, Chen S, Chen Y, Wang M, Lan C, Sun D, Wei L, Sun C, Yu X, Zhang Z. Antibody upstream sequence diversity and its biological implications revealed by repertoire sequencing. J Genet Genomics 2021; 48:936-945. [PMID: 34420911 DOI: 10.1016/j.jgg.2021.06.016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/10/2021] [Accepted: 06/16/2021] [Indexed: 12/26/2022]
Abstract
The sequence upstream of the antibody variable region (antibody upstream sequence [AUS]) consists of a 5' untranslated region (5' UTR) and a preceding leader region. The sequence variations in AUS affect antibody engineering and PCR based antibody quantification and may also be implicated in mRNA transcription and translation. However, the diversity of AUSs remains elusive. Using 5' rapid amplification of cDNA ends and high-throughput antibody repertoire sequencing technique, we acquired full-length AUSs for human, rhesus macaque, cynomolgus macaque, mouse, and rat. We designed a bioinformatics pipeline and identified 3307 unique AUSs, corresponding to 3026 and 1457 unique sequences for 5' UTR and leader region, respectively. Comparative analysis indicated that 928 (63.69%) leader sequences are novel relative to those recorded in the international ImMunoGeneTics information system. Evolutionarily, leader sequences are more conserved than 5' UTR and seem to coevolve with their downstream V genes. Besides, single-nucleotide polymorphisms are position dependent for leader regions and may contribute to the functional reversal of the downstream V genes. Finally, the AUGs in AUSs were found to have little impact on gene expression. Taken together, our findings can facilitate primer design for capturing antibodies efficiently and provide a valuable resource for antibody engineering and molecule-level antibody studies.
Collapse
Affiliation(s)
- Yan Zhu
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China; Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Key Laboratory of Mental Health of the Ministry of Education, Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou 510515, China
| | - Xiujia Yang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China; Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Key Laboratory of Mental Health of the Ministry of Education, Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou 510515, China
| | - Cuiyu Ma
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Haipei Tang
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China
| | - Qilong Wang
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China
| | - Junjie Guan
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Wenxi Xie
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Sen Chen
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Yuan Chen
- Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China
| | - Minhui Wang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Nephrology, Hainan Affiliated Hospital of Hainan Medical College, Haikou 570311, China; Department of Nephrology, Hainan General Hospital, Haikou 570311, China
| | - Chunhong Lan
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China
| | - Deqiang Sun
- Department of Center Laboratory, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou 510700, China
| | - Lai Wei
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou 510060, China
| | - Caijun Sun
- School of Public Health, Sun Yat-sen University, Shenzhen 510006, China
| | - Xueqing Yu
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Division of Nephrology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China.
| | - Zhenhai Zhang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China; Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China; Center for Precision Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China; Key Laboratory of Mental Health of the Ministry of Education, Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
19
|
Huang Y, Thörnqvist L, Ohlin M. Computational Inference, Validation, and Analysis of 5'UTR-Leader Sequences of Alleles of Immunoglobulin Heavy Chain Variable Genes. Front Immunol 2021; 12:730105. [PMID: 34671351 PMCID: PMC8521166 DOI: 10.3389/fimmu.2021.730105] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 09/06/2021] [Indexed: 12/05/2022] Open
Abstract
Upstream and downstream sequences of immunoglobulin genes may affect the expression of such genes. However, these sequences are rarely studied or characterized in most studies of immunoglobulin repertoires. Inference from large, rearranged immunoglobulin transcriptome data sets offers an opportunity to define the upstream regions (5'-untranslated regions and leader sequences). We have now established a new data pre-processing procedure to eliminate artifacts caused by a 5'-RACE library generation process, reanalyzed a previously studied data set defining human immunoglobulin heavy chain genes, and identified novel upstream regions, as well as previously identified upstream regions that may have been identified in error. Upstream sequences were also identified for a set of previously uncharacterized germline gene alleles. Several novel upstream region variants were validated, for instance by their segregation to a single haplotype in heterozygotic subjects. SNPs representing several sequence variants were identified from population data. Finally, based on the outcomes of the analysis, we define a set of testable hypotheses with respect to the placement of particular alleles in complex IGHV locus haplotypes, and discuss the evolutionary relatedness of particular heavy chain variable genes based on sequences of their upstream regions.
Collapse
Affiliation(s)
| | | | - Mats Ohlin
- Department of Immunotechnology, Lund University, Lund, Sweden
| |
Collapse
|
20
|
Yan SM, Sherman RM, Taylor DJ, Nair DR, Bortvin AN, Schatz MC, McCoy RC. Local adaptation and archaic introgression shape global diversity at human structural variant loci. eLife 2021; 10:e67615. [PMID: 34528508 PMCID: PMC8492059 DOI: 10.7554/elife.67615] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 09/14/2021] [Indexed: 12/13/2022] Open
Abstract
Large genomic insertions and deletions are a potent source of functional variation, but are challenging to resolve with short-read sequencing, limiting knowledge of the role of such structural variants (SVs) in human evolution. Here, we used a graph-based method to genotype long-read-discovered SVs in short-read data from diverse human genomes. We then applied an admixture-aware method to identify 220 SVs exhibiting extreme patterns of frequency differentiation - a signature of local adaptation. The top two variants traced to the immunoglobulin heavy chain locus, tagging a haplotype that swept to near fixation in certain southeast Asian populations, but is rare in other global populations. Further investigation revealed evidence that the haplotype traces to gene flow from Neanderthals, corroborating the role of immune-related genes as prominent targets of adaptive introgression. Our study demonstrates how recent technical advances can help resolve signatures of key evolutionary events that remained obscured within technically challenging regions of the genome.
Collapse
Affiliation(s)
- Stephanie M Yan
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
| | - Rachel M Sherman
- Department of Computer Science, Johns Hopkins UniversityBaltimoreUnited States
| | - Dylan J Taylor
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
| | - Divya R Nair
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
| | - Andrew N Bortvin
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
| | - Michael C Schatz
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
- Department of Computer Science, Johns Hopkins UniversityBaltimoreUnited States
| | - Rajiv C McCoy
- Department of Biology, Johns Hopkins University, BaltimoreBaltimoreUnited States
| |
Collapse
|
21
|
Lee JH, Toy L, Kos JT, Safonova Y, Schief WR, Havenar-Daughton C, Watson CT, Crotty S. Vaccine genetics of IGHV1-2 VRC01-class broadly neutralizing antibody precursor naïve human B cells. NPJ Vaccines 2021; 6:113. [PMID: 34489473 PMCID: PMC8421370 DOI: 10.1038/s41541-021-00376-7] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 07/29/2021] [Indexed: 02/07/2023] Open
Abstract
A successful HIV vaccine eliciting broadly neutralizing antibodies (bnAbs) must overcome the hurdle of being able to activate naive precursor B cells encoding features within their germline B cell receptors (BCR) that allow recognition of broadly neutralizing epitopes. Knowledge of whether bnAb precursor B cells are circulating at sufficient frequencies within individuals in communities heavily impacted by HIV may be important. Using a germline-targeting eOD-GT8 immunogen and high-throughput droplet-based single-cell BCR sequencing, we demonstrate that large numbers of paired BCR sequences from multiple donors can be efficiently screened to elucidate precursor frequencies of rare, naive VRC01-class B cells. Further, we analyzed IGHV1-2 allelic usage among three different cohorts; we find that IGHV1-2 alleles traditionally thought to be incompatible with VRC01-class responses are relatively common in various human populations and that germline variation within IGHV1-2 associates with gene usage frequencies in the naive BCR repertoire.
Collapse
Affiliation(s)
- Jeong Hyun Lee
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology (LJI), La Jolla, CA, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA, USA
| | - Laura Toy
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology (LJI), La Jolla, CA, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA, USA
| | - Justin T Kos
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Yana Safonova
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
- Computer Science and Engineering Department, University of California San Diego, San Diego, CA, USA
| | - William R Schief
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA, USA
- International AIDS Vaccine Initiative Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA, USA
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
- Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | - Colin Havenar-Daughton
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology (LJI), La Jolla, CA, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA.
| | - Shane Crotty
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology (LJI), La Jolla, CA, USA.
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA, USA.
- Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California, San Diego (UCSD), La Jolla, CA, USA.
| |
Collapse
|
22
|
Mikocziova I, Greiff V, Sollid LM. Immunoglobulin germline gene variation and its impact on human disease. Genes Immun 2021; 22:205-217. [PMID: 34175903 PMCID: PMC8234759 DOI: 10.1038/s41435-021-00145-5] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 06/01/2021] [Accepted: 06/10/2021] [Indexed: 02/06/2023]
Abstract
Immunoglobulins (Ig) play an important role in the immune system both when expressed as antigen receptors on the cell surface of B cells and as antibodies secreted into extracellular fluids. The advent of high-throughput sequencing methods has enabled the investigation of human Ig repertoires at unprecedented depth. This has led to the discovery of many previously unreported germline Ig alleles. Moreover, it is becoming clear that convergent and stereotypic antibody responses are common where different individuals recognise defined antigenic epitopes with the use of the same Ig V genes. Thus, germline V gene variation is increasingly being linked to the differential capacity of generating an effective immune response, which might lead to varying disease susceptibility. Here, we review recent evidence of how germline variation in Ig genes impacts the Ig repertoire and its subsequent effects on the adaptive immune response in vaccination, infection, and autoimmunity.
Collapse
Affiliation(s)
- Ivana Mikocziova
- Department of Immunology, University of Oslo, Oslo, Norway
- K. G. Jebsen Centre for Coeliac Disease Research, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Victor Greiff
- Department of Immunology, University of Oslo, Oslo, Norway
| | - Ludvig M Sollid
- Department of Immunology, University of Oslo, Oslo, Norway.
- K. G. Jebsen Centre for Coeliac Disease Research, University of Oslo and Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
23
|
Ohlin M. Poorly Expressed Alleles of Several Human Immunoglobulin Heavy Chain Variable Genes are Common in the Human Population. Front Immunol 2021; 11:603980. [PMID: 33717051 PMCID: PMC7943739 DOI: 10.3389/fimmu.2020.603980] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 12/08/2020] [Indexed: 12/23/2022] Open
Abstract
Extensive diversity has been identified in the human heavy chain immunoglobulin locus, including allelic variation, gene duplication, and insertion/deletion events. Several genes have been suggested to be deleted in many haplotypes. Such findings have commonly been based on inference of the germline repertoire from data sets covering antibody heavy chain encoding transcripts. The inference process operates under conditions that may limit identification of genes transcribed at low levels. The presence of rare transcripts that would indicate the existence of poorly expressed alleles in haplotypes that otherwise appear to have deleted these genes has been assessed in the present study. Alleles IGHV1-2*05, IGHV1-3*02, IGHV4-4*01, and IGHV7-4-1*01 were all identified as being expressed from multiple haplotypes, but only at low levels, haplotypes that by inference often appeared not to express these genes at all. These genes are thus not as commonly deleted as previously thought. An assessment of the 5' untranslated region (up to and including the TATA-box), the signal peptide-encoding part of the gene, and the 3'-heptamer suggests that the alleles have no or minimal sequence difference in these regions in comparison to highly expressed alleles. This suggest that they may be able to participate in immunoglobulin gene rearrangement, transcription and translation. However, all four poorly expressed alleles harbor unusual sequence variants within their coding region that may compromise the functionality of the encoded products, thereby limiting their incorporation into the immunoglobulin repertoire. Transcripts based on IGHV7-4-1*01 that had undergone somatic hypermutation and class switch had mutated the codon that encoded the unusual residue in framework region 3 (cysteine 92; located far from the antigen binding site). This finding further supports the poor compatibility of this unusual residue in a fully functional protein product. Indications of a linkage disequilibrium were identified as IGHV1-2*05 and IGHV4-4*01 co-localized to the same haplotypes. Furthermore, transcripts of two of the poorly expressed alleles (IGHV1-3*02 and IGHV4-4*01) mostly do not encode in-frame, functional products, suggesting that these alleles might be essentially non-functional. It is proposed that the functionality status of immunoglobulin genes should also include assessment of their ability to encode functional protein products.
Collapse
Affiliation(s)
- Mats Ohlin
- Department of Immunotechnology, Lund University, Lund, Sweden
| |
Collapse
|
24
|
Collins AM, Yaari G, Shepherd AJ, Lees W, Watson CT. Germline immunoglobulin genes: Disease susceptibility genes hidden in plain sight? CURRENT OPINION IN SYSTEMS BIOLOGY 2020; 24:100-108. [PMID: 37008538 PMCID: PMC10062056 DOI: 10.1016/j.coisb.2020.10.011] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Immunoglobulin genes are rarely considered as disease susceptibility genes despite their obvious and central contributions to immune function. This appears to be a consequence of historical views on antibody repertoire formation that no longer stand, and of difficulties that until recently surrounded the documentation of the suite of antibody genes in any individual. If these important genes are to be accessible to GWAS studies, allelic variation within the human population needs to be better documented, and a curated set of genomic variations associated with antibody genes needs to be formulated. Repertoire studies arising from the COVID-19 pandemic provide an opportunity to meet these needs, and may provide insights into the profound variability that is seen in outcomes to this infection.
Collapse
|