Literature DB >> 28539189

The Individual and Population Genetics of Antibody Immunity.

Corey T Watson1, Jacob Glanville2, Wayne A Marasco3.   

Abstract

Antibodies (Abs) produced by immunoglobulin (IG) genes are the most diverse proteins expressed in humans. While part of this diversity is generated by recombination during B-cell development and mutations during affinity maturation, the germ-line IG loci are also diverse across human populations and ethnicities. Recently, proof-of-concept studies have demonstrated genotype-phenotype correlations between specific IG germ-line variants and the quality of Ab responses during vaccination and disease. However, the functional consequences of IG genetic variation in Ab function and immunological outcomes remain underexplored. In this opinion article, we outline interconnections between IG genomic diversity and Ab-expressed repertoires and structure. We further propose a strategy for integrating IG genotyping with functional Ab profiling data as a means to better predict and optimize humoral responses in genetically diverse human populations, with immediate implications for personalized medicine.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28539189      PMCID: PMC5656258          DOI: 10.1016/j.it.2017.04.003

Source DB:  PubMed          Journal:  Trends Immunol        ISSN: 1471-4906            Impact factor:   16.687


The Molecular Basis for Antibody Diversity

Antibodies (Abs) have long been appreciated as key constituents of the adaptive immune response. Their function is to enable selective recognition and mediate immune responses to novel foreign antigens. This is accomplished through the somatic generation of vast repertoires of hundreds of millions of unique Ab receptors that can be selected, matured, and ultimately participate in the formation of long-term memory during B-cell development and activation. As a consequence of this diversity, even after nearly a century of research, the complexity of the Ab response within and between individuals is only beginning to be delineated at the molecular and genetic levels. Hundreds of variable (V) and dozens of diversity (D) and joining (J) immunoglobulin (IG) germ-line gene segments across three primary loci in the human genome comprise the necessary building blocks of the expressed Ab heavy- and light-chain repertoires [1]. Whereas the heavy chain is encoded by genes at the IG heavy-chain locus (IGH), the light chain can be encoded by genes at either the IG kappa (IGK) or IG lambda (IGL) chain loci [1]. The naïve Ab repertoire is formed by assembling variants of these building blocks using a specialized V(D)J recombination process that somatically joins various V, D, and J segments (or V and J at IGK and IGL). The introduction and deletion of P and N nucleotides at V(D)J junctions and the pairing of different heavy and light chains dramatically increase diversity (Figure 1 ) [2]. Considering these processes alone, a given baseline or primary naïve repertoire can theoretically sample from 1015 different Abs [3]. The extraordinary diversity of the naïve repertoire ensures that it will likely contain a naïve Ab with at least weak initial binding against a vast array of antigens.
Figure 1

A Basic Overview of Key Elements That Contribute to the Diversity of Naïve and Memory Repertoires. A basic schematic of the germ-line IGH locus is shown (not to scale), consisting of clusters of tandemly arrayed IGH V, D, J, and constant (C) gene segments. For a subset of these segments, multiple alleles are shown, representing population-level ‘allelic diversity’ (see Table 1). During the initial formation of the naïve repertoire, single IGH V, D and J gene segment on one of two chromosomes in a given B cell are somatically recombined; at each of these steps, P and N nucleotides are added at the D–J and V–D junctions (​‘junctional diversity’), respectively. This process, known as V(D)J rearrangement, is the basis for ‘combinatorial diversity’. The recombined V (red), D (orange), and J (maroon) segments will then be transcribed, and following splicing, will be paired with a C gene (gray). The somatic recombination process also occurs at one of two loci encoding the Ab light-chain gene segments [IGK and IGL; except it involves only V (yellow), J (maroon), and C (light gray) gene segments]. Two identical heavy chains and two identical light chains are ultimately paired through disulfide bonds to form a functional Ab; thus, additional diversity in the expressed Ab repertoire comes from ‘heavy- and light-chain pairing’. Together, the V, D, and J segments depicted comprise the variable domain of the heavy chain of a functional antibody, and together with the variable domain of the light chain, encoded by V and J segments, are responsible for Ag binding. The C domains of both heavy and light chains provide structural and/or effector functions of the Ab. As shown here for the heavy chain, the variable domain is partitioned into four framework regions (FRs) and three complementarity-determining regions (CDRs). Following Ag stimulation, ‘somatic hypermutations’ introduce additional variation in the variable domain of the Ab (vertical purple bars), with the aim of improving binding affinity. Mutations that arise via SHM can occur across all FRs and CDRs, but these are most prevalent in CDRs, as illustrated by the hypothetical frequency histogram shown between the unmutated and mutated IG heavy-chain RNA. While the general molecular mechanisms outlined here have long been realized as the primary determinants of diversity within a given expressed Ab repertoire, there is a growing appreciation for the contribution of ‘allelic diversity’ as well, particularly as this pertains to repertoire differences observed between unrelated individuals. Ab, antibody; C, constant; D, diversity; IGH, immunoglobulin heavy-chain locus; IGK, immunoglobulin kappa; IGL, immunoglobulin lambda; J, joining; SHM, somatic hypermutation; V, variable.

A Basic Overview of Key Elements That Contribute to the Diversity of Naïve and Memory Repertoires. A basic schematic of the germ-line IGH locus is shown (not to scale), consisting of clusters of tandemly arrayed IGH V, D, J, and constant (C) gene segments. For a subset of these segments, multiple alleles are shown, representing population-level ‘allelic diversity’ (see Table 1). During the initial formation of the naïve repertoire, single IGH V, D and J gene segment on one of two chromosomes in a given B cell are somatically recombined; at each of these steps, P and N nucleotides are added at the D–J and V–D junctions (​‘junctional diversity’), respectively. This process, known as V(D)J rearrangement, is the basis for ‘combinatorial diversity’. The recombined V (red), D (orange), and J (maroon) segments will then be transcribed, and following splicing, will be paired with a C gene (gray). The somatic recombination process also occurs at one of two loci encoding the Ab light-chain gene segments [IGK and IGL; except it involves only V (yellow), J (maroon), and C (light gray) gene segments]. Two identical heavy chains and two identical light chains are ultimately paired through disulfide bonds to form a functional Ab; thus, additional diversity in the expressed Ab repertoire comes from ‘heavy- and light-chain pairing’. Together, the V, D, and J segments depicted comprise the variable domain of the heavy chain of a functional antibody, and together with the variable domain of the light chain, encoded by V and J segments, are responsible for Ag binding. The C domains of both heavy and light chains provide structural and/or effector functions of the Ab. As shown here for the heavy chain, the variable domain is partitioned into four framework regions (FRs) and three complementarity-determining regions (CDRs). Following Ag stimulation, ‘somatic hypermutations’ introduce additional variation in the variable domain of the Ab (vertical purple bars), with the aim of improving binding affinity. Mutations that arise via SHM can occur across all FRs and CDRs, but these are most prevalent in CDRs, as illustrated by the hypothetical frequency histogram shown between the unmutated and mutated IG heavy-chain RNA. While the general molecular mechanisms outlined here have long been realized as the primary determinants of diversity within a given expressed Ab repertoire, there is a growing appreciation for the contribution of ‘allelic diversity’ as well, particularly as this pertains to repertoire differences observed between unrelated individuals. Ab, antibody; C, constant; D, diversity; IGH, immunoglobulin heavy-chain locus; IGK, immunoglobulin kappa; IGL, immunoglobulin lambda; J, joining; SHM, somatic hypermutation; V, variable.
Table 1

Allelic, Copy Number, and Amino Acid Variation for IG Functional and Open Reading Frame Genes Cataloged in IMGTa

FamilyGenesAllelesNS variantsS variantsCDR-H1 NS variantsCDR-H2 NS variantsGenes in CNV
IGHV111401913236
IGHV2423269311
IGHV327109825791712
IGHV4107892711188
IGHV52944001
IGHV61201000
IGHV72640001
Subtotal58267227155252929
IGKV120353317411
IGKV21118144110
IGKV3818249210
IGKV411NANANANA0
IGKV511NANANANA0
IGKV63520000
IGKV700NANANANA0
Subtotal44787330731
IGLV171242021
IGLV2620138230
IGLV31118145330
IGLV43621000
IGLV551032001
IGLV61220000
IGLV72310000
IGLV81311001
IGLV91302000
IGLV101341100
IGLV111211000
Subtotal39824523683
Total141427345208384033

Data accessed from IMGT February 2017. NS, nonsynonymous; S, synonymous.

Even so, this impressive baseline diversity can be subsequently augmented when a B cell encounters and is stimulated by an antigen to undergo somatic hypermutation (SHM; Figure 1), resulting in lineages of tens of thousands of clonally derived affinity maturation variants of the initial Ab. Specifically, SHM introduces somatic mutations throughout the variable portion of the Ab, including targeted hotspots residing within the antigen-contacting hypervariable complementarity-determining regions (CDRs). This process ultimately increases the affinity and specificity of the Ab for binding the target epitope, facilitating a highly focused antigen-specific response. While the prevailing paradigm for investigating B-cell and Ab-mediated responses has placed emphasis on the importance of the unique molecular mechanisms cited earlier in the generation of key functional Abs, there is a growing appreciation for the fact that IG genes are highly variable at the germ-line level, exhibiting extreme allelic polymorphism and gene copy number variation (CNV) between individuals and across populations 4, 5, 6, 7, 8, 9. Recent studies have begun to highlight that, in addition to diversity introduced during V(D)J recombination, heavy- and light-chain pairing, and SHM, IG germ-line variation (e.g., allelic variation; Figure 1) plays a vital part in determining the development of the naïve repertoire, with downstream impacts on signatures observed in the memory compartment, and the capacity of an individual to mount an Ab response to specific epitopes 10, 11, 12, 13, 14, 15, 16.

IG Loci Haplotype Diversity in the Human Population

Recent genomic sequencing indicates that IG loci, specifically IGH, may be among the most polymorphic in the human genome [17]. Across IGH, IGK, and IGL, there are currently >420 alleles cataloged in the ImMunoGeneTics information system database (IMGT) 18, 19, 20, 21 that have been described from germ-line DNA in the human population, with an enrichment of nonsynonymous variants (Table 1 ). Although the validity of some alleles in IMGT has been called into question [22], the number of polymorphic alleles continues to grow 11, 23, 24, especially as IG gene sequencing is conducted in increasing numbers of non-Caucasian samples 7, 9, 25. A recent study conducted in 28 indigenous South Africans identified 122 non-IMGT IGHV alleles [9]. In addition to IG allelic variation and single nucleotide polymorphisms (SNPs), CNVs, including large deletions, insertions, and duplications (∼8–75 Kb in length), are also prevalent in IG regions (Table 1). Using IGH as an example, up to 29 of the 58 functional/open reading frame (ORF) IGHV genes may vary in genomic copy number 4, 6, 7, 11, 26, 27, 28; CNVs of IGH D (diversity) and constant (C) region genes are also known 11, 12, 29. Until recently, primarily due to technical difficulties associated with the complex genomic architecture of the IG loci, none of the known CNVs in IGHV had been sequenced at nucleotide resolution [7]; many likely remain undescribed at the genomic level. Allelic, Copy Number, and Amino Acid Variation for IG Functional and Open Reading Frame Genes Cataloged in IMGTa Data accessed from IMGT February 2017. NS, nonsynonymous; S, synonymous. The high prevalence of IG allelic and locus structural diversity translates into extreme levels of inter-individual haplotype variation 4, 5, 6, 7. For example, recent comparisons of the two available completed assemblies for the IGHV gene region (∼1 Mb in length) revealed that two human chromosomes can vary by >100 Kb of sequence, with >2,800 SNPs, and CNVs of 10 IGHV functional/ORF genes 7, 17. In population sequencing experiments, extreme examples of heterozygosity have been noted, with evidence of some individuals carrying more than one allele at every IGHV coding gene [9]. Supporting earlier genetic mapping data 4, 5, more recent analysis of inferred haplotypes from Ab repertoire data surveyed in nine individuals revealed that all 18 haplotypes characterized were unique [6]. Furthermore, at the population level, of the few SNPs and CNVs screened within IGH, allele and genotype frequencies have been shown to vary considerably between ethnic backgrounds 7, 8, 9, 15, with evidence of selection [7]. Despite the evidence for elevated germ-line diversity, genomic resources for IG loci continue to lag behind other regions of the genome [26]. Because of this, the comprehensive and accurate genotyping of IG polymorphisms remains a significant challenge 26, 30, and as a result, the full extent of IG polymorphism and the implications for human health are yet to be uncovered [26]. However, it is plausible that population-level diversity in the IG loci, particularly in IGH, will rival that of other complex immune gene families, such as the human leukocyte antigen (HLA) and killer cell IG-like receptor (KIR) genes. These genes are also characterized by extreme haplotype diversity, due to CNV and coding region variation 31, 32; HLA genes, for example, have thousands of known alleles [31]. In contrast to IG genes, HLA and KIR have been studied more extensively across human populations, and have demonstrated critical roles in disease 31, 32.

Influence of IG Germ-Line Diversity in the Expressed Ab Repertoire and Ab Function

Our limited knowledge of IG population diversity has hindered our ability to comprehensively test for direct connections between IG germ-line polymorphisms, variation in the repertoire generated after recombination, amino acid variation in the Ab produced, and ultimately Ab function. Advances in high-throughput sequencing technology now enable extensive characterization of the expressed Ab repertoire 33, 34, 35, creating opportunities for beginning to investigate the heritability of the Ab response at fine-scale resolution. Applications of these methods, collectively referred to as repertoire sequencing (‘IgSeq’ or ‘RepSeq’), have already led to a wealth of new discoveries in a range of contexts 33, 36. These include general observations that key features of the Ab repertoire show extensive variability between healthy individuals 10, 11, 13, 14, 37, and a limited overlap of B-cell receptor clones between individuals, even monozygotic (MZ) twins 10, 13, 14. However, RepSeq studies have also revealed that these inter-individual differences are not necessarily random, but likely have a strong underlying genetic component, providing initial support for the importance of germ-line IG polymorphism in determining the naïve and Ag-stimulated Ab repertoire. For example, several recent studies have revealed that V, D, and J gene usage in the naïve repertoire is much more highly correlated between MZ twins than between unrelated individuals 10, 13, 14, and that IG gene usage patterns are consistent across time points within a given individual [38]. A role for genetic factors can be seen for other repertoire features in twins as well, including the degree of SHM [13], and the distribution of CDR-H3 length and clone convergence 10, 13, 14. Intriguingly, although existing data suggest that features in the memory compartment are more stochastic, likely reflective of random recruitment and transient proliferation, certain genes and repertoire features exhibit predictable patterns even in memory B cells 10, 13, 14, 39. Studies of repertoire heritability are consistent with a number of examples for which germ-line IG polymorphisms have been explicitly linked to features in the expressed Ab repertoire 12, 15, 40, 41, 42 (see Figure IA in Box 1 for hypothetical examples of IG genotype effects on the repertoire). Sasso et al. [40] reported the first direct connection to IG genotype, reporting that CNV of IGHV1-69 was tightly correlated with its relative usage in tonsillar B cells. Our own work has also demonstrated this relationship, but uncovered associations for IGHV1-69 coding and potentially noncoding polymorphism as well as CNV [15]. Inferred deletions of IGHD genes have also been shown to associate with variation in D–J pairing frequencies, demonstrating that germ-line effects on the repertoire extend beyond V genes [12]. An interesting aspect of IGH CNVs is that, in addition to observed effects of these variants on the genes within the CNV event, they also can impact the usage of genes elsewhere in the locus 12, 15. For example, we recently observed apparent long-range effects of IGHV1-69 CNV in the naïve and memory repertoire, in that individuals with fewer IGHV1-69 germ-line copies and reduced usage showed consistently higher usage of IGHV genes over 200 Kb away [15]. The mechanisms underlying the observed effects of CNVs in human IG loci remain technically difficult to assess experimentally, but it has been speculated that these large changes in locus architecture (i.e., deletions and insertions) could alter regulatory systems related to V(D)J recombination 12, 15, for example, by modifying the chromatin landscape, cis-regulatory elements and transcription factor binding, and/or the physical locations of the IG V, D, and J genes. All of these factors are known to be key determinants of IG gene accessibility and usage frequencies in mice 43, 44.
Figure I

Impacts of IG Germ-Line Polymorphism on Ab Repertoire/Structural Diversity. (A) Hypothetical examples of associations between IG gene region CNV (V gene 1 insertion/deletion) and SNP (noncoding regulatory variant, A/C) genotypes and V gene usage frequencies in the expressed Ab repertoire. (B) Violin plot showing nonsynonymous polymorphism rates in CDR positions with high (>0.6; ‘high’, blue) or low (<0.25; ‘low’, red) frequency of contact with antigen, as labeled on the X axis. The Y axis records, for each CDR-H1 and CDR-H2 position, the number of IMGT IGHV genes that have alleles with nonsynonymous polymorphisms at that position. The positional probability of antigen contact was calculated for each CDR position as the percentage of 150 crystal structures of antibody–antigen complexes from the protein database (PDB) where any atom of that residue is within 5 Å of any antigen atom. Allelic variation is enriched in antigen-contact sites, in that the number of IGHV genes with alleles containing nonsynonymous polymorphisms is greater for high contact probability positions. (C) Genotype frequency differences between five human ethnic groups [Africans (AFR); East Asians (EAS); South Asians (SAS); Central/South American (AMR); and Europeans (EUR)], published by the 1000 Genomes Project [80]*, at two SNPs in IGHV1-69 that have been shown to encode functional residues critical for neutralizing Abs against the influenza HA stem (F54 and L54 amino acid-associated alleles; SNP rs55891010; left panel), and ‘NEAT2’ domain of Staphylococcus aureus (R50 and G50 alleles; SNP rs11845244; right panel). In the left panel, the F allele encodes the functional critical phenylalanine residue, and in the right panel, the primary glycine residue is encoded by the G allele. Interestingly, in both cases, the frequency of individuals lacking alleles encoding the critical residues varies among populations, with the L/L and R/R genotypes showing the lowest frequencies in Africans, and the highest frequencies in South Asians. rs55891010 and rs11845244 are in linkage disequilibrium, and thus R50 and L54 amino acids (and likewise, G50 and F54) tend to co-occur in alleles of IGHV1-69. This explains similarities in genotype frequency estimates between the two SNPs in each population. *Although these genotypes may contain error due to confounds of unrepresented CNV information, they can provide insight into potential population differences. Ab, antibody; CDR, complementarity-determining region; CNV, copy number variation; HA, hemagglutinin; IG, immunoglobulin; IMGT, ImMunoGeneTics information system database; SNP, single nucleotide polymorphism.

Although the roles of IG germ-line variants have not been comprehensively studied, there is now convincing evidence that they can influence Ab repertoire variation and function in two main ways (i and ii). In addition, known functional variants exhibit allele frequency variation between human populations (iii): (i) Gene copy number changes and coding/noncoding SNPs in IGHV genes have been shown to correlate with gene usage patterns in the naïve repertoire, the memory repertoire, patterns of SHM, class-switch frequency, and circulating Ab titers (Figure IA). (ii) There are now many examples that provide evidence for functional effects of germ-line variants encoded in CDR-H1 and CDR-H2, many of which are polymorphic and vary between human populations. Based on known IGHV alleles in the IMGT database, residues within CDR-H1/H2 that have a higher probability of making Ag contact are also more likely to be associated with a polymorphic allele (Figure IB). (iii) Several positions in IGHV genes that encode residues critical for antigen binding are polymorphic and exhibit different genotype frequencies between human populations and ethnicities (Figure IC). Alt-text: Box 1 Impacts of IG Germ-Line Polymorphism on Ab Repertoire/Structural Diversity. (A) Hypothetical examples of associations between IG gene region CNV (V gene 1 insertion/deletion) and SNP (noncoding regulatory variant, A/C) genotypes and V gene usage frequencies in the expressed Ab repertoire. (B) Violin plot showing nonsynonymous polymorphism rates in CDR positions with high (>0.6; ‘high’, blue) or low (<0.25; ‘low’, red) frequency of contact with antigen, as labeled on the X axis. The Y axis records, for each CDR-H1 and CDR-H2 position, the number of IMGT IGHV genes that have alleles with nonsynonymous polymorphisms at that position. The positional probability of antigen contact was calculated for each CDR position as the percentage of 150 crystal structures of antibody–antigen complexes from the protein database (PDB) where any atom of that residue is within 5 Å of any antigen atom. Allelic variation is enriched in antigen-contact sites, in that the number of IGHV genes with alleles containing nonsynonymous polymorphisms is greater for high contact probability positions. (C) Genotype frequency differences between five human ethnic groups [Africans (AFR); East Asians (EAS); South Asians (SAS); Central/South American (AMR); and Europeans (EUR)], published by the 1000 Genomes Project [80]*, at two SNPs in IGHV1-69 that have been shown to encode functional residues critical for neutralizing Abs against the influenza HA stem (F54 and L54 amino acid-associated alleles; SNP rs55891010; left panel), and ‘NEAT2’ domain of Staphylococcus aureus (R50 and G50 alleles; SNP rs11845244; right panel). In the left panel, the F allele encodes the functional critical phenylalanine residue, and in the right panel, the primary glycine residue is encoded by the G allele. Interestingly, in both cases, the frequency of individuals lacking alleles encoding the critical residues varies among populations, with the L/L and R/R genotypes showing the lowest frequencies in Africans, and the highest frequencies in South Asians. rs55891010 and rs11845244 are in linkage disequilibrium, and thus R50 and L54 amino acids (and likewise, G50 and F54) tend to co-occur in alleles of IGHV1-69. This explains similarities in genotype frequency estimates between the two SNPs in each population. *Although these genotypes may contain error due to confounds of unrepresented CNV information, they can provide insight into potential population differences. Ab, antibody; CDR, complementarity-determining region; CNV, copy number variation; HA, hemagglutinin; IG, immunoglobulin; IMGT, ImMunoGeneTics information system database; SNP, single nucleotide polymorphism. A role for noncoding polymorphisms is also strongly supported by early work conducted in the human IGK region which directly showed that a variant associated with Haemophilus influenzae infection susceptibility in the recombination signal sequence (RSS) of IGKV2-29 significantly decreased gene rearrangement frequency [42]. RSSs, which are critical for the recruitment of RAG1/2 proteins, have also been demonstrated to impact IGHV gene usage in mice 43, 44. Moreover, extensive work in the murine IG gene loci has uncovered important roles for other key cis-regulatory sequences and transcription factors as well 45, 46. Such analyses have not yet been comprehensively conducted in humans, and as a result, our knowledge of the IG regulatory elements involved in the formation of the expressed Ab repertoire is restricted to canonical RSS, promoter, enhancer elements, and class switch regions. However, even for these well-known noncoding regulatory regions, limited data on human population-level variation exist, and thus the broader consequences of polymorphism in these elements on Ab repertoire variability have not been explored. Although direct links between repertoire variability and human IG CNVs and noncoding polymorphisms remain limited to the few examples discussed above, additional evidence from expressed Ab repertoire studies in unrelated individuals also highlights the clear potential for these variants to have pervasive impacts on Ab repertoire features, particularly gene usage in the naïve compartment. Most demonstrable is the fact that many of the genes with the most variability in naïve repertoire usage across individuals are also known to be in CNV, including examples of the complete absence of genes in the expressed Ab repertoires of some donors 6, 10, 11, 12. In addition, allele-specific usage in the naïve Ab repertoires of individuals heterozygous at a given IGHV gene has been demonstrated, also clearly suggesting a role for noncoding variation and CNV [11]. Moreover, although effects of germ-line IG polymorphism may be most evident on a per gene basis, it is worth noting that findings from MZ twins demonstrated that certain CDR-H3 features are highly heritable 13, 14. This indicates that even strong genetically determined biases on individual V, D, and J gene usage [and thus their nonrandom combination during V(D)J rearrangement] could also be directly linked to variation observed within CDR-H3. This is an important point given that CDR-H3 variation has classically been considered independent of the germ line 13, 14. In addition to effects of IG polymorphism on gene usage, functional CDR variants can also be directly encoded in the genome. For example, across the ∼267 coding alleles cataloged in IMGT for functional and ORF IGHV genes, 60% of the 382 polymorphisms are nonsynonymous (Table 1), including sites located in CDR-H1 and CDR-H2 with predicted relevance to Ab functional residue diversity (see Figure IB in Box 1). Although the CDR-H3 loop, formed at the V(D)J junction, is the most diverse region of an Ab and is a principal determinant of specificity 47, 48, there is a growing appreciation for the importance of residues outside of CDR-H3 in antigen recognition and binding 15, 49, 50, 51. For example, recent analyses have shown that the median length of CDR-H2, which is solely encoded by germ-line V gene sequence, is substantially longer than that of CDR-H3, and typically forms the same number of interactions with antigen [52]. Specifically, analyses of antigen-binding region (ABRs; which roughly correspond to CDRs, but differ slightly in their boundaries) have shown that Abs contain a median of six, six, and four contact residues in the heavy-chain CDR-H3, H2, and H1 ABR regions, respectively. In addition, the overall percentage of energetically important Ag-binding residues within each ABR follows the same rank order, with ∼31%, 23%, and 14% for H3, H2, and H1, respectively. Similar trends were noted for light-chain ABRs as well [52]. In addition, considering that many known nonsynonymous sites reside outside of CDRs (Table 1), it is worth highlighting the fact that there are also examples demonstrating indirect effects of framework region variants on Ag binding 53, 54.

The Identification of Shared Ab Immune Response Signatures across Individuals

A critical question is whether the germ-line effects on the repertoire outlined above can also partially account for inter-individual variation of the Ab-mediated response in disease and clinical phenotypes. The initial observation from RepSeq studies that essentially no Ab clones were shared among individuals, including MZ twins, posed a challenge to comparative Ab repertoire analysis: how could correlates of protection be identified in the Ab repertoire if every individual was responding with different Abs? However, an answer began to emerge with the observation that in multiple settings, including viral and bacterial infection, different individuals have been shown to respond to a given antigen with Abs that share convergent amino acid signatures 13, 49, 54, 55, 56, 57, 58. These convergent Abs are often encoded by common V genes or sets of V genes, and specific amino acid residues in their CDRs enable them to converge upon a common binding solution against a shared antigen. Critically, in some cases evaluated, convergent signatures include amino acid residues that are directly encoded in the germ line. The occurrence of such convergent Ab responses highlights the potential for tracking common immune responses across individuals, and understanding the role of genetic factors, even when each individual creates unique Abs. Importantly, the implications of this line of thinking could be broad, as IG gene biases have been observed in contexts other than infection, including autoimmunity and cancer 59, 60. Moreover, IG gene biases may also extend to usage patterns of D and J genes, light-chain genes, and heavy- and light-chain V gene pairing frequencies 56, 61, 62.

Structural Residues Critical for Ag Binding and Involved in Biased Gene Usage Are Encoded in the Germ Line and Exhibit Population Variability

There are now many instances for which functional contributions of biased IG genes have been traced back to specific germ-line-encoded residues, including sites that are polymorphic in the human population 15, 16, 50, 53, 54, 55, 63, 64, 65. These examples illuminate a direct role of the IG germ line in disease-associated Ab responses. In the case of stem-directed broadly neutralizing Abs (BnAbs) against influenza hemagglutinin (HA), the most prevalent Abs use the heavy-chain gene IGHV1-69 66, 67, 68, 69, 70. These IGHV1-69 BnAbs recognize an overlapping epitope of group 1 influenza A viruses and only amino acids from IGHV make contact with HA. Importantly, of the 14 known alleles at IGHV1-69, only those encoding a critical phenylalanine at position 54 (F54) within CDR-H2 have a major role in shaping the BnAbs response 16, 15, 55, 71. Although IGHV1-69 F54-encoding alleles are dominant, there is a growing list of additional HA-directed BnAbs that also show IG germ-line biases 51, 56, 72, 73, 74, including those also known to be polymorphic with respect to coding variants and CNVs. Interestingly, there are additional instances of biased IGHV1-69 allele usage in other disease contexts, with both overlapping and contrasting patterns to that observed for influenza. For example, F54 alleles are predominantly observed in IGHV1-69-expressing B cells associated with chronic lymphoid leukemia (CLL), whereas alleles encoding a leucine (L54) at this position are primarily used by non-neutralizing anti-gp41 Abs in HIV-1 63, 64. Moreover, it has been shown that IGHV1-69 F54 alleles, in comparison with L54 alleles, have lower usage in the memory B-cell pool 10, 15. This observation may be similar to trends noted for IGHV4-34, which is also significantly underrepresented in the memory compartment of healthy individuals [10], and presumes to reflect a selective pressure against autoreactive Abs 75, 76. Other polymorphic positions in the framework regions of IGHV1-69, in conjunction with CDR-H2 54, have also recently been shown to influence Ab binding of Middle East respiratory syndrome coronavirus (MERS-CoV) [53] and the Staphylococcus aureus NEAr iron transporter 2 (NEAT2) domain [54]. In the example of NEAT2, neutralizing Abs encoded by IGHV1-69 alleles carrying an arginine (R) at position 50 in place of glycine (G) showed significantly reduced NEAT2 binding [54]. Interestingly, based on publicly available data, the frequencies of critical alleles within polymorphic positions of IGHV1-69 vary across populations (see Figure IC in Box 1).

A Strategy for Defining Relationships between IG Polymorphisms, Expressed Ab Signatures, and Functional Outcomes

Considering the aforementioned evidence, we argue that the antigen-specific Ab repertoire is likely influenced by the host genotype. Although the genetic bases for repertoire and germ-line gene biases have not been comprehensively investigated, several recent studies provide a strategy for systematically integrating data on IG polymorphism and Ab responses at the population and molecular levels to provide unique insight into Ab signatures associated with disease. We have begun to explore this idea in detail at the IGHV1-69 locus in the context of influenza vaccination [15]. Providing strong proof-of-concept, by initially focusing on observed IGHV1-69 allelic usage bias against a critical broadly neutralizing epitope, we genotyped the IGHV1-69 F54/L54 allele and copy number frequencies in a cohort of 85 H5N1 vaccines, including 18 individuals with accompanying Ab repertoire data [15]. Drawing directly on aspects of repertoire heritability reviewed above, we found robust connections between these polymorphisms and repertoire gene usage in both the unmutated IgM (naïve) and IgG memory repertoires, with IGHV1-69 germ-line gene usage increasing with the number of copies of F54 alleles. In addition to usage frequencies, IGHV1-69 genotype also associated with IGHV1-69 B-cell expansion, SHM, and Ig class switching. It is important to note that these genotype effects extended to levels of circulating anti-HA stem BnAbs postvaccination, with individuals carrying only germ-line-encoded CDR-H2 L54 alleles having lower IGHV1-69 BnAbs. Furthermore, with direct repertoire sequencing, we were able to specifically demonstrate that only carriers of the IGHV1-69 F54 alleles expressed convergent anti-BnAb signatures. These results are bolstered by similar observations recently made by two other groups that also carried out IGHV1-69 F54/L54 allele genotyping in their cohorts 16, 55. Altogether, these data demonstrate that genetically determined baseline differences in the Ab repertoire can set the stage for disease-related responses. A crucial aspect of this story (which is expected to emerge in other cases as well) is that the frequency of IGHV1-69 F54 alleles and CNV varies considerably across populations 7, 15. Specifically, the number of individuals that would be predicted to lack the capacity to generate effective IGHV1-69 BnAbs was much higher in some populations. However, we and others have shown that individuals lacking IGHV1-69 F54 alleles likely utilize other germ-line genes in place of IGHV1-69 51, 55. This finding in particular both highlights the complexity of the Ab response and demonstrates that the integration of genotyping information can help provide a more nuanced interpretation of the signatures discovered in the expressed repertoire. Moreover, it suggests that efforts should be made to study these complex responses in larger and more diverse cohorts, including individuals from presently understudied populations. Building on findings in these initial studies 15, 16, 55, we propose a framework for integrating genotypic information into future studies of the Ab response in wellness and disease (Figure 2 , Key Figure). The general strategy is as follows: (i) identify IG gene biases observed in a disease-related or epitope-specific response; (ii) characterize this response at the population level by performing comprehensive genotyping of coding, noncoding, and gene copy number variants at and around the locus of interest (and others if there is rationale); (iii) perform repertoire sequencing and analysis of the response in all relevant B-cell subsets to identify all Ab convergence groups with allele bias; and (iv) evaluate genotype–phenotype linkages of the functional Ab response and specific Ab convergence groups.
Figure 2

Key Figure: A New Paradigm for Integrating Genotypic Information into the Study of the Ab-Mediated Response in Disease and Clinical Phenotypes

In the proposed paradigm, a population cohort is partitioned into subgroups based on functional genotypes/haplotypes that are directly associated with subgroup-specific signatures in the expressed repertoire and other relevant phenotypes (e.g., Ab titer; clinical outcome) associated with the Ab response to a given antigen/epitope. This partitioning can be used to inform tailored clinical care and treatment (e.g., vaccination regime). Ab, antibody.

Key Figure: A New Paradigm for Integrating Genotypic Information into the Study of the Ab-Mediated Response in Disease and Clinical Phenotypes In the proposed paradigm, a population cohort is partitioned into subgroups based on functional genotypes/haplotypes that are directly associated with subgroup-specific signatures in the expressed repertoire and other relevant phenotypes (e.g., Ab titer; clinical outcome) associated with the Ab response to a given antigen/epitope. This partitioning can be used to inform tailored clinical care and treatment (e.g., vaccination regime). Ab, antibody.

Concluding Remarks

We see a growing body of evidence to support the link between IG polymorphism and phenotype that may have important clinical applications (see Outstanding Questions). The most obvious of these correlations include potential effects of CNV and SNPs in non-translated and translated IG gene regions on expressed repertoire variability in naïve and memory B cell subsets. Some of these polymorphisms could be expected to more broadly impact variation in protective Ab responses [77] and quality of the memory B-cell pool. We anticipate that IG polymorphism will contribute to differences in expression of common (public) and unique (private) antibody signatures that are associated with protective responses in disease and in response to vaccination. We propose a model for the future in which cataloging these public signatures for biased gene use, V(D)J associations, SHMs, and heavy-light chain pairing in the context of IG germ-line variation should begin to provide us with information to advance our understanding of the immunogenetic potential of an individual’s baseline naïve repertoire (Figure 2), particularly when more complete data sets of biased Ab signatures to specific epitopes become available. Based on existing genetic data, it is probable that similar IG haplotypes will associate with overlapping signatures in baseline repertoire profiles, even if not to the degree of repertoire similarity observed in MZ twins. This IG polymorphism, as we and others have begun to show, may further influence the evolution of antigen-experienced B cells and plasma cells, where other genetic polymorphisms in the IG loci and environmental exposures come into play in continuing to shape affinity, epitope specificity, and fate. In addition, class-switched memory B-cell compartments will vary over time [37], and could be quantitated in the type and size of clonotypes with both public and private signatures against immunodominant epitopes. Together, this knowledge should pave the way to using molecular and genetic signatures for mapping an individual’s exposure history, current wellness state, and immune potential against future antigenic threats. For example, characterization of genotypes that specifically lead to common BnAb signatures in the repertoire should be useful for tailoring vaccines to responsive genotypes with the goal of achieving 100% ‘universal vaccine’ responsiveness at the population level (Figure 2). In addition, such information could lead to advances in the use of anti-idiotypic antibody and chimeric antigen receptor T-cell therapies that are directed against germ-line gene expressing B-cell clonotypes that are directly involved in autoimmune disease and hematologic malignancies 78, 79. We face tall hurdles to moving this paradigm forward, the greatest being the completion of a comprehensive catalogue of human IG haplotype variation [26]. However, with ever expanding advances in immunologic and genomic technologies, we believe that such integrative approaches are within our reach, and have the potential to transform our understanding of Ab-mediated immune responses in the clinical and research arenas. How large of an effect does IG polymorphism have on the development of the baseline naïve repertoire, and what types of genetic variation (CNV, coding variants, regulatory variants) matter most? Do effects of IG genetic variants on the Ab repertoire correspond to known biases in disease and/or clinically relevant Ab responses? What can population-level data on genetic and expressed Ab repertoire signatures tell us about an individual’s exposure history, current wellness state, and immune potential against future antigenic threats? Can we leverage integrated population-level data sets to inform clinical care, and more effective vaccine and therapeutic strategies?
  79 in total

1.  Diversity in the CDR3 region of V(H) is sufficient for most antibody specificities.

Authors:  J L Xu; M M Davis
Journal:  Immunity       Date:  2000-07       Impact factor: 31.745

2.  Pulsed-field electrophoresis screening for immunoglobulin heavy-chain constant-region (IGHC) multigene deletions and duplications.

Authors:  A Bottaro; U Cariota; M DeMarchi; A O Carbonara
Journal:  Am J Hum Genet       Date:  1991-04       Impact factor: 11.025

3.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing.

Authors:  Christopher Vollmers; Rene V Sit; Joshua A Weinstein; Cornelia L Dekker; Stephen R Quake
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-29       Impact factor: 11.205

Review 4.  High-Throughput DNA Sequencing Analysis of Antibody Repertoires.

Authors:  Scott D Boyd; Shilpa A Joshi
Journal:  Microbiol Spectr       Date:  2014-10

5.  Comprehensive interrogation of a minimalist synthetic CDR-H3 library and its ability to generate antibodies with therapeutic potential.

Authors:  Ciara M Mahon; Matthew A Lambert; Jacob Glanville; Jason M Wade; Brian J Fennell; Mark R Krebs; Douglas Armellino; Sharon Yang; Xuemei Liu; Cliona M O'Sullivan; Benedicte Autin; Katarzyna Oficjalska; Laird Bloom; Janet Paulsen; Davinder Gill; Marc Damelin; Orla Cunningham; William J J Finlay
Journal:  J Mol Biol       Date:  2013-02-19       Impact factor: 5.469

6.  Vaccine-Induced Antibodies that Neutralize Group 1 and Group 2 Influenza A Viruses.

Authors:  M Gordon Joyce; Adam K Wheatley; Paul V Thomas; Gwo-Yu Chuang; Cinque Soto; Robert T Bailer; Aliaksandr Druz; Ivelin S Georgiev; Rebecca A Gillespie; Masaru Kanekiyo; Wing-Pui Kong; Kwanyee Leung; Sandeep N Narpala; Madhu S Prabhakaran; Eun Sung Yang; Baoshan Zhang; Yi Zhang; Mangaiarkarasi Asokan; Jeffrey C Boyington; Tatsiana Bylund; Sam Darko; Christopher R Lees; Amy Ransier; Chen-Hsiang Shen; Lingshu Wang; James R Whittle; Xueling Wu; Hadi M Yassine; Celia Santos; Yumiko Matsuoka; Yaroslav Tsybovsky; Ulrich Baxa; James C Mullikin; Kanta Subbarao; Daniel C Douek; Barney S Graham; Richard A Koup; Julie E Ledgerwood; Mario Roederer; Lawrence Shapiro; Peter D Kwong; John R Mascola; Adrian B McDermott
Journal:  Cell       Date:  2016-07-21       Impact factor: 41.582

7.  Convergent antibody signatures in human dengue.

Authors:  Poornima Parameswaran; Yi Liu; Krishna M Roskin; Katherine K L Jackson; Vaishali P Dixit; Ji-Yeun Lee; Karen L Artiles; Simona Zompi; Maria José Vargas; Birgitte B Simen; Bozena Hanczaruk; Kim R McGowan; Muhammad A Tariq; Nader Pourmand; Daphne Koller; Angel Balmaseda; Scott D Boyd; Eva Harris; Andrew Z Fire
Journal:  Cell Host Microbe       Date:  2013-06-12       Impact factor: 21.023

8.  Individual variation in the germline Ig gene repertoire inferred from variable region gene rearrangements.

Authors:  Scott D Boyd; Bruno A Gaëta; Katherine J Jackson; Andrew Z Fire; Eleanor L Marshall; Jason D Merker; Jay M Maniar; Lyndon N Zhang; Bita Sahaf; Carol D Jones; Birgitte B Simen; Bozena Hanczaruk; Khoa D Nguyen; Kari C Nadeau; Michael Egholm; David B Miklos; James L Zehnder; Andrew M Collins
Journal:  J Immunol       Date:  2010-05-21       Impact factor: 5.422

9.  A potent broad-spectrum protective human monoclonal antibody crosslinking two haemagglutinin monomers of influenza A virus.

Authors:  Ying Wu; MyungSam Cho; David Shore; Manki Song; JungAh Choi; Tao Jiang; Yong-Qiang Deng; Melissa Bourgeois; Lynn Almli; Hua Yang; Li-Mei Chen; Yi Shi; Jianxu Qi; An Li; Kye Sook Yi; MinSeok Chang; Jin Soo Bae; HyunJoo Lee; JiYoung Shin; James Stevens; SeoungSuh Hong; Cheng-Feng Qin; George F Gao; Shin Jae Chang; Ruben O Donis
Journal:  Nat Commun       Date:  2015-07-21       Impact factor: 14.919

10.  Structural and genetic basis for development of broadly neutralizing influenza antibodies.

Authors:  Daniel Lingwood; Patrick M McTamney; Hadi M Yassine; James R R Whittle; Xiaoti Guo; Jeffrey C Boyington; Chih-Jen Wei; Gary J Nabel
Journal:  Nature       Date:  2012-08-29       Impact factor: 49.962

View more
  41 in total

Review 1.  Rationalizing Random Walks: Replicating Protective Antibody Trajectories.

Authors:  Jennifer L Remmel; Margaret E Ackerman
Journal:  Trends Immunol       Date:  2021-01-26       Impact factor: 16.687

2.  B cells engineered to express pathogen-specific antibodies protect against infection.

Authors:  Howell F Moffett; Carson K Harms; Kristin S Fitzpatrick; Marti R Tooley; Jim Boonyaratanakornkit; Justin J Taylor
Journal:  Sci Immunol       Date:  2019-05-17

3.  Susceptibility to Hypertensive Renal Disease in the Spontaneously Hypertensive Rat Is Influenced by 2 Loci Affecting Blood Pressure and Immunoglobulin Repertoire.

Authors:  Isha S Dhande; Stacy M Cranford; Yaming Zhu; Sterling C Kneedler; M John Hicks; Scott E Wenderfer; Michael C Braun; Peter A Doris
Journal:  Hypertension       Date:  2018-02-05       Impact factor: 10.190

Review 4.  Targeting the N332-supersite of the HIV-1 envelope for vaccine design.

Authors:  Thandeka Moyo; Dale Kitchin; Penny L Moore
Journal:  Expert Opin Ther Targets       Date:  2020-04-27       Impact factor: 6.902

Review 5.  The antibody response in HIV-1-infected donors.

Authors:  Simone I Richardson; Penny L Moore
Journal:  Curr Opin HIV AIDS       Date:  2019-07       Impact factor: 4.283

6.  Slow Delivery Immunization Enhances HIV Neutralizing Antibody and Germinal Center Responses via Modulation of Immunodominance.

Authors:  Kimberly M Cirelli; Diane G Carnathan; Bartek Nogal; Jacob T Martin; Oscar L Rodriguez; Amit A Upadhyay; Chiamaka A Enemuo; Etse H Gebru; Yury Choe; Federico Viviano; Catherine Nakao; Matthias G Pauthner; Samantha Reiss; Christopher A Cottrell; Melissa L Smith; Raiza Bastidas; William Gibson; Amber N Wolabaugh; Mariane B Melo; Benjamin Cossette; Venkatesh Kumar; Nirav B Patel; Talar Tokatlian; Sergey Menis; Daniel W Kulp; Dennis R Burton; Ben Murrell; William R Schief; Steven E Bosinger; Andrew B Ward; Corey T Watson; Guido Silvestri; Darrell J Irvine; Shane Crotty
Journal:  Cell       Date:  2019-05-09       Impact factor: 41.582

7.  A Bayesian phylogenetic hidden Markov model for B cell receptor sequence analysis.

Authors:  Amrit Dhar; Duncan K Ralph; Vladimir N Minin; Frederick A Matsen
Journal:  PLoS Comput Biol       Date:  2020-08-17       Impact factor: 4.475

8.  Anti-V2 antibody deficiency in individuals infected with HIV-1 in Cameroon.

Authors:  Lily Liu; Liuzhe Li; Aubin Nanfack; Luzia M Mayr; Sonal Soni; Adam Kohutnicki; Lucy Agyingi; Xiao-Hong Wang; Michael Tuen; Yongzhao Shao; Maxim Totrov; Susan Zolla-Pazner; Xian-Peng Kong; Ralf Duerr; Miroslaw K Gorny
Journal:  Virology       Date:  2019-01-14       Impact factor: 3.616

9.  Rapid isolation and immune profiling of SARS-CoV-2 specific memory B cell in convalescent COVID-19 patients via LIBRA-seq.

Authors:  Bing He; Shuning Liu; Yuanyuan Wang; Mengxin Xu; Wei Cai; Jia Liu; Wendi Bai; Shupei Ye; Yong Ma; Hengrui Hu; Huicui Meng; Tao Sun; Yanling Li; Huanle Luo; Mang Shi; Xiangjun Du; Wenjing Zhao; Shoudeng Chen; Jingyi Yang; Haipeng Zhu; Yusheng Jie; Yuedong Yang; Deyin Guo; Qiao Wang; Yuwen Liu; Huimin Yan; Manli Wang; Yao-Qing Chen
Journal:  Signal Transduct Target Ther       Date:  2021-05-17

Review 10.  Igh Locus Polymorphism May Dictate Topological Chromatin Conformation and V Gene Usage in the Ig Repertoire.

Authors:  Amy L Kenter; Corey T Watson; Jan-Hendrik Spille
Journal:  Front Immunol       Date:  2021-05-18       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.