| Literature DB >> 26459025 |
Rodrigo Dos Santos Francisco1,2,3, Stéphane Buhler4,5, José Manuel Nunes4,6, Bárbara Domingues Bitarello7, Gustavo Starvaggi França8,9, Diogo Meyer10, Alicia Sanchez-Mazas11,12.
Abstract
Supertypes are groups of human leukocyte antigen (HLA) alleles which bind overlapping sets of peptides due to sharing specific residues at the anchor positions-the B and F pockets-of the peptide-binding region (PBR). HLA alleles within the same supertype are expected to be functionally similar, while those from different supertypes are expected to be functionally distinct, presenting different sets of peptides. In this study, we applied the supertype classification to the HLA-A and HLA-B data of 55 worldwide populations in order to investigate the effect of natural selection on supertype rather than allelic variation at these loci. We compared the nucleotide diversity of the B and F pockets with that of the other PBR regions through a resampling procedure and compared the patterns of within-population heterozygosity (He) and between-population differentiation (G ST) observed when using the supertype definition to those estimated when using randomized groups of alleles. At HLA-A, low levels of variation are observed at B and F pockets and randomized He and G ST do not differ from the observed data. By contrast, HLA-B concentrates most of the differences between supertypes, the B pocket showing a particularly high level of variation. Moreover, at HLA-B, the reassignment of alleles into random groups does not reproduce the patterns of population differentiation observed with supertypes. We thus conclude that differently from HLA-A, for which supertype and allelic variation show similar patterns of nucleotide diversity within and between populations, HLA-B has likely evolved through specific adaptations of its B pocket to local pathogens.Entities:
Keywords: Adaptation; HLA; Human populations; Natural selection; Pathogens; Supertypes
Mesh:
Substances:
Year: 2015 PMID: 26459025 PMCID: PMC4636516 DOI: 10.1007/s00251-015-0875-9
Source DB: PubMed Journal: Immunogenetics ISSN: 0093-7711 Impact factor: 2.846
Codon composition of the PBR pockets
| Pockets | Codons | Total size in base pairs (bp) |
|---|---|---|
| A | 5, 7, 59, 63, 66, 99, 159, 163, 167, and 171 | 30 |
| B | 7, 9, 24, 25, 34, 45, 63, 66, 67, 70, and 99 | 33 |
| C, D, and E | 9, 70, 73, 74, 97, 99, 114, 147, 152, 155, 156, 159, and 160 | 39 |
| F | 77, 80, 81, 84, 116, 123, 143, 146, and 147 | 27 |
From: Saper et al. (1991)
Fig. 1Supertype variation, a boxes represent the frequency distributions of the four HLA-A and the five HLA-B supertypes and the “non-classified alleles” NCA and NCB, respectively; b each box represents the distribution of the number of distinct alleles of each supertype per population; and c the dark gray section of the bars represents the number of populations showing only one allele for the referred supertype (referred to as “monomorphic populations”). The light gray section of the bars represents the number of populations where the referred supertype was not detected
Fig. 2HLA-A supertype frequencies. Heat map summarizing the frequencies of the four HLA-A supertypes and the non-classified alleles (NCAs). Population names are shown on the right
Fig. 3HLA-B supertype frequencies. Heat map summarizing the frequencies of the five HLA-B supertypes and the non-classified alleles (NCBs). Population names are shown on the right
Supertype differentiation indexes among populations (F ST), among populations within geographic regions (F SC), and among geographic regions (F CT)
| Supertypes |
|
|
|
|---|---|---|---|
| A1 | 9.95 %*** | 2.67 %*** |
|
| A2 | 4.85 %*** | 3.40 %*** | 1.51 %* |
| A3 | 6.48 %*** | 6.48 %*** | 0.000b |
| A24 | 11.14 %*** | 6.66 %*** | 4.80 %*** |
| NCA | 15.90 %*** | 4.90 %*** |
|
| B7 | 5.11 %*** | 3.21 %*** | 1.97 %* |
| B27 | 7.54 %*** | 7.10 %*** | 0.47%b |
| B44 | 3.21 %*** | 1.72 %*** | 1.51 %** |
| B58 | 8.34 %*** | 2.91 %*** |
|
| B62 | 11.38 %*** | 7.35 %*** | 4.35 %* |
| NCB | 7.02 %*** | 2.90 %*** |
|
*p < 0.01; **p < 0.001; ***p < 0.0001, where p values refer to the probability of observing a statistic as extreme under the null hypothesis of no structure
aIn italics: Values of F CT > F SC, an indication that most of the variation was found among populations of different geographic regions
bNot significant value
Expected heterozygosity (He) of alleles and supertypes
| Loci | Dataseta | Average allelic | Average supertype |
|---|---|---|---|
| HLA-A | Complete | 0.7761 | 0.6774 |
| HLA-A | Reduced | 0.8974 | 0.7504 |
| HLA-B | Complete | 0.8948 | 0.7577 |
| HLA-B | Reduced | 0.9429 | 0.7766 |
aComplete dataset, all populations; reduced dataset, excluding Pacific, Australian, Taiwanese, and Native American populations
Fig. 4Plots of G ST values between populations based on allele (Y axis) and supertype (X axis) frequencies. The correlation (Rxy) and significance were obtained using a Mantel test. Complete dataset, all populations and reduced dataset, excluding Pacific, Australian, Taiwanese, and Native American populations
Fig. 5Total nucleotide diversity (π total) at HLA-A and HLA-B PBR pockets. Each box represents the distribution of the total nucleotide diversity per pocket for the populations of the complete dataset
Fig. 6Nucleotide diversity between supertypes (π st) at HLA-A and HLA-B PBR pockets. Each box represents the distribution of the nucleotide diversity between supertypes per pocket for the populations of the complete dataset
Fig. 7P value distributions obtained through simulations for the expected heterozygosity (He). The p value is defined as the proportion of simulated datasets with He larger than the observed He. The results obtained with the complete (top) and reduced (bottom) dataset are shown
Fig. 8Simulation results for G ST. The red line represents the average observed G ST. We calculated the average G ST value for each simulated step and then determined the significance as the proportion of simulated values smaller than the observed one. The results with the complete (top) and reduced (bottom) datasets are shown