| Literature DB >> 24294213 |
Xiangyu Rao1, Rob J De Boer, Debbie van Baarle, Martin Maiers, Can Kesmir.
Abstract
Different human leukocyte antigen (HLA) haplotypes (i.e., the specific combinations of HLA-A, -B, -DR alleles inherited together from one parent) are observed in different frequencies in human populations. Some haplotypes, like HLA-A1-B8, are very frequent, reaching up to 10% in the Caucasian population, while others are very rare. Numerous studies have identified associations between HLA haplotypes and diseases, and differences in haplotype frequencies can in part be explained by these associations: the stronger the association with a severe (autoimmune) disease, the lower the expected HLA haplotype frequency. The peptide repertoires of the HLA molecules composing a haplotype can also influence the frequency of a haplotype. For example, it would seem advantageous to have HLA molecules with non-overlapping binding specificities within a haplotype, as individuals expressing such an haplotype would present a diverse set of peptides from viruses and pathogenic bacteria on the cell surface. To test this hypothesis, we collect the proteome data from a set of common viruses, and estimate the total ligand repertoire of HLA class I haplotypes (HLA-A-B) using in silico predictions. We compare the size of these repertoires to the HLA haplotype frequencies reported in the National Marrow Donor Program (NMDP). We find that in most HLA-A and HLA-B pairs have fairly distinct binding motifs, and that the observed haplotypes do not contain HLA-A and -B molecules with more distinct binding motifs than random HLA-A and HLA-B pairs. In addition, the population frequency of a haplotype is not correlated to the distinctness of its HLA-A and HLA-B peptide binding motifs. These results suggest that there is a not a strong selection pressure on the haplotype level favoring haplotypes having HLA molecules with distinct binding motifs, which would result the largest possible presented peptide repertoires in the context of infectious diseases.Entities:
Keywords: HLA antigens; bioinformatics; computational biology; genetic; haplotypes; peptide binding; selection
Year: 2013 PMID: 24294213 PMCID: PMC3827838 DOI: 10.3389/fimmu.2013.00374
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Occurrences of the three most common HLA-A-B frequency ranked haplotypes in four major ethnic groups in US (adopted from .
| EUR | AFA | API | HIS | ||||||
|---|---|---|---|---|---|---|---|---|---|
| HLA-A | HLA-B | Rank | Rank | Rank | Rank | ||||
| 0101 | 0801 | 9.55 | 1 | 1.50 | 6 | 0.41 | 46 | 2.21 | 2 |
| 0201 | 4402 | 5.70 | 3 | 1.33 | 9 | 0.17 | 130 | 1.94 | 4 |
| 0201 | 4501 | 0.05 | 200 | 1.66 | 3 | – | – | 0.23 | 105 |
| 0201 | 5101 | 2.00 | 9 | 0.61 | 26 | 0.91 | 25 | 2.20 | 3 |
| 0207 | 4601 | – | – | – | – | 3.34 | 2 | – | – |
| 0301 | 0702 | 6.01 | 2 | 1.73 | 2 | 0.26 | 82 | 1.92 | 5 |
| 2902 | 4403 | 2.38 | 7 | 1.08 | 15 | 0.03 | 433 | 2.54 | 1 |
| 3001 | 4201 | – | – | 2.96 | 1 | – | – | 0.40 | 50 |
| 3303 | 4403 | – | – | 0.09 | 261 | 2.94 | 3 | 0.16 | 156 |
| 3303 | 5801 | 0.08 | 162 | 0.28 | 88 | 4.53 | 1 | 0.10 | 230 |
EUR, Caucasian; AFA, African; API, Asian; HIS, Hispanic. F stands for population frequency in percentages.
Numbers of different haplotypes with a significantly positive LD in four major US ethnic groups.
| Ethnicity | Haplotype # (%) |
|---|---|
| EUR | 60 (57.7) |
| API | 34 (35.4) |
| HIS | 43 (33.5) |
| AFA | 43 (33.7) |
Population coverage in percentages is given within parenthesis.
Figure 1Distribution of the presented peptide overlap (given in percentages, see . (A) The distribution of the peptide overlaps for the 120 unique haplotypes we identified in the US population. (B) The distribution in A is weighted with the frequency of the haplotype in the population. If a haplotype was found in more than one ethnicity in US, we have taken the maximum frequency into account, normalizing the frequencies such that the sum remains one. In (C) we plot the overlaps found in artificially generated haplotypes (created by reshuffling the HLA-A and -B molecules from the 120 unique haplotypes). This plot is representative of 100 sets of artificially generated haplotypes. (D) The weighted peptide overlaps for the artificial haplotypes, where we estimate the frequency of the haplotype as the multiplication of the frequency of HLA-A and HLA-B molecule in the population (normalized to let the sum of the frequencies of the artificial haplotypes remain one).