| Literature DB >> 23549729 |
Yoshiki Yasukochi1, Yoko Satta.
Abstract
Polymorphism of genes in the major histocompatibility complex (MHC) is believed to be maintained by balancing selection. However, direct evidence of selection has proven difficult to demonstrate. In 1994, Satta and colleagues estimated the selection intensity of the human MHC (human leukocyte antigen (HLA)) loci; however, at that time the number of HLA sequences was limited. By comparing five different methods, this study demonstrated the best way to calculate the selection coefficient, through a computer simulation study. Since the study, many HLA nucleotide sequences have been made available. Our new analysis takes advantage of these newly available sequences and compares new estimates with those of the previous study. Generally, our new results are consistent with those of the 1994 study. Our results show that, even after 20 years of exhaustive sequencing of human HLA, the number of dominant HLA alleles, on which our original estimate of selection intensity depended, appears to be conserved. Indeed, according to the frequency distribution for each HLA allele, most sequences in the database were minor or private alleles; therefore, we conclude that the selection intensities of HLA loci are at most 4.4 % even though the HLA is the prominent example on which the natural selection has been operating.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23549729 PMCID: PMC3651823 DOI: 10.1007/s00251-013-0693-x
Source DB: PubMed Journal: Immunogenetics ISSN: 0093-7711 Impact factor: 2.846
The number of alleles and dominant alleles in the database
|
| No. of alleles | No. of PBR in different alleles | ||||
|---|---|---|---|---|---|---|
| In the database | Whole | Nonrecombinant | Dominanta, b | Nonrecombinantb | Dominantb | |
|
| 1,594 | 156 | 50 | 27c (18)d | 32 | 26 |
|
| 2,123 | 235 | 143 | 40c (21)d | 113 | 39 |
|
| 1,102 | 143 | 129 | 20c | 60 | 19 |
|
| 975 | 64 | 56 | 26 (1)d | 37 | 23 |
|
| 144 | (61)e | 55 | 13 | 13 | 10 |
|
| 145 | (44)e | 38 | 11 (1)d | 23 | 11 |
|
| 58 | (13)e | 13 | − | 6 | − |
|
| 20 | (5)e | 5 | − | 4 | − |
|
| 47 | 34 | 31 | 8 (2)d | 9 | 5 |
|
| 34 | (11)e | 9 | 5 (2)d | 3 | 3 |
aThe number of dominant alleles that have a high frequency (>1 %) throughout human populations worldwide (including possible recombinants)
bThe number of amino acid sequences
cThe number of dominant alleles that are detected in >100 chromosomes from >25 human populations
dThe number of dominant alleles that are excluded due to a possible recombinant or short sequence
eNot whole coding sequence (see text)
Estimates of the mean number of nonsynonymous substitutions, the relative nonsynonymous substitution rate in the PBR, and the selection coefficient (s)
|
| Lengtha |
|
|
| No. of allele 1b | No. of allele 2c |
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 1,095 bp | 295 | 123 | 674 | 27d | 9 | 28.9 (26.0) | 7.6 (6.3) | 4,500 (3,000) | 0.04 (0.09) | 2.25 % (1.50 %) |
|
| 1,086 bp | 300 | 122 | 643 | 40d | 19 | 35.9 (36.0) | 9.7 (9.0) | 8,825 (8,200) | 0.01 (0.02) | 4.41 % (4.20 %) |
|
| 1,093 bp | 301 | 125 | 665 | 20d | 20 | 17.3 (15.0) | 4.9 (3.4) | 1,030 (530) | 0.15 (0.29) | 0.52 % (0.26 %) |
|
| 795 bp | 223 | 53 | 521 | 26 | 25 | 23.2 (25.0) | 10.2 (9.3) | 3,890 (3,900) | 0.01 (0.01) | 1.94 % (1.90 %) |
|
| 687 bp | 148 | 51 | 347 | 13 | 13 | 12.4 (20.0) | 4.4 (6.0) | 479 (1,700) | 0.14 (0.08) | 0.24 % (0.85 %) |
|
| 543 bp | 146 | 53 | 344 | 11 | 10 | 11.9 (6.8) | 9.2 (4.3) | 918 (140) | 0.01 (0.08) | 0.46 % (0.07 %) |
|
| 549 bp | 148 | 54 | 347 | (13e) | − | 5.6 − | 5.4 − | 120 − | 0.04 − | 0.06 % − |
|
| 549 bp | 148 | 53 | 348 | (5e) | − | 8.0 − | 7.9 − | 360 − | 0.01 − | 0.18 % − |
|
| 765 bp | 211 | 47 | 504 | 8 | 6 | 5.9 (13.0) | 2.1 (4.5) | 53 (550) | 0.23 (0.14) | 0.03 % (0.28 %) |
|
| 663 bp | 190 | 42 | 428 | 5 | 3 | 4.8 − | 3.3 − | 54 − | 0.10 − | 0.03 % − |
The numbers of sites of synonymous and nonsynonymous substitutions were estimated using the modified Nei–Gojobori model (R = 1.04 for class I, R = 1.14 for class II). The parameter values in parentheses were estimated on the basis of method II described in Satta (1992). The mutation rate per PBR per generation (u) = 1.7 × 10−6 for class I loci and 7.5 × 10−7 for class II loci; effective population size (N e) = 105 (see Satta et al. 1994)
L the number of synonymous sites across the entire region, L the number of nonsynonymous sites at the PBR, L the number of nonsynonymous sites at the non-PBR
aThe length or the number of sites used in this study (not in the previous study)
bThe number of dominant alleles that have a high frequency (>1 %) throughout human populations worldwide (shown as n a in text)
cThe number of dominant alleles excluding possible recombinants
dThe number of dominant alleles that are detected in >100 chromosomes from >25 human populations
eThe number of alleles not derived from the dominant allele because of lack of information about allele frequencies in the human populations