| Literature DB >> 26230582 |
Hao Sun1, Zhaoqing Yang1, Keqin Lin1, Shuyuan Liu1, Kai Huang1, Xiuyun Wang1, Jiayou Chu1, Xiaoqin Huang1.
Abstract
Pathogen-driven balancing selection determines the richness of human leukocyte antigen (HLA) alleles. Changes in the pathogen spectrum may cause corresponding changes in HLA loci. Approximately 700 years ago, a Mongolian population moved from the north of China to the Yunnan region in the south of China. The pathogen spectrum in the south of China differs from that in the north. In this study, changes in the HLA genes in the Yunnan Mongolian population, as well as the underlying mechanism, were investigated. A sequence-based typing method (SBT) was used to genotype HLA-DRB1 in 470 individuals from two Mongolian populations and another five ethnic groups. Meanwhile, 10 autosomal short tandem repeats (STRs) were genotyped to assess the influence of genetic background on HLA-DRB1 frequencies. The frequencies of certain alleles changed significantly in the Mongolian population that migrated to Yunnan. For example, DRB1*12:02:01 increased from 6.1% to 35.4%. STR analysis excluded the possibility of a recent bottleneck and indicated that 50% of the genetic consistency between northern and southern Mongolians; Tajima's D value for HLA-DRB1 exon2 and dN/dS analysis showed that the HLA-DRB1 genes in both Mongolian populations were under balancing selection. However, the sites under natural selection changed. We proposed that the dramatically change of HLA frequencies in southern Mongolian was caused by a combination of inter-population gene flow and natural selection. Certain diseases specific to the south of China, such as malaria, may be the driving force behind the enhanced DRB1*12:02:01 frequency.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26230582 PMCID: PMC4521750 DOI: 10.1371/journal.pone.0134334
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information about the seven populations sampled for HLA-DRB1 genotyping.
| No. | Population | Size | Location | Lat(N) | Long(E) | Language classification |
|---|---|---|---|---|---|---|
| 1 | Han | 49 | Zouping, Shandong | 36.86 | 117.74 | Chinese |
| 2 | Mongolian_IM | 49 | Damaoqi, Inner Mongolia | 41.70 | 110.43 | Altaic, Mongolic |
| 3 | Mongolian_YN | 103 | Tonghai, Yunnan | 24.15 | 102.67 | Altaic, Mongolic |
| 4 | Hani | 67 | Jinghong, Yunnan | 22.01 | 100.79 | Tibeto-Burman |
| 5 | Dai | 65 | Xinping, Yunnan | 24.07 | 101.99 | Tai-Kadai |
| 6 | Yao | 60 | Mengla, Yunnan | 21.48 | 101.57 | Hmong-Mien |
| 7 | Wa | 77 | Ximeng, Yunnan | 22.64 | 99.60 | Austro-Asiatic |
| Total | 470 |
Lat and Long represent latitude (north) and longitude (east), respectively.
* 10 STR analyses were performed from 48 members of this population.
Fig 1Information on the seven sampled populations.
Geographical location and language classification of the seven sampled populations.
Results from the BOTTLENECK tests using 10 microsatellites for seven populations.
| Population(n) | Wilcoxon test P values in TPM | Wilcoxon test P values in SMM | Allele frequency distribution |
|---|---|---|---|
| Han (n = 90) | 0.920 | 0.984 | L-shaped |
| Mongolian_IM (n = 100) | 0.813 | 0.991 | L-shaped |
| Mongolian_YN (n = 96) | 0.461 | 0.688 | L-shaped |
| Hani (n = 110) | 0.997 | 0.999 | L-shaped |
| Dai (n = 120) | 0.754 | 0.935 | L-shaped |
| Yao (n = 102) | 0.423 | 0.615 | L-shaped |
| Wa (n = 98) | 0.577 | 0.722 | L-shaped |
n represents the number of chromosomes; TPM, SMM, and L-shaped are defined in the text; Wilcoxon test P values are one-tailed tests of excess heterozygosity (H e>H eq).
Test of the neutrality-equilibrium model based on the site frequency spectrum at exon 2 of HLA-DRB1 in seven populations of China.
| Population(n) | Tajima's D | P |
|---|---|---|
| Han(n = 98) | 3.32 | P = 0.0002 |
| Mongolian_IM(n = 98) | 2.96 | P = 0.0005 |
| Mongolian_YN(n = 206) | 2.93 | P = 0.0015 |
| Hani(n = 134) | 2.52 | P = 0.0053 |
| Dai(n = 130) | 2.63 | P = 0.0035 |
| Yao(n = 120) | 2.21 | P = 0.0120 |
| Wa(n = 154) | 2.82 | P = 0.0035 |
Log-likelihood values and parameter estimates for HLA-DRB1 exon 2 in Mongolian_YN and Mongolian_IM.
| Pop | Model code | λ | Average | Parameter estimates |
|---|---|---|---|---|
| Mongolian_YN |
| -1248.0 | 0.295 |
|
|
| -1224.1 | 0.708 |
| |
|
| -1249.7 | 0.218 | a = 0.014, b = 0.045 | |
|
| -1226.1 | 0.708 |
| |
| Mongolian_IM |
| -1163.7 | 0.301 |
|
|
| -1143.4 | 0.730 |
| |
|
| -1164.3 | 0.316 | a = 0.013, b = 0.025 | |
|
| -1144.1 | 0.713 |
|
*ω0 is theratio of nonsynonymous-synonymous substitutions (d N/d S) of sites, with the proportion of p0, at which nonsynonymous mutations are “slightly deleterious”; ω1 is the d N/d S of completely neutral sites (ω1 = 1) with a proportion of p1; ω2 is the d N/d S of positively selected sites with a proportion of p2. The ‘a’ and ‘b’ are the shape parameters of the beta distribution.
Positively selected amino acid sites in HLA-DRB1 predicted by using Bayes Empirical Bayes analysis under the M2a model in seven populations.
| Populations | Positively selected amino acid sites (Reference sequence P04229) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| 11 | 13 | 30 | 37 | 57 | 67 | 70 | 74 | 86 | |
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
| ||||
|
|
|
|
|
| |||||
|
|
|
|
|
|
| ||||
|
|
| ||||||||
** the probability of this site under positive selectiongreater than 0.99, Pr(ω> 1)> 0.99.
* Pr(ω> 1) > 0.95.
Fig 2Clustering analysis from STRUCTURE.
Population structure was investigated using STRUCTURE software, assuming K = 3, 4, 5, and 6. Populations were ordered according to their linguistic affiliations. Linguistic affiliations and population names are labeled above and beneath the plot. According to the guidelines in the STRUCTURE manual and ΔK values at different Ks, K = 5 is the most appropriate value. The r values indicate that more information is given with the LOCPRIOR model than without it.
Model choice and performance of the ABC analysis.
| Scenario | Relative posterior probability (95% CI) | P(SC2) | Gene-flow hypothesis |
|---|---|---|---|
| SC1 | 0.001 (0.000–0.035) | 0.014 | No gene flow among these populations |
| SC2 | 0.748 (0.645–0.850) | 0.862 | Gene flow occurred between Mongolian_YN and Hani |
| SC3 | 0.251 (0.149–0.354) | 0.036 | Gene flow occurred between Mongolian_YN and Wa |
*P(SC2) is the proportion of pseudo-observed simulated datasets using each competing scenario (SC1 to SC3) for which SC2 was selected because it had the highest posterior probability.
‡For SC1 and SC3, P(SC2) represents an empirical estimate of the model-specific type II error rate (here, 3.6%+1.4% = 5%).
§For SC2, 1 − P(SC2) provides an empirical estimate of the type I error rate (here, 13.8%).
Fig 3PCA of the three scenarios.
The principal component analyses (PCAs) of summary statistics were performed using DIYABC 2.0 software with three different scenarios (see S2 Fig). The summary statistics of the different scenarios were calculated based on the computer simulation of HLA-DRB1 sequences under widely predefined demographic parameters. The observed data represent summary statistics of real HLA-DRB1 sequence data.
The difference in the frequency of HLA-DRB1 alleles between the Mongolian population in Yunnan and other populations.
| HLA-DRB1 Alleles | Number of HLA-DRB1 Alleles | Number of HLA-DRB1 Alleles | ||||
|---|---|---|---|---|---|---|
| Mongolian_YN(%) | Mongolian_IM(%) | χ2
| Mongolian_YN(%) | Wa(%) |
| |
| DRB1 | 15(7.3%) | 4(4.1%) | 15(7.3%) | 6(3.9%) | ||
| DRB1 | 12(5.8%) | 13(13.3%) | 12(5.8%) | 10(6.5%) | ||
| DRB1 | 18(8.7%) | 6(6.1%) | 18(8.7%) | 5(3.2%) | ||
| DRB1 | 73(35.4%) | 6(6.1%) | 73(35.4%) | 68(44.2%) | ||
| DRB1 | 15(7.3%) | 15(15.3%) | 15(7.3%) | 16(10.4%) | ||
|
| 73(35.4%) | 54(55.1%) | χ2 = 38.6 | 73(35.4%) | 49(31.8%) | χ2 = 8.99 |
| Total | 206(100.0%) | 98(100.0%) | P<0.0001 | 206(100.0%) | 154(100.0%) | P = 0.109 |
*R×C χ2 test between the two listed populations.
#LowFreqAlleles: Merged alleles with frequencies lower than 5% in Mongolian_IM.
Fig 4Comparisons of allele frequencies.
Allele frequencies higher than 5% in any one of the populations were compared between A.) Mongolian_YN and Mongolian_IM or B.) Mongolian_YN and Wa. ** indicates P< 0.0001 using a 2×2 χ2 test. * indicates P< 0.05.