| Literature DB >> 28225073 |
Xin-Sheng Hu1,2, Francis C Yeh3, Yang Hu4, Li-Ting Deng1,2, Richard A Ennos5, Xiaoyang Chen1,2.
Abstract
Copy-number-variable (CNV) loci differ from single nucleotide polymorphic (SNP) sites in size, mutation rate, and mechanisms of maintenance in natural populations. It is therefore hypothesized that population genetic divergence at CNV loci will differ from that found at SNP sites. Here, we test this hypothesis by analysing 856 CNV loci from the genomes of 1184 healthy individuals from 11 HapMap populations with a wide range of ancestry. The results show that population genetic divergence at the CNV loci is generally more than three times lower than at genome-wide SNP sites. Populations generally exhibit very small genetic divergence (Gst = 0.05 ± 0.049). The smallest divergence is among African populations (Gst = 0.0081 ± 0.0025), with increased divergence among non-African populations (Gst = 0.0217 ± 0.0109) and then among African and non-African populations (Gst = 0.0324 ± 0.0064). Genetic diversity is high in African populations (~0.13), low in Asian populations (~0.11), and intermediate in the remaining 11 populations. Few significant linkage disequilibria (LDs) occur between the genome-wide CNV loci. Patterns of gametic and zygotic LDs indicate the absence of epistasis among CNV loci. Mutation rate is about twice as large as the migration rate in the non-African populations, suggesting that the high mutation rates play dominant roles in producing the low population genetic divergence at CNV loci.Entities:
Mesh:
Year: 2017 PMID: 28225073 PMCID: PMC5320550 DOI: 10.1038/srep43178
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sample sizes and polymorphisms at the genome-wide CNV loci in 11 human populations.
| Populations | Sample sizes | Mean | ||
|---|---|---|---|---|
| ASW | 83 | 85.16 | 1.90 | 0.1290 ± 0.1457 |
| CEU | 165 | 57.71 | 1.62 | 0.1123 ± 0.1622 |
| CHB | 84 | 46.14 | 1.50 | 0.1072 ± 0.1651 |
| CHD | 85 | 46.61 | 1.50 | 0.1047 ± 0.1636 |
| GIH | 88 | 53.85 | 1.58 | 0.1121 ± 0.1629 |
| JPT | 86 | 44.51 | 1.48 | 0.1079 ± 0.1669 |
| LWK | 90 | 80.49 | 1.85 | 0.1268 ± 0.1483 |
| MEX | 77 | 62.38 | 1.66 | 0.1130 ± 0.1599 |
| MKK | 171 | 83.53 | 1.88 | 0.1248 ± 0.1522 |
| TSI | 88 | 56.78 | 1.60 | 0.1123 ± 0.1626 |
| YRI | 167 | 80.14 | 1.84 | 0.1322 ± 0.1496 |
*P(99%): the percentage of polymorphic loci where the frequency of the most common allele was ≤0.99.
Na: the number of observed alleles per CNV locus; H: the expected heterozygosity under Hardy-Weinberg equilibrium.
Figure 1A histogram of G distribution at 856 CNV loci.
The abscissa axis is the G values. The curve is based on the kernel-smoothed density function.
Figure 2Gst values across chromosomes at CNV loci.
The observed G values are in red, and their 95% CIs are derived from 1000 bootstrapping samples on each chromosome. The lines with opened and closed circles are the lower and upper G values of 95% CIs, respectively. The abscissa axis is the positions for CNV loci on each chromosome in Mb, and the ordinate axis is the G values.
Comparison of the pairwise G at the genome-wide CNV loci with the pairwise F at the genome-wide SNP sites7.
| ASW | CEU | CHB | CHD | GIH | JPT | LWK | MEX | MKK | TSI | YRI | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASW | 0.0275 (0.0001) | 0.0357 (0.0001) | 0.0353 (0.0001) | 0.0267 (0.0001) | 0.0336 (0.0001) | 0.0071 (0.00001) | 0.0258 (0.0001) | 0.0087 (0.00001) | 0.0255 (0.00005) | 0.0061 (0.00001) | |
| CEU | 3.7081 (0.0230) | 0.0307 (0.0001) | 0.0307 (0.0001) | 0.0140 (0.00004) | 0.0331 (0.0001) | 0.0348 (0.0001) | 0.0116 (0.00003) | 0.0248 (0.00005) | 0.0038 (0.00001) | 0.0375 (0.0001) | |
| CHB | 3.9697 (0.0210) | 3.5964 (0.0251) | 0.0038 (0.00001) | 0.0289 (0.0001) | 0.0063 (0.00002) | 0.0413 (0.0001) | 0.0246 (0.0001) | 0.0295 (0.0001) | 0.0332 (0.0001) | 0.0401 (0.0001) | |
| CHD | 4.0432 (0.0213) | 3.6545 (0.0253) | 0.2649 (0.0265) | 0.0280 (0.0001) | 0.0073 (0.00003) | 0.0415 (0.0001) | 0.0243 (0.0001) | 0.0295 (0.0001) | 0.0326 (0.0001) | 0.0403 (0.0001) | |
| GIH | 3.5497 (0.0200) | 2.4946 (0.0227) | 2.6315 (0.0221) | 2.7458 (0.0229) | 0.0304 (0.0001) | 0.0333 (0.0001) | 0.0147 (0.00004) | 0.0221 (0.00005) | 0.0139 (0.00004) | 0.0328 (0.0001) | |
| JPT | 3.9680 (0.0209) | 3.3981 (0.0234) | 1.1119 (0.0163) | 1.0957 (0.0144) | 2.5390 (0.0181) | 0.0421 (0.0001) | 0.0260 (0.0001) | 0.0306 (0.0001) | 0.0352 (0.0001) | 0.0406 (0.0001) | |
| LWK | 1.4017 (0.0142) | 4.1900 (0.0242) | 4.2389 (0.0187) | 4.2400 (0.0210) | 3.9617 (0.0195) | 4.1856 (0.0207) | 0.0331 (0.0001) | 0.0082 (0.00002) | 0.0332 (0.0001) | 0.0059 (0.00001) | |
| MEX | 3.6383 (0.0206) | 2.6817 (0.0118) | 2.0810 (0.0217) | 2.9173 (0.0221) | 2.3857 (0.0152) | 2.6898 (0.0205) | 4.0130 (0.0196) | 0.0214 (0.0001) | 0.0112 (0.00003) | 0.0330 (0.0001) | |
| MKK | 1.6680 (0.0576) | 4.1655 (0.0217) | 4.8353 (0.0256) | 4.8624 (0.0258) | 4.2751 (0.0243) | 4.7094 (0.0248) | 2.0688 (0.0129) | 4.4724 (0.0255) | 0.0206 (0.00004) | 0.0128 (0.00002) | |
| TSI | 3.8807 (0.0210) | 1.0415 (0.0262) | 3.3390 (0.0231) | 3.4390 (0.0236) | 2.4423 (0.0159) | 3.1983 (0.0219) | 4.2654 (0.0225) | 2.8533 (0.0195) | 4.7502 (0.0307) | 0.0334 (0.0001) | |
| YRI | 1.5206 (0.0662) | 4.1929 (0.0203) | 4.6181 (0.0199) | 4.6174 (0.0199) | 4.3656 (0.0230) | 4.5913 (0.0242) | 1.3481 (0.0171) | 4.3467 (0.0230) | 2.1023 (0.0087) | 4.5808 (0.0201) |
The above diagonal values are the mean multilocus G estimates, and the below diagonal values are the ratios of F/G. Standard deviations are shown in parentheses.
Figure 3Cluster analysis of 11 human populations.
The plot is based on Nei’s genetic distance by using UPGMA for hierarchical clustering.
Means and standard deviations of significant gametic LDs (r-squares) in 11 human populations*.
| ASW | CEU | CHB | CHD | GIH | JPT | LWK | MEX | MKK | TSI | YRI | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.031% | 0.053% | 0.045% | 0.054% | 0.034% | 0.041% | 0.027% | 0.044% | 0.034% | 0.031% | 0.045% | |
| 0.84 ± 0.32 | 0.78 ± 0.32 | 0.79 ± 0.33 | 0.72 ± 0.35 | 0.75 ± 0.34 | 0.76 ± 0.34 | 0.80 ± 0.34 | 0.79 ± 0.33 | 0.68 ± 0.43 | 0.86 ± 0.24 | 0.64 ± 0.44 | |
| 21(16:5) | 28(17:11) | 26(16:10) | 30(16:14) | 27(16:11) | 25(15:10) | 21(15:6) | 28 (17:11) | 27(16:11) | 25(17:8) | 27(15:12) | |
| 0.21 ± 0.04 | 0.09 ± 0.02 | 0.18 ± 0.03 | 0.17 ± 0.01 | 0.19 ± 0.02 | 0.18 ± 0.02 | 0.18 ± 0.04 | 0.21 ± 0.04 | 0.10 ± 0.02 | 0.18 ± 0.03 | 0.10 ± 0.02 | |
| 59 | 37 | 9 | 13 | 9 | 5 | 42 | 34 | 60 | 11 | 77 | |
| 0.035% | 0.058% | 0.050% | 0.062% | 0.037% | 0.047% | 0.032% | 0.048% | 0.038% | 0.033% | 0.050% | |
| 0.79 ± 0.32 | 0.78 ± 0.31 | 0.80 ± 0.32 | 0.74 ± 0.34 | 0.80 ± 0.30 | 0.75 ± 0.33 | 0.80 ± 0.33 | 0.80 ± 0.39 | 0.65 ± 0.42 | 0.85 ± 0.23 | 0.63 ± 0.43 | |
| 23(18:5) | 30(19:11) | 28(18:10) | 32(18:14) | 27(18:9) | 28(17:11) | 23(17:6) | 29(19:10) | 28(16:12) | 26(18:8) | 29(17:12) | |
| 0.21 ± 0.04 | 0.10 ± 0.01 | 0.20 ± 0.03 | 0.17 ± 0.01 | 0.19 ± 0.03 | 0.18 ± 0.02 | 0.19 ± 0.04 | 0.21 ± 0.04 | 0.10 ± 0.02 | 0.17 ± 0.03 | 0.10 ± 0.02 | |
| 68 | 41 | 11 | 17 | 12 | 6 | 52 | 40 | 69 | 13 | 89 | |
| 0.034% | 0.064% | 0.057% | 0.062% | 0.044% | 0.051% | 0.029% | 0.048% | 0.040% | 0.041% | 0.051% | |
| 0.82 ± 0.32 | 0.75 ± 0.34 | 0.75 ± 0.34 | 0.71 ± 0.35 | 0.76 ± 0.33 | 0.74 ± 0.34 | 0.77 ± 0.36 | 0.78 ± 0.32 | 0.68 ± 0.42 | 0.83 ± 0.26 | 0.64 ± 0.44 | |
| 23(17:6) | 31(18:13) | 27(16:11) | 31(16:15) | 28(17:11) | 27(16:11) | 22(15:7) | 29(18:11) | 27(16:11) | 27(18:9) | 27(15:12) | |
| 0.22 ± 0.09 | 0.13 ± 0.17 | 0.27 ± 0.23 | 0.21 ± 0.16 | 0.23 ± 0.15 | 0.26 ± 0.26 | 0.18 ± 0.04 | 0.23 ± 0.11 | 0.10 ± 0.02 | 0.21 ± 0.14 | 0.11 ± 0.07 | |
| 68 | 47 | 17 | 18 | 19 | 10 | 47 | 40 | 76 | 21 | 92 | |
| 0.041% | 0.071% | 0.064% | 0.073% | 0.048% | 0.059% | 0.034% | 0.056% | 0.045% | 0.047% | 0.058% | |
| 0.79 ± 0.32 | 0.74 ± 0.34 | 0.74 ± 0.35 | 0.72 ± 0.34 | 0.80 ± 0.30 | 0.74 ± 0.33 | 0.78 ± 0.34 | 0.82 ± 0.30 | 0.64 ± 0.44 | 0.82 ± 0.27 | 0.65 ± 0.43 | |
| 28(19:9) | 34(20:14) | 31(19:13) | 35(19:16) | 29(19:10) | 31(18:13) | 25(17:8) | 30(20:10) | 29(16:13) | 30(19:11) | 33(18:15) | |
| 0.22 ± 0.08 | 0.13 ± 0.34 | 0.26 ± 0.21 | 0.20 ± 0.12 | 0.22 ± 0.30 | 0.24 ± 0.24 | 0.19 ± 0.04 | 0.22 ± 0.09 | 0.10 ± 0.01 | 0.20 ± 0.13 | 0.11 ± 0.07 | |
| 80 | 53 | 19 | 23 | 22 | 12 | 55 | 49 | 85 | 25 | 104 |
*The percentages in the same row as d (i, j = 0, 1) in the table are the proportions of significant gametic LDs among all pairs of LD tests. The data in the second row under each d is the gametic LD among CNV loci from the same chromosomes. The data in the third row under each d is the observed numbers of pairs with significant LDs from the same chromosomes (non-overlapped locus pairs: overlapped locus pairs). The data in the fourth row under each d is the significant gametic LDs among CNV loci from different chromosomes. The data in the fifth row under each d is the observed numbers of pairs with significant LDs among CNV loci from different chromosomes.
Percentages of the pairs of CNV loci with significant zygotic LDs in 11 human populations*.
| LD | ASW | CEU | CHB | CHD | GIH | JPT | LWK | MEX | MKK | TSI | YRI |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.0045 | 0.0082 | 0.0141 | 0.0189 | 0.0123 | 0.0193 | 0.0046 | 0.0063 | 0.0039 | 0.0093 | 0.0067 | |
| 0.0041 | 0.0066 | 0.0077 | 0.0101 | 0.0047 | 0.0083 | 0.0025 | 0.0042 | 0.0031 | 0.0059 | 0.0034 | |
| 0.0019 | 0.0033 | 0.0026 | 0.0025 | 0.0019 | 0.0028 | 0.0013 | 0.0021 | 0.0008 | 0.0025 | 0.0009 | |
| 0.0008 | 0.0016 | 0.0026 | 0.0025 | 0.0019 | 0.0028 | 0.0008 | 0.0014 | 0.0008 | 0.0017 | 0.0009 | |
| 0.0041 | 0.0115 | 0.0116 | 0.0126 | 0.0075 | 0.0124 | 0.0038 | 0.0063 | 0.0043 | 0.0085 | 0.0051 | |
| 0.0143 | 0.0271 | 0.0308 | 0.0327 | 0.0226 | 0.0332 | 0.0143 | 0.0253 | 0.0129 | 0.0221 | 0.0111 | |
| 0.0113 | 0.0222 | 0.0218 | 0.0214 | 0.0151 | 0.0276 | 0.0114 | 0.0211 | 0.0125 | 0.0161 | 0.0098 | |
| 0.0023 | 0.0025 | 0.0026 | 0.0025 | 0.0019 | 0.0041 | 0.0008 | 0.0014 | 0.0027 | 0.0017 | 0.0013 | |
| 0.0034 | 0.0057 | 0.0051 | 0.0050 | 0.0029 | 0.0069 | 0.0008 | 0.0028 | 0.0008 | 0.0042 | 0.0013 | |
| 0.0143 | 0.0246 | 0.0244 | 0.0264 | 0.0160 | 0.0304 | 0.0105 | 0.0232 | 0.0129 | 0.0178 | 0.0111 | |
| 0.0139 | 0.0279 | 0.0283 | 0.0252 | 0.0160 | 0.0359 | 0.0127 | 0.0246 | 0.0145 | 0.0187 | 0.0128 | |
| 0.0023 | 0.0016 | 0 | 0 | 0 | 0.0014 | 0.0004 | 0 | 0.0020 | 0 | 0.0013 | |
| 0.0004 | 0.0025 | 0 | 0.0013 | 0 | 0 | 0 | 0.0007 | 0 | 0 | 0 | |
| 0.0026 | 0.0049 | 0 | 0.0038 | 0.0019 | 0.0069 | 0.0017 | 0.0035 | 0.0024 | 0.0025 | 0.0026 | |
| 0.0041 | 0.0049 | 0.0051 | 0.0038 | 0.0047 | 0.0069 | 0.0021 | 0.0049 | 0.0016 | 0.0034 | 0.0034 | |
| 0.0015 | 0 | 0 | 0 | 0 | 0.0014 | 0.0008 | 0.0007 | 0.0004 | 0.0008 | 0.0013 |
*D is the zygotic LD between genotype i at the first locus and j at the second locus (i, j = 0, 1, 2, 3).
Ratios of the joint mutation and migration rates at CNV loci to those at SNP sites (above diagonal), and the ratios of the mutation rate to the migration rate at CNV loci (below diagonal).
| ASW | CEU | CHB | CHD | GIH | JPT | LWK | MEX | MKK | TSI | YRI | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASW | 4.008 (0.0303) | 4.4667 (0.0288) | 4.5564 (0.0293) | 3.8132 (0.0266) | 4.8110 (0.0312) | 1.4126 (0.0144) | 3.9084 (0.0278) | 1.6765 (0.0587) | 4.1896 (0.0250) | 1.5129 (0.0664) | |
| CEU | 2.0054 (0.0202) | 3.9223 (0.0309) | 3.9942 (0.0311) | 2.5468 (0.0239) | 3.6992 (0.0284) | 4.7303 (0.0335) | 2.7259 (0.0115) | 4.5349 (0.0262) | 1.0528 (0.0266) | 4.7910 (0.0286) | |
| CHB | 2.3111 (0.0192) | 1.9482 (0.0206) | 0.2624 (0.0263) | 2.7677 (0.0256) | 1.1119 (0.0164) | 4.9274 (0.0269) | 2.9478 (0.0260) | 5.7238 (0.0375) | 3.6286 (0.0282) | 5.4445 (0.0289) | |
| CHD | 2.3709 (0.0195) | 1.9962 (0.0207) | — | 2.8878 (0.0266) | 1.0967 (0.0145) | 4.9298 (0.0299) | 3.0640 (0.0266) | 5.5163 (0.0368) | 3.7503 (0.0289) | 5.4487 (0.0288) | |
| GIH | 1.8755 (0.0177) | 1.0312 (0.0159) | 1.1785 (0.0171) | 1.2586 (0.0178) | 2.6720 (0.0208) | 4.4186 (0.0269) | 2.4310 (0.0159) | 4.6233 (0.0290) | 2.4969 (0.0168) | 4.9364 (0.0321) | |
| JPT | 2.5407 (0.0208) | 1.7994 (0.0189) | 0.0746 (0.0109) | 0.0644 (0.0097) | 1.1147 (0.0139) | 4.8732 (0.0294) | 2.8197 (0.0244) | 5.3380 (0.0352) | 3.4744 (0.0264) | 5.4210 (0.0350) | |
| LWK | 0.2751 (0.0096) | 2.4869 (0.0223) | 2.6183 (0.0180) | 2.6199 (0.0199) | 2.2790 (0.0179) | 2.5822 (0.0196) | 4.4772 (0.0272) | 2.0917 (0.0135) | 4.7997 (0.0314) | 1.3588 (0.0173) | |
| MEX | 1.9390 (0.0185) | 1.1506 (0.0077) | 1.2985 (0.0173) | 1.3760 (0.0177) | 0.9540 (0.0106) | 1.2131 (0.0162) | 2.3181 (0.0181) | 4.8450 (0.0363) | 2.9185 (0.0204) | 4.9055 (0.0319) | |
| MKK | 0.4510 (0.0391) | 2.3566 (0.0175) | 3.1492 (0.0250) | 3.0109 (0.0246) | 2.4155 (0.0194) | 2.8920 (0.0235) | 0.7278 (0.0090) | 2.5633 (0.0242) | 5.1655 (0.0365) | 2.1402 (0.0088) | |
| TSI | 2.1264 (0.0167) | 0.0352 (0.0177) | 1.7524 (0.0188) | 1.8335 (0.0193) | 0.9980 (0.0112) | 1.6496 (0.0176) | 2.5331 (0.0210) | 1.2790 (0.0136) | 2.7770 (0.0244) | 5.2357 (0.0291) | |
| YRI | 0.3419 (0.0443) | 2.5273 (0.0190) | 2.9630 (0.0193) | 2.9658 (0.0192) | 2.6243 (0.0214) | 2.9473 (0.0234) | 0.2392 (0.0115) | 2.6037 (0.0213) | 0.7601 (0.0059) | 2.8238 (0.0194) |
*negative value.
Standard deviations are shown in parentheses.