| Literature DB >> 15588495 |
J Michael Macpherson1, Sohini Ramachandran, Lisa Diamond, Marcus W Feldman.
Abstract
Polymorphisms in microsatellites on the human Y chromosome have been used to estimate important demographic parameters of human history. We compare two coalescent-based statistical methods that give estimates for a number of demographic parameters using the seven Y chromosome polymorphisms in the HGDP-CEPH Cell Line Panel, a collection of samples from 52 worldwide populations. The estimates for the time to the most recent common ancestor vary according to the method used and the assumptions about the prior distributions of model parameters, but are generally consistent with other global Y chromosome studies. We explore the sensitivity of these results to assumptions about the prior distributions and the evolutionary models themselves.Entities:
Mesh:
Year: 2004 PMID: 15588495 PMCID: PMC3525096 DOI: 10.1186/1479-7364-1-5-345
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Prior distributions used to analyse microsatellite polymorphisms on the Y chromosome. The rejection algorithm [9] and BATWING [12] were run on each set of priors. Distributions were chosen based on past studies; the means for each distribution are given in brackets. μ is the mutation rate per locus per generation, Nis the ancestral population size, t0 is the time of start of exponential population growth in generations before present, r is rate of exponential population growth per generation
| Prior set | Derived from | μ prior [mean] | |||
|---|---|---|---|---|---|
| Pritchard | gamma (10, 12,500) [0.0008] | Log normal (8.5, 2) [36,000] | exp (0.001) [1000] | gamma (1, 200) [0.005] | |
| Kayser | gamma (1, 416) [0.0024] | gamma (3, 0.003) [1000] | exp (0.001) [1000] | gamma (2, 400) [0.005] | |
| Wilson, Weale and Balding (2003) | gamma (18, 8,170) [0.0022] | Gamma (3, 0.001) [3000] | exp (0.001) [1000] | gamma (2, 400) [0.005] | |
| Zhivotovsky | gamma (1.5, 2,175) [0.0069] | gamma (3, 0.003) [1000] | exp (0.001) [1000] | gamma (2, 400) [0.005] |
Analysis of molecular variance for the Y chromosome. Ninety-five per cent confidence intervals (CIs; in parentheses) were calculated using 1,000 bootstraps across loci. The World-B97 sampl [28] consists of 14 populations that were chosen in order to approximate the sample of Barbujan et al.[26] The fifth column corrects for the smaller Y chromosome population size, as in Pérez-Lezaun et al.[30] The estimate and CI for the among-region variance component for Eurasia are set to zero because Weir's unbiased estimator [23] yields slightly negative value
| Sample | Number of regions | Number of populations | Variance components and 95% confidence intervals (%) | |||
|---|---|---|---|---|---|---|
| Within populations | Within populations (corrected) | Among populations within regions | Among regions | |||
| World | 1 | 52 | 80.4 (74.7-84.5) | 94.3 (92.2-95.6) | 19.6 (15.5-25.2) | |
| World | 5 | 52 | 80.2 (73.2-85.6) | 94.2 (91.6-96.0) | 15.0 (13.2-17.1) | 4.80 (0.00-10.6) |
| World | 7 | 52 | 83.9 (77.5-88.8) | 95.4 (93.2-96.9) | 15.5 (11.2-17.2) | 0.56 (0.00-6.16) |
| World-B97 | 5 | 14 | 84.7 (71.5-97.1) | 95.7 (90.9-99.3) | 8.43 (2.90-11.1) | 6.85 (0.00-22.1) |
| Africa | 1 | 6 | 91.2 (87.6-94.5) | 97.6 (96.6-98.6) | 8.78 (5.49-12.4) | |
| Eurasia | 1 | 21 | 86.4 (83.4-89.1) | 96.2 (95.3-97.0) | 13.6 (10.9-16.6) | |
| Eurasia | 3 | 21 | 88.9 (85.4-91.7) | 97.0 (95.9-97.8) | 11.1 (8.25-14.4) | 0.00 (0.00-0.00) |
| Europe | 1 | 8 | 86.8 (81.3-92.8) | 96.3 (94.6-98.1) | 13.2 (7.17-18.7) | |
| Middle East | 1 | 4 | 66.2 (58.7-74.6) | 88.7 (85.0-92.2) | 33.8 (25.4-41.2) | |
| Central/South Asia | 1 | 9 | 94.7 (92.0-97.2) | 98.6 (97.9-99.3) | 5.30 (2.77-8.04) | |
| East Asia | 1 | 18 | 81.0 (71.8-87.7) | 94.5 (91.1-96.6) | 19.0 (12.3-28.2) | |
| Oceania | 1 | 2 | 80.6 (70.2-92.3) | 94.3 (90.4-98.0) | 19.4 (7.65-29.8) | |
| America | 1 | 5 | 58.0 (48.2-70.0) | 84.7 (78.8-90.3) | 42.0 (29.9-51.7) | |
Demographic parameters estimated from seven Y chromosome microsatellite loci in 677 individuals drawn from 52 global populations. Columns: priors (see Table 1 for details of each set of prior distributions); method (RA = rejection algorithm using stepwise mutation model [SMM], symmetrical geometric model [SGM], or range constraint model [RCM], BW = BATWING followed by number of Markov Chain Monte Carlo updates); remaining columns are posterior distributions, with mean and 95 per cent credible interval, TMRCA is the mean time to the most recent common ancestor in years before present (generation time is assumed to be 25 years), μ is mutation rate per locus per generation, Nis ancestral population size, t0 is time of start of exponential population growth in generation before present is rate of exponential population growth per generation
| Prior set | Method | |||||
|---|---|---|---|---|---|---|
| RA, SMM | 86 [30-221] | 0.71 [0.34-1.19] | 1630 [140-4520] | 22 [8-51] | 0.77 [0.23-2.13] | |
| RA, SGM | 58 [20-147] | 0.71 [0.36-1.18] | 990 [70-3110] | 23 [8-52] | 0.75 [0.25-2.08] | |
| RA, RCM | 121 [33-441] | 0.73 [0.35-1.22] | 2070 [240-6510] | 21 [8-50] | 0.79 [0.22-2.16] | |
| BW, 4 × 106 | 82 [34-195] | 0.80 [0.40-1.34] | 2510 [400-11630] | 36 [12-73] | 0.32 [0.15-0.58] | |
| BW, 200 × 106 | 64 [31-131] | 0.79 [0.39-1.32] | 710 [250-1740] | 48 [24-92] | 0.30 [0.14-0.52] | |
| RA, SMM | 68 [21-178] | 0.89 [0.27-2.15] | 1020 [220-2410] | 24 [7-64] | 0.67 [0.20-1.55] | |
| RA, SGM | 64 [21-155] | 0.66 [0.20-1.59] | 960 [220-2270] | 27 [9-61] | 0.67 [0.21-1.60] | |
| RA, RCM | 78 [22-199] | 0.98 [0.29-2.51] | 1090 [290-2390] | 22 [7-55] | 0.69 [0.23-1.55] | |
| BW,4 × 106 | 55 [16-143] | 1.27 [0.31-4.02] | 760 [190-2350] | 42 [4-118] | 0.43 [0.11-1.33] | |
| BW, 200 × 106 | 63 [17-178] | 1.10 [0.31-2.71] | 660 [190-1610] | 52 [13-159] | 0.37 [0.11-0.92] | |
| BW, 800 × 106 | 61 [18-164] | 0.90 [0.27-2.12] | 760 [230-1760] | 42 [11-124] | 0.47 [0.14-1.10] | |
| RA, SMM | 39 [19-81] | 1.68 [1.01-2.72] | 940 [250-1940] | 11 [5-18] | 1.01 [0.46-1.82] | |
| RA, SGM | 29 [14-57] | 1.49 [0.91-2.18] | 600 [230-1300] | 13 [7-21] | 0.85 [0.49-1.29] | |
| RA, RCM | 83 [20-266] | 1.76 [1.08-2.71] | 1470 [250-3730] | 11 [5-24] | 0.87 [0.36-1.95] | |
| BW,4 × 106 | 32 [16-66] | 1.85 [1.10-2.78] | 1100 [260-4160] | 14 [5-26] | 0.72 [0.41-1.17] | |
| BW, 200 × 106 | 29 [15-57] | 1.81 [1.07-2.74] | 590 [240-1210] | 16 [9-27] | 0.70 [0.40-1.09] | |
| RA, SMM | 79 [27-200] | 0.71 [0.24-1.54] | 1120 [290-2550] | 26 [9-63] | 0.62 [0.20-1.48] | |
| RA, SGM | 71 [24-170] | 0.55 [0.18-1.24] | 1010 [250-2310] | 30 [10-74] | 0.62 [0.20-1.42] | |
| RA, RCM | 88 [28-215] | 0.74 [0.25-1.61] | 1210 [330-2660] | 26 [9-65] | 0.64 [0.20-1.49] | |
| BW,4 × 106 | 41 [16-94] | 1.40 [0.52-2.94] | 1490 [540-3420] | 23 [5-60] | 0.48 [0.18-1.04] | |
| BW, 200 × 106 | 84 [26-228] | 0.72 [0.22-1.64] | 690 [210-1630] | 80 [22-232] | 0.25 [0.08-0.57] | |
Figure 1The most recent common ancestor (TMRCA) estimated from BATWING runs over the four sets of priors. Each plot point is the mean of 4 million Markov Chain Monte Carlo (MCMC) update
Ratio of acceptances per million trials at threshold δ = 0.1 and average acceptance rates using the rejection algorithm [9] and different mutation models for all four sets of priors (Table 1)
| Prior set | Mutation model | |||
|---|---|---|---|---|
| SGM/SMM | SMM/RCM | SGM/RCM | ||
| 39/61 | 44/56 | 33/67 | 1.34 | |
| 43/57 | 49/51 | 42/58 | 1.47 | |
| 16/84 | 30/70 | 8/92 | 0.14 | |
| 52/48 | 49/51 | 48/52 | 2.82 | |
Results from rejection algorithm with one and two exponential growth phases usin P priors
| (Mean, 95% range) | ||||
|---|---|---|---|---|
| Single-phase | Dual-phase | |||
| 58,000 | (20,000, 14,7000) | 46,000 | (18,000, 130,000) | |
| 990 | (44, 3,100) | 880 | (55, 3,400) | |
| 0.0075 | (0.0025, 0.028) | 0.0049 | (0.00018, 0.015) | |
| - | 0.0062 | (0.00028, 0.018) | ||
| 23,000 | (8000, 52,000) | 12,000 | (620, 32,000) | |
| - | 13,000 | (430, 41,000) | ||
Mean parameter estimates and 95 per cent credible intervals for individual populations obtained with the rejection algorithm using the priors and the P symmetrical geometric mode
| World | 58 [20-150] | 0.71 [0.34-1.19] | 990 [70-3100] | 22 [8.5-50] | 0.83 [0.26-2.0] |
| Africa | 53 [15-160] | 0.70 [0.35-1.25] | 1,000 [62-3500] | 16 [3.8-43] | 0.64 [0.12-1.8] |
| Non-Africa | 54 [20-140] | 0.70 [0.34-1.17] | 970 [64-3500] | 23 [8.4-51] | 0.81 [0.26-2.1] |
| America | 30 [11-83] | 0.74 [0.35-1.25] | 730 [57-2100] | 14 [1.0-48] | 0.43 [0.18-1.7] |
| Central/South Asia | 51 [18-140] | 0.73 [0.34-1.27] | 970 [61-3300] | 21 [6.8-48] | 0.65 [0.19-1.8] |
| East Asia | 55 [20-150] | 0.72 [0.36-1.26] | 1,000 [59-3500] | 23 [7.8-53] | 0.70 [0.21-1.9] |
| Europe | 44 [16-110] | 0.69 [0.34-1.19] | 800 [52-2700] | 20 [7.0-48] | 0.70 [0.21-1.9] |
| Eurasia | 51 [19-140] | 0.70 [0.35-1.21] | 940 [92-3100] | 20 [7.0-48] | 0.81 [0.25-2.1] |
| Middle East | 52 [15-150] | 0.76 [0.37-1.33] | 1,100 [96-3300] | 15 [1.7-48] | 0.43 [0.31-1.3] |
| Oceania | 59 [18-160] | 0.76 [0.36-1.32] | 1,400 [120-4100] | 17 [0.8-55] | 0.37 [0.15-1.4] |
Figure 2Comparison of mutation rate prior distributions and posterior distributions estimated by BATWING between four sets of priors. Priors are represented by curves, posteriors by histograms