| Literature DB >> 24034507 |
Anthony Y C Kuk1, Xiang Li, Jinfeng Xu.
Abstract
BACKGROUND: Pooling is a cost effective way to collect data for genetic association studies, particularly for rare genetic variants. It is of interest to estimate the haplotype frequencies, which contain more information than single locus statistics. By viewing the pooled genotype data as incomplete data, the expectation-maximization (EM) algorithm is the natural algorithm to use, but it is computationally intensive. A recent proposal to reduce the computational burden is to make use of database information to form a list of frequently occurring haplotypes, and to restrict the haplotypes to come from this list only in implementing the EM algorithm. There is, however, the danger of using an incorrect list, and there may not be enough database information to form a list externally in some applications.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24034507 PMCID: PMC3847674 DOI: 10.1186/1471-2156-14-82
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Haplotype frequency estimates in the region using data from 148 obese individuals
| | | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| None | 0.7927 | 0.7941 | 0.7995 | 0.7984 | | 0.7912 | 0.8202 | 0.8169 | 0.7898 |
| 1 | 0.0536 | 0.0505 | 0.0509 | 0.0544 | | 0.0497 | 0.0397 | 0.0398 | 0.0494 |
| 2 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | | | | 0.0034 |
| 3 | 0.0456 | 0.0433 | 0.0436 | 0.0441 | | 0.0381 | 0.0291 | 0.0291 | 0.0440 |
| 5 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | | | | 0.0034 |
| 6 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | 0.0067 | 0.0034 | 0.0034 | 0.0034 |
| 9 | 0.0085 | 0.0072 | 0.0073 | 0.0103 | | 0.0133 | 0.0079 | 0.0079 | 0.0101 |
| 11 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | | | | 0.0034 |
| 14 | | | | | | | | | 0.0034 |
| 15 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | | | | 0.0034 |
| 19 | 0.0043 | 0.0068 | 0.0068 | 0.0068 | | 0.0067 | 0.0069 | 0.0101 | 0.0101 |
| 20 | 0.0043 | 0.0068 | 0.0068 | 0.0068 | | 0.0067 | 0.0069 | 0.0101 | 0.0101 |
| 21 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | | | | 0.0034 |
| 22 | 0.0127 | 0.0101 | 0.0102 | 0.0103 | | 0.0197 | 0.0101 | 0.0101 | 0.0101 |
| 23 | 0.0043 | 0.0034 | 0.0034 | 0.0034 | | 0.0067 | 0.0034 | 0.0034 | 0.0034 |
| 24 | 0.0127 | 0.0101 | 0.0102 | 0.0103 | | 0.0067 | 0.0040 | 0.0040 | 0.0040 |
| 1, 3 | 0.0048 | 0.0040 | 0.0040 | 0.0038 | | 0.0090 | 0.0059 | 0.0059 | 0.0020 |
| 1, 9 | 0.0034 | 0.0029 | 0.0029 | | | 0.0032 | 0.0022 | 0.0022 | |
| 1, 15 | | | | | | 0.0056 | 0.0034 | 0.0034 | |
| 1, 24 | | | | | | 0.0098 | 0.0064 | 0.0064 | 0.0095 |
| 2, 3 | | | | | | 0.0059 | 0.0034 | 0.0034 | |
| 3, 14 | 0.0040 | 0.0034 | 0.0034 | | | 0.0059 | 0.0034 | 0.0034 | |
| 3, 11 | | | | | | 0.0059 | 0.0034 | 0.0034 | |
| 5, 21 | | | | | | 0.0067 | 0.0034 | 0.0034 | |
| 6, 7 | 0.0250 | 0.0203 | 0.0204 | 0.0205 | | 0.0314 | 0.0182 | 0.0181 | 0.0189 |
| 19, 20 | | 0.0017 | | | | | 0.0033 | | |
| 3, 6, 7 | 0.0026 | 0.0034 | 0.0034 | 0.0034 | | 0.0066 | 0.0057 | 0.0057 | 0.0081 |
| 6, 19, 20 | | 0.0017 | | | | | | | |
| 7, 19, 20 | | 0.0017 | | | | | | | |
| 1, 6, 7, 24 | 0.0039 | 0.0034 | 0.0034 | 0.0034 | | | | | |
| 6, 7, 19, 20 | 0.0041 | 0.0017 | | | | 0.0057 | 0.0033 | 0.0034 | 0.0034 |
| 1, 3, 6, 7, 24 | | | | | | 0.0041 | 0.0032 | 0.0032 | |
| 1, 12, 13, 22, 25 | 0.0039 | 0.0034 | 0.0034 | 0.0034 | 0.0053 | 0.0034 | 0.0034 | 0.0034 | |
Estimates of haplotype frequencies for the 25 rare variants in the MGLL region obtained by CDMLE (collapsed data maximum likelihood estimation), EML (EM with combinatorially determined list), EM-CDL (EM with CD list) and EM-ATCDL (EM with augmented and trimmed CD list with adaptive threshold) based on n=148/k pools of k individuals each.
Running times of EM algorithms based on different lists
| | | ||||||
|---|---|---|---|---|---|---|---|
| 1.14 | 0.08 | 3.68 | | 0.72 | 0.13 | 4.57 | |
| 18.71 | 0.10 | 7.05 | | 126.38 | 0.17 | 6.78 | |
| > 10 h | 0.23 | 7.39 | > 10 h | 0.13 | 27.93 | ||
Running times in seconds for EML (EM with combinatorially determined list), EM-CDL (EM with CD list) and EM-ATCDL (EM with augmented and trimmed CD list with adaptive threshold) for estimating the haplotype distributions of the 25 rare variants in the MGLL region and the 32 rare variants in the FAAH region when 148 obese individuals are grouped into pools of various sizes.
Average estimates of haplotype frequencies for a 25 loci case
| | | | | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | |
| None | 0.7995 | | 0.7973 | 0.8279 | 0.8283 | 0.8003 | 0.8067 | | 0.7961 | 0.8535 | 0.7957 | 0.8119 |
| | | | (0.0232) | (0.0169) | (0.0170) | (0.0215) | (0.0192) | | (0.0204) | (0.0118) | (0.0179) | (0.0144) |
| 1 | 0.0509 | | 0.0508 | 0.0412 | 0.0412 | 0.0477 | 0.0477 | | 0.0502 | 0.0344 | 0.0457 | 0.0494 |
| | | | (0.0155) | (0.0115) | (0.0115) | (0.0119) | (0.0112) | | (0.0152) | (0.0083) | (0.0093) | (0.0086) |
| 2 | 0.0034 | | 0.0036 | 0.0019 | 0.0019 | 0.0031 | 0.0024 | | 0.0032 | 0.0008 | 0.0028 | 0.0020 |
| | | | (0.0040) | (0.0022) | (0.0022) | (0.0025) | (0.0027) | | (0.0044) | (0.0012) | (0.0023) | (0.0029) |
| 3 | 0.0436 | | 0.0441 | 0.0353 | 0.0353 | 0.0425 | 0.0408 | | 0.0435 | 0.0277 | 0.0396 | 0.0411 |
| | | | (0.0139) | (0.0104) | (0.0104) | (0.0112) | (0.0107) | | (0.0144) | (0.0077) | (0.0097) | (0.0080) |
| 5 | 0.0034 | | 0.0035 | 0.0019 | 0.0019 | 0.0031 | 0.0027 | | 0.0031 | 0.0008 | 0.0028 | 0.0015 |
| | | | (0.0039) | (0.0021) | (0.0021) | (0.0028) | (0.0029) | | (0.0043) | (0.0011) | (0.0017) | (0.0024) |
| 6 | 0.0034 | | 0.0027 | 0.0017 | 0.0016 | 0.0029 | 0.0022 | | 0.0038 | 0.0013 | 0.0030 | 0.0019 |
| | | | (0.0035) | (0.0021) | (0.0021) | (0.0028) | (0.0030) | | (0.0050) | (0.0017) | (0.0022) | (0.0026) |
| 9 | 0.0073 | | 0.0092 | 0.0056 | 0.0056 | 0.0085 | 0.0087 | | 0.0073 | 0.0029 | 0.0074 | 0.0079 |
| | | | (0.0065) | (0.0039) | (0.0039) | (0.0046) | (0.0052) | | (0.0066) | (0.0026) | (0.0045) | (0.0062) |
| 11 | 0.0034 | | 0.0041 | 0.0022 | 0.0022 | 0.0036 | 0.0029 | | 0.0032 | 0.0008 | 0.0027 | 0.0016 |
| | | | (0.0048) | (0.0025) | (0.0025) | (0.0029) | (0.0031) | | (0.0046) | (0.0013) | (0.0021) | (0.0025) |
| 15 | 0.0034 | | 0.0039 | 0.0021 | 0.0021 | 0.0032 | 0.0026 | | 0.0034 | 0.0009 | 0.0029 | 0.0017 |
| | | | (0.0052) | (0.0028) | (0.0028) | (0.0032) | (0.0033) | | (0.0050) | (0.0013) | (0.0021) | (0.0025) |
| 19 | 0.0068 | | 0.0069 | 0.0039 | 0.0039 | 0.0061 | 0.0055 | | 0.0075 | 0.0022 | 0.0058 | 0.0051 |
| | | | (0.0056) | (0.0031) | (0.0031) | (0.0040) | (0.0044) | | (0.0066) | (0.0020) | (0.0031) | (0.0042) |
| 20 | 0.0068 | | 0.0073 | 0.0041 | 0.0041 | 0.0061 | 0.0058 | | 0.0080 | 0.0023 | 0.0057 | 0.0051 |
| | | | (0.0060) | (0.0035) | (0.0035) | (0.0042) | (0.0045) | | (0.0065) | (0.0020) | (0.0028) | (0.0040) |
| 21 | 0.0034 | | 0.0038 | 0.0020 | 0.0020 | 0.0032 | 0.0029 | | 0.0041 | 0.0011 | 0.0033 | 0.0022 |
| | | | (0.0041) | (0.0022) | (0.0022) | (0.0027) | (0.0031) | | (0.0051) | (0.0014) | (0.0023) | (0.0028) |
| 22 | 0.0102 | | 0.0117 | 0.0070 | 0.0070 | 0.0095 | 0.0099 | | 0.0110 | 0.0043 | 0.0096 | 0.0091 |
| | | | (0.0075) | (0.0047) | (0.0047) | (0.0053) | (0.0054) | | (0.0085) | (0.0033) | (0.0046) | (0.0056) |
| 23 | 0.0034 | | 0.0032 | 0.0018 | 0.0018 | 0.0028 | 0.0024 | | 0.0038 | 0.0010 | 0.0029 | 0.0019 |
| | | | (0.0039) | (0.0023) | (0.0023) | (0.0028) | (0.0030) | | (0.0045) | (0.0012) | (0.0021) | (0.0026) |
| 24 | 0.0102 | | 0.0096 | 0.0060 | 0.0060 | 0.0095 | 0.0098 | | 0.0114 | 0.0043 | 0.0098 | 0.0118 |
| | | | (0.0065) | (0.0041) | (0.0041) | (0.0051) | (0.0057) | | (0.0076) | (0.0028) | (0.0047) | (0.0057) |
| 1, 3 | 0.0040 | | 0.0048 | 0.0039 | 0.0039 | 0.0045 | 0.0043 | | 0.0071 | 0.0049 | 0.0061 | 0.0052 |
| | | | (0.0049) | (0.0037) | (0.0037) | (0.0041) | (0.0040) | | (0.0070) | (0.0040) | (0.0046) | (0.0048) |
| 1, 9 | 0.0029 | | 0.0030 | 0.0021 | 0.0021 | 0.0023 | 0.0018 | | 0.0051 | 0.0023 | 0.0028 | 0.0017 |
| | | | (0.0035) | (0.0024) | (0.0024) | (0.0033) | (0.0033) | | (0.0055) | (0.0021) | (0.0031) | (0.0032) |
| 6, 7 | 0.0204 | | 0.0210 | 0.0150 | 0.0148 | 0.0215 | 0.0195 | | 0.0203 | 0.0104 | 0.0210 | 0.0219 |
| | | | (0.0098) | (0.0072) | (0.0072) | (0.0075) | (0.0077) | | (0.0105) | (0.0043) | (0.0068) | (0.0060) |
| 3, 14 | 0.0034 | | 0.0035 | 0.0021 | 0.0021 | 0.0020 | 0.0028 | | 0.0031 | 0.0012 | 0.0015 | 0.0021 |
| | | | (0.0037) | (0.0022) | (0.0022) | (0.0032) | (0.0031) | | (0.0038) | (0.0015) | (0.0024) | (0.0026) |
| 3, 6, 7 | 0.0034 | | 0.0037 | 0.0028 | 0.0029 | 0.0025 | 0.0030 | | 0.0047 | 0.0028 | 0.0022 | 0.0016 |
| | | | (0.0047) | (0.0036) | (0.0036) | (0.0044) | (0.0047) | | (0.0053) | (0.0025) | (0.0033) | (0.0035) |
| 1, 6, 7, 24 | 0.0034 | | 0.0031 | 0.0021 | 0.0021 | 0.0006 | 0.0006 | | 0.0036 | 0.0018 | 0.0006 | 0.0000 |
| | | | (0.0033) | (0.0023) | (0.0023) | (0.0023) | (0.0021) | | (0.0037) | (0.0018) | (0.0017) | (0.0004) |
| 1, 12, 13, 22, 25 | 0.0034 | | 0.0037 | 0.0024 | 0.0024 | 0.0026 | 0.0029 | | 0.0038 | 0.0016 | 0.0005 | 0.0025 |
| | | | (0.0038) | (0.0025) | (0.0025) | (0.0031) | (0.0029) | | (0.0035) | (0.0015) | (0.0018) | (0.0027) |
| Sum of remaining | | 0.0376 | 0.0248 | 0.0248 | 0.0117 | 0.0124 | | 0.0893 | 0.0366 | 0.0255 | 0.0109 | |
| haplotype probabilities | | | | | | | | | | | | |
| Sum of probabilities | | 0.0247 | 0.0241 | 0.0244 | 0.0218 | 0.0286 | | 0.0296 | 0.0285 | 0.0179 | 0.0367 | |
| of missed haplotypes | | | | | | | | | | | | |
| Sum of squared errors | | 0.00166 | 0.00186 | 0.00189 | 0.00110 | 0.00106 | | 0.00201 | 0.00415 | 0.00091 | 0.00089 | |
| Length of list | | 26.77 | 116.28 | 26.77 | 19.06 | 18.25 | | 45.23 | 45.23 | 25.38 | 15.62 | |
| SD of length | | (3.26) | (81.30) | (3.26) | (3.03) | (2.18) | | (3.94) | (3.94) | (4.78) | (2.40) | |
| | | | | | | | | | | | | |
| None | 0.7995 | | 0.7979 | 0.8248 | 0.8250 | 0.7981 | 0.8009 | | 0.7990 | 0.8451 | 0.7970 | 0.8040 |
| | | | (0.0150) | (0.0117) | (0.0117) | (0.0148) | (0.0132) | | (0.0154) | (0.0086) | (0.0133) | (0.0125) |
| 1 | 0.0509 | | 0.0514 | 0.0433 | 0.0433 | 0.0492 | 0.0503 | | 0.0502 | 0.0387 | 0.0462 | 0.0507 |
| | | | (0.0103) | (0.0082) | (0.0082) | (0.0089) | (0.0088) | | (0.0121) | (0.0069) | (0.0077) | (0.0080) |
| 2 | 0.0034 | | 0.0035 | 0.0020 | 0.0020 | 0.0033 | 0.0031 | | 0.0037 | 0.0011 | 0.0030 | 0.0024 |
| | | | (0.0032) | (0.0018) | (0.0018) | (0.0024) | (0.0026) | | (0.0035) | (0.0011) | (0.0013) | (0.0021) |
| 3 | 0.0436 | | 0.0430 | 0.0362 | 0.0362 | 0.0435 | 0.0426 | | 0.0441 | 0.0322 | 0.0406 | 0.0426 |
| | | | (0.0092) | (0.0075) | (0.0075) | (0.0082) | (0.0074) | | (0.0105) | (0.0054) | (0.0071) | (0.0065) |
| 5 | 0.0034 | | 0.0033 | 0.0018 | 0.0018 | 0.0032 | 0.0028 | | 0.0034 | 0.0011 | 0.0030 | 0.0023 |
| | | | (0.0028) | (0.0016) | (0.0016) | (0.0020) | (0.0023) | | (0.0035) | (0.0011) | (0.0013) | (0.0021) |
| 6 | 0.0034 | | 0.0038 | 0.0023 | 0.0023 | 0.0033 | 0.0031 | | 0.0033 | 0.0014 | 0.0028 | 0.0022 |
| | | | (0.0031) | (0.0019) | (0.0020) | (0.0021) | (0.0025) | | (0.0035) | (0.0013) | (0.0014) | (0.0019) |
| 9 | 0.0073 | | 0.0080 | 0.0054 | 0.0054 | 0.0079 | 0.0088 | | 0.0081 | 0.0036 | 0.0070 | 0.0092 |
| | | | (0.0038) | (0.0026) | (0.0026) | (0.0032) | (0.0037) | | (0.0043) | (0.0019) | (0.0026) | (0.0031) |
| 11 | 0.0034 | | 0.0032 | 0.0018 | 0.0018 | 0.0030 | 0.0027 | | 0.0032 | 0.0010 | 0.0029 | 0.0023 |
| | | | (0.0026) | (0.0016) | (0.0016) | (0.0022) | (0.0024) | | (0.0031) | (0.0011) | (0.0015) | (0.0021) |
| 15 | 0.0034 | | 0.0035 | 0.0019 | 0.0019 | 0.0030 | 0.0028 | | 0.0038 | 0.0012 | 0.0031 | 0.0028 |
| | | | (0.0033) | (0.0018) | (0.0018) | (0.0023) | (0.0025) | | (0.0031) | (0.0010) | (0.0015) | (0.0021) |
| 19 | 0.0068 | | 0.0063 | 0.0039 | 0.0039 | 0.0062 | 0.0061 | | 0.0066 | 0.0026 | 0.0056 | 0.0057 |
| | | | (0.0039) | (0.0025) | (0.0025) | (0.0029) | (0.0034) | | (0.0041) | (0.0015) | (0.0018) | (0.0026) |
| 20 | 0.0068 | | 0.0068 | 0.0042 | 0.0042 | 0.0062 | 0.0063 | | 0.0063 | 0.0026 | 0.0054 | 0.0059 |
| | | | (0.0039) | (0.0025) | (0.0025) | (0.0028) | (0.0030) | | (0.0038) | (0.0015) | (0.0022) | (0.0026) |
| 21 | 0.0034 | | 0.0038 | 0.0022 | 0.0022 | 0.0035 | 0.0032 | | 0.0037 | 0.0012 | 0.0031 | 0.0025 |
| | | | (0.0035) | (0.0020) | (0.0020) | (0.0023) | (0.0026) | | (0.0034) | (0.0011) | (0.0015) | (0.0021) |
| 22 | 0.0102 | | 0.0105 | 0.0071 | 0.0071 | 0.0097 | 0.0103 | | 0.0112 | 0.0052 | 0.0086 | 0.0098 |
| | | | (0.0058) | (0.0037) | (0.0037) | (0.0041) | (0.0041) | | (0.0052) | (0.0022) | (0.0030) | (0.0029) |
| 23 | 0.0034 | | 0.0039 | 0.0022 | 0.0022 | 0.0031 | 0.0030 | | 0.0035 | 0.0011 | 0.0030 | 0.0025 |
| | | | (0.0027) | (0.0015) | (0.0016) | (0.0021) | (0.0022) | | (0.0030) | (0.0010) | (0.0015) | (0.0021) |
| 24 | 0.0102 | | 0.0105 | 0.0069 | 0.0069 | 0.0107 | 0.0114 | | 0.0106 | 0.0049 | 0.0088 | 0.0115 |
| | | | (0.0050) | (0.0033) | (0.0033) | (0.0043) | (0.0042) | | (0.0059) | (0.0023) | (0.0036) | (0.0043) |
| 1, 3 | 0.0040 | | 0.0046 | 0.0037 | 0.0037 | 0.0041 | 0.0040 | | 0.0052 | 0.0041 | 0.0049 | 0.0047 |
| | | | (0.0041) | (0.0032) | (0.0032) | (0.0033) | (0.0034) | | (0.0048) | (0.0029) | (0.0030) | (0.0031) |
| 1, 9 | 0.0029 | | 0.0036 | 0.0026 | 0.0026 | 0.0030 | 0.0022 | | 0.0030 | 0.0018 | 0.0027 | 0.0010 |
| | | | (0.0027) | (0.0019) | (0.0019) | (0.0026) | (0.0028) | | (0.0031) | (0.0015) | (0.0022) | (0.0020) |
| 6, 7 | 0.0204 | | 0.0191 | 0.0148 | 0.0146 | 0.0215 | 0.0199 | | 0.0206 | 0.0124 | 0.0212 | 0.0223 |
| | | | (0.0067) | (0.0048) | (0.0047) | (0.0056) | (0.0054) | | (0.0073) | (0.0037) | (0.0048) | (0.0046) |
| 3, 14 | 0.0034 | | 0.0030 | 0.0019 | 0.0019 | 0.0020 | 0.0027 | | 0.0036 | 0.0015 | 0.0025 | 0.0029 |
| | | | (0.0029) | (0.0018) | (0.0018) | (0.0026) | (0.0025) | | (0.0026) | (0.0011) | (0.0020) | (0.0018) |
| 3, 6, 7 | 0.0034 | | 0.0039 | 0.0029 | 0.0029 | 0.0021 | 0.0027 | | 0.0038 | 0.0025 | 0.0024 | 0.0019 |
| | | | (0.0035) | (0.0025) | (0.0025) | (0.0029) | (0.0031) | | (0.0041) | (0.0022) | (0.0028) | (0.0029) |
| 1, 6, 7, 24 | 0.0034 | | 0.0039 | 0.0028 | 0.0028 | 0.0007 | 0.0006 | | 0.0039 | 0.0020 | 0.0002 | 0.0000 |
| | | | (0.0025) | (0.0019) | (0.0019) | (0.0019) | (0.0018) | | (0.0026) | (0.0014) | (0.0009) | (0.0002) |
| 1, 12, 13, 22, 25 | 0.0034 | | 0.0035 | 0.0025 | 0.0024 | 0.0027 | 0.0032 | | 0.0033 | 0.0017 | 0.0004 | 0.0031 |
| | | | (0.0022) | (0.0016) | (0.0017) | (0.0023) | (0.0023) | | (0.0026) | (0.0013) | (0.0014) | (0.0020) |
| Sum of remaining | | 0.0340 | 0.0227 | 0.0227 | 0.0102 | 0.0074 | | 0.0703 | 0.0310 | 0.0255 | 0.0076 | |
| haplotype probabilities | | | | | | | | | | | | |
| Sum of probabilities | | 0.0103 | 0.0097 | 0.0100 | 0.0132 | 0.0173 | | 0.0131 | 0.0118 | 0.0111 | 0.0200 | |
| of missed haplotypes | | | | | | | | | | | | |
| Sum of squared errors | | 0.00077 | 0.00125 | 0.00126 | 0.00059 | 0.00054 | | 0.00101 | 0.00281 | 0.00055 | 0.00048 | |
| Length of list | | 39.65 | 152.30 | 39.65 | 22.08 | 19.63 | | 71.24 | 71.24 | 29.29 | 19.81 | |
| SD of length | (3.90) | (71.65) | (3.90) | (4.20) | (3.70) | (5.02) | (5.02) | (5.82) | (5.22) | |||
Average estimates of haplotype frequencies for a 25 loci case based on 100 simulations of n pools of k individuals each using CDMLE (collapsed data MLE), EML (EM with combinatorially determined list), EM-CDL (EM with CD list), EM-ATCDL (augmented and trimmed CD list) and EM-TCDL (CD list with trimming and no augmentation), with standard errors in parentheses.
Average estimates of haplotype frequencies for a 32 loci case
| | | | | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | |
| None | 0.7995 | | 0.7979 | 0.8248 | 0.8250 | 0.7981 | 0.8009 | | 0.7990 | 0.8451 | 0.7970 | 0.8040 |
| | | | (0.0150) | (0.0117) | (0.0117) | (0.0148) | (0.0132) | | (0.0154) | (0.0086) | (0.0133) | (0.0125) |
| 1 | 0.0509 | | 0.0514 | 0.0433 | 0.0433 | 0.0492 | 0.0503 | | 0.0502 | 0.0387 | 0.0462 | 0.0507 |
| | | | (0.0103) | (0.0082) | (0.0082) | (0.0089) | (0.0088) | | (0.0121) | (0.0069) | (0.0077) | (0.0080) |
| 2 | 0.0034 | | 0.0035 | 0.0020 | 0.0020 | 0.0033 | 0.0031 | | 0.0037 | 0.0011 | 0.0030 | 0.0024 |
| | | | (0.0032) | (0.0018) | (0.0018) | (0.0024) | (0.0026) | | (0.0035) | (0.0011) | (0.0013) | (0.0021) |
| 3 | 0.0436 | | 0.0430 | 0.0362 | 0.0362 | 0.0435 | 0.0426 | | 0.0441 | 0.0322 | 0.0406 | 0.0426 |
| | | | (0.0092) | (0.0075) | (0.0075) | (0.0082) | (0.0074) | | (0.0105) | (0.0054) | (0.0071) | (0.0065) |
| 5 | 0.0034 | | 0.0033 | 0.0018 | 0.0018 | 0.0032 | 0.0028 | | 0.0034 | 0.0011 | 0.0030 | 0.0023 |
| | | | (0.0028) | (0.0016) | (0.0016) | (0.0020) | (0.0023) | | (0.0035) | (0.0011) | (0.0013) | (0.0021) |
| 6 | 0.0034 | | 0.0038 | 0.0023 | 0.0023 | 0.0033 | 0.0031 | | 0.0033 | 0.0014 | 0.0028 | 0.0022 |
| | | | (0.0031) | (0.0019) | (0.0020) | (0.0021) | (0.0025) | | (0.0035) | (0.0013) | (0.0014) | (0.0019) |
| 9 | 0.0073 | | 0.0080 | 0.0054 | 0.0054 | 0.0079 | 0.0088 | | 0.0081 | 0.0036 | 0.0070 | 0.0092 |
| | | | (0.0038) | (0.0026) | (0.0026) | (0.0032) | (0.0037) | | (0.0043) | (0.0019) | (0.0026) | (0.0031) |
| 11 | 0.0034 | | 0.0032 | 0.0018 | 0.0018 | 0.0030 | 0.0027 | | 0.0032 | 0.0010 | 0.0029 | 0.0023 |
| | | | (0.0026) | (0.0016) | (0.0016) | (0.0022) | (0.0024) | | (0.0031) | (0.0011) | (0.0015) | (0.0021) |
| 15 | 0.0034 | | 0.0035 | 0.0019 | 0.0019 | 0.0030 | 0.0028 | | 0.0038 | 0.0012 | 0.0031 | 0.0028 |
| | | | (0.0033) | (0.0018) | (0.0018) | (0.0023) | (0.0025) | | (0.0031) | (0.0010) | (0.0015) | (0.0021) |
| 19 | 0.0068 | | 0.0063 | 0.0039 | 0.0039 | 0.0062 | 0.0061 | | 0.0066 | 0.0026 | 0.0056 | 0.0057 |
| | | | (0.0039) | (0.0025) | (0.0025) | (0.0029) | (0.0034) | | (0.0041) | (0.0015) | (0.0018) | (0.0026) |
| 20 | 0.0068 | | 0.0068 | 0.0042 | 0.0042 | 0.0062 | 0.0063 | | 0.0063 | 0.0026 | 0.0054 | 0.0059 |
| | | | (0.0039) | (0.0025) | (0.0025) | (0.0028) | (0.0030) | | (0.0038) | (0.0015) | (0.0022) | (0.0026) |
| 21 | 0.0034 | | 0.0038 | 0.0022 | 0.0022 | 0.0035 | 0.0032 | | 0.0037 | 0.0012 | 0.0031 | 0.0025 |
| | | | (0.0035) | (0.0020) | (0.0020) | (0.0023) | (0.0026) | | (0.0034) | (0.0011) | (0.0015) | (0.0021) |
| 22 | 0.0102 | | 0.0105 | 0.0071 | 0.0071 | 0.0097 | 0.0103 | | 0.0112 | 0.0052 | 0.0086 | 0.0098 |
| | | | (0.0058) | (0.0037) | (0.0037) | (0.0041) | (0.0041) | | (0.0052) | (0.0022) | (0.0030) | (0.0029) |
| 23 | 0.0034 | | 0.0039 | 0.0022 | 0.0022 | 0.0031 | 0.0030 | | 0.0035 | 0.0011 | 0.0030 | 0.0025 |
| | | | (0.0027) | (0.0015) | (0.0016) | (0.0021) | (0.0022) | | (0.0030) | (0.0010) | (0.0015) | (0.0021) |
| 24 | 0.0102 | | 0.0105 | 0.0069 | 0.0069 | 0.0107 | 0.0114 | | 0.0106 | 0.0049 | 0.0088 | 0.0115 |
| | | | (0.0050) | (0.0033) | (0.0033) | (0.0043) | (0.0042) | | (0.0059) | (0.0023) | (0.0036) | (0.0043) |
| 1, 3 | 0.0040 | | 0.0046 | 0.0037 | 0.0037 | 0.0041 | 0.0040 | | 0.0052 | 0.0041 | 0.0049 | 0.0047 |
| | | | (0.0041) | (0.0032) | (0.0032) | (0.0033) | (0.0034) | | (0.0048) | (0.0029) | (0.0030) | (0.0031) |
| 1, 9 | 0.0029 | | 0.0036 | 0.0026 | 0.0026 | 0.0030 | 0.0022 | | 0.0030 | 0.0018 | 0.0027 | 0.0010 |
| | | | (0.0027) | (0.0019) | (0.0019) | (0.0026) | (0.0028) | | (0.0031) | (0.0015) | (0.0022) | (0.0020) |
| 6, 7 | 0.0204 | | 0.0191 | 0.0148 | 0.0146 | 0.0215 | 0.0199 | | 0.0206 | 0.0124 | 0.0212 | 0.0223 |
| | | | (0.0067) | (0.0048) | (0.0047) | (0.0056) | (0.0054) | | (0.0073) | (0.0037) | (0.0048) | (0.0046) |
| 3, 14 | 0.0034 | | 0.0030 | 0.0019 | 0.0019 | 0.0020 | 0.0027 | | 0.0036 | 0.0015 | 0.0025 | 0.0029 |
| | | | (0.0029) | (0.0018) | (0.0018) | (0.0026) | (0.0025) | | (0.0026) | (0.0011) | (0.0020) | (0.0018) |
| 3, 6, 7 | 0.0034 | | 0.0039 | 0.0029 | 0.0029 | 0.0021 | 0.0027 | | 0.0038 | 0.0025 | 0.0024 | 0.0019 |
| | | | (0.0035) | (0.0025) | (0.0025) | (0.0029) | (0.0031) | | (0.0041) | (0.0022) | (0.0028) | (0.0029) |
| 1, 6, 7, 24 | 0.0034 | | 0.0039 | 0.0028 | 0.0028 | 0.0007 | 0.0006 | | 0.0039 | 0.0020 | 0.0002 | 0.0000 |
| | | | (0.0025) | (0.0019) | (0.0019) | (0.0019) | (0.0018) | | (0.0026) | (0.0014) | (0.0009) | (0.0002) |
| 1, 12, 13, 22, 25 | 0.0034 | | 0.0035 | 0.0025 | 0.0024 | 0.0027 | 0.0032 | | 0.0033 | 0.0017 | 0.0004 | 0.0031 |
| | | | (0.0022) | (0.0016) | (0.0017) | (0.0023) | (0.0023) | | (0.0026) | (0.0013) | (0.0014) | (0.0020) |
| Sum of remaining | | 0.0340 | 0.0227 | 0.0227 | 0.0102 | 0.0074 | | 0.0703 | 0.0310 | 0.0255 | 0.0076 | |
| haplotype probabilities | | | | | | | | | | | | |
| Sum of probabilities | | 0.0103 | 0.0097 | 0.0100 | 0.0132 | 0.0173 | | 0.0131 | 0.0118 | 0.0111 | 0.0200 | |
| of missed haplotypes | | | | | | | | | | | | |
| Sum of squared errors | | 0.00077 | 0.00125 | 0.00126 | 0.00059 | 0.00054 | | 0.00101 | 0.00281 | 0.00055 | 0.00048 | |
| Length of list | | 39.65 | 152.30 | 39.65 | 22.08 | 19.63 | | 71.24 | 71.24 | 29.29 | 19.81 | |
| SD of length | | (3.90) | (71.65) | (3.90) | (4.20) | (3.70) | | (5.02) | (5.02) | (5.82) | (5.22) | |
| | | | | | | | | | | | | |
| None | 0.7113 | | 0.7108 | 0.7674 | 0.7677 | 0.7115 | 0.7219 | | 0.7110 | 0.8075 | 0.7109 | 0.7374 |
| | | | (0.0202) | (0.0132) | (0.0132) | (0.0197) | (0.0161) | | (0.0232) | (0.0119) | (0.0192) | (0.0147) |
| 1 | 0.0034 | | 0.0031 | 0.0013 | 0.0013 | 0.0032 | 0.0022 | | 0.0034 | 0.0005 | 0.0031 | 0.0012 |
| | | | (0.0032) | (0.0014) | (0.0014) | (0.0022) | (0.0025) | | (0.0054) | (0.0007) | (0.0015) | (0.0018) |
| 3 | 0.0034 | | 0.0039 | 0.0016 | 0.0016 | 0.0032 | 0.0028 | | 0.0036 | 0.0005 | 0.0030 | 0.0017 |
| | | | (0.0034) | (0.0014) | (0.0014) | (0.0022) | (0.0025) | | (0.0047) | (0.0007) | (0.0014) | (0.0022) |
| 5 | 0.0034 | | 0.0037 | 0.0016 | 0.0016 | 0.0035 | 0.0029 | | 0.0036 | 0.0006 | 0.0032 | 0.0017 |
| | | | (0.0035) | (0.0015) | (0.0015) | (0.0022) | (0.0027) | | (0.0049) | (0.0007) | (0.0013) | (0.0022) |
| 7 | 0.0068 | | 0.0066 | 0.0038 | 0.0041 | 0.0061 | 0.0064 | | 0.0065 | 0.0027 | 0.0058 | 0.0061 |
| | | | (0.0048) | (0.0024) | (0.0028) | (0.0030) | (0.0040) | | (0.0070) | (0.0024) | (0.0024) | (0.0053) |
| 9 | 0.0034 | | 0.0037 | 0.0015 | 0.0015 | 0.0034 | 0.0029 | | 0.0038 | 0.0006 | 0.0031 | 0.0016 |
| | | | (0.0038) | (0.0016) | (0.0016) | (0.0024) | (0.0029) | | (0.0049) | (0.0008) | (0.0015) | (0.0022) |
| 10 | 0.0102 | | 0.0098 | 0.0050 | 0.0050 | 0.0087 | 0.0093 | | 0.0103 | 0.0026 | 0.0088 | 0.0085 |
| | | | (0.0059) | (0.0029) | (0.0030) | (0.0035) | (0.0035) | | (0.0072) | (0.0017) | (0.0026) | (0.0042) |
| 11 | 0.0034 | | 0.0038 | 0.0016 | 0.0016 | 0.0032 | 0.0029 | | 0.0027 | 0.0004 | 0.0031 | 0.0010 |
| | | | (0.0030) | (0.0013) | (0.0013) | (0.0023) | (0.0026) | | (0.0044) | (0.0007) | (0.0014) | (0.0019) |
| 14 | 0.0034 | | 0.0034 | 0.0014 | 0.0014 | 0.0031 | 0.0023 | | 0.0030 | 0.0004 | 0.0029 | 0.0013 |
| | | | (0.0037) | (0.0016) | (0.0016) | (0.0022) | (0.0027) | | (0.0048) | (0.0007) | (0.0015) | (0.0021) |
| 17 | 0.0034 | | 0.0033 | 0.0014 | 0.0014 | 0.0029 | 0.0023 | | 0.0031 | 0.0005 | 0.0028 | 0.0014 |
| | | | (0.0035) | (0.0014) | (0.0014) | (0.0021) | (0.0024) | | (0.0047) | (0.0008) | (0.0015) | (0.0021) |
| 20 | 0.0034 | | 0.0037 | 0.0015 | 0.0015 | 0.0030 | 0.0025 | | 0.0028 | 0.0004 | 0.0029 | 0.0014 |
| | | | (0.0035) | (0.0014) | (0.0014) | (0.0019) | (0.0025) | | (0.0042) | (0.0006) | (0.0014) | (0.0021) |
| 21 | 0.0264 | | 0.0251 | 0.0164 | 0.0164 | 0.0252 | 0.0252 | | 0.0266 | 0.0117 | 0.0257 | 0.0260 |
| | | | (0.0089) | (0.0051) | (0.0051) | (0.0064) | (0.0063) | | (0.0135) | (0.0040) | (0.0055) | (0.0056) |
| 22 | 0.0068 | | 0.0074 | 0.0040 | 0.0040 | 0.0095 | 0.0099 | | 0.0067 | 0.0016 | 0.0089 | 0.0074 |
| | | | (0.0055) | (0.0029) | (0.0029) | (0.0040) | (0.0051) | | (0.0067) | (0.0017) | (0.0038) | (0.0062) |
| 24 | 0.0306 | | 0.0307 | 0.0223 | 0.0234 | 0.0289 | 0.0295 | | 0.0297 | 0.0194 | 0.0273 | 0.0291 |
| | | | (0.0096) | (0.0066) | (0.0066) | (0.0070) | (0.0074) | | (0.0146) | (0.0049) | (0.0058) | (0.0061) |
| 25 | 0.0136 | | 0.0138 | 0.0075 | 0.0075 | 0.0129 | 0.0129 | | 0.0155 | 0.0047 | 0.0139 | 0.0125 |
| | | | (0.0072) | (0.0036) | (0.0036) | (0.0040) | (0.0036) | | (0.0100) | (0.0029) | (0.0042) | (0.0051) |
| 26 | 0.0034 | | 0.0033 | 0.0013 | 0.0013 | 0.0029 | 0.0023 | | 0.0037 | 0.0005 | 0.0031 | 0.0014 |
| | | | (0.0035) | (0.0014) | (0.0014) | (0.0024) | (0.0026) | | (0.0052) | (0.0007) | (0.0014) | (0.0021) |
| 28 | 0.0675 | | 0.0661 | 0.0487 | 0.0488 | 0.0615 | 0.0640 | | 0.0668 | 0.0390 | 0.0614 | 0.0629 |
| | | | (0.0136) | (0.0089) | (0.0089) | (0.0107) | (0.0105) | | (0.0162) | (0.0075) | (0.0090) | (0.0086) |
| 30 | 0.0036 | | 0.0032 | 0.0017 | 0.0017 | 0.0070 | 0.0064 | | 0.0043 | 0.0009 | 0.0077 | 0.0060 |
| | | | (0.0029) | (0.0015) | (0.0015) | (0.0039) | (0.0056) | | (0.0053) | (0.0011) | (0.0040) | (0.0067) |
| 31 | 0.0034 | | 0.0037 | 0.0016 | 0.0016 | 0.0031 | 0.0025 | | 0.0031 | 0.0004 | 0.0030 | 0.0014 |
| | | | (0.0037) | (0.0016) | (0.0016) | (0.0021) | (0.0025) | | (0.0044) | (0.0006) | (0.0013) | (0.0020) |
| 32 | 0.0038 | | 0.0034 | 0.0016 | 0.0016 | 0.0045 | 0.0041 | | 0.0041 | 0.0007 | 0.0042 | 0.0029 |
| | | | (0.0032) | (0.0015) | (0.0016) | (0.0029) | (0.0037) | | (0.0060) | (0.0010) | (0.0021) | (0.0038) |
| 2, 25 | 0.0034 | | 0.0034 | 0.0015 | 0.0015 | 0.0021 | 0.0026 | | 0.0036 | 0.0006 | 0.0013 | 0.0018 |
| | | | (0.0033) | (0.0015) | (0.0015) | (0.0029) | (0.0028) | | (0.0048) | (0.0008) | (0.0020) | (0.0023) |
| 7, 24 | 0.0510 | | 0.0507 | 0.0403 | 0.0386 | 0.0525 | 0.0530 | | 0.0523 | 0.0319 | 0.0524 | 0.0519 |
| | | | (0.0120) | (0.0079) | (0.0078) | (0.0094) | (0.0084) | | (0.0139) | (0.0063) | (0.0089) | (0.0064) |
| 12, 13 | 0.0034 | | 0.0031 | 0.0013 | 0.0013 | 0.0016 | 0.0023 | | 0.0029 | 0.0005 | 0.0011 | 0.0014 |
| | | | (0.0033) | (0.0015) | (0.0015) | (0.0025) | (0.0026) | | (0.0049) | (0.0009) | (0.0020) | (0.0022) |
| 21, 23 | 0.0034 | | 0.0031 | 0.0015 | 0.0015 | 0.0018 | 0.0023 | | 0.0035 | 0.0007 | 0.0016 | 0.0019 |
| | | | (0.0035) | (0.0016) | (0.0016) | (0.0027) | (0.0026) | | (0.0042) | (0.0009) | (0.0023) | (0.0023) |
| 21, 28 | 0.0009 | | 0.0031 | 0.0021 | 0.0021 | 0.0022 | 0.0019 | | 0.0035 | 0.0016 | 0.0015 | 0.0010 |
| | | | (0.0038) | (0.0022) | (0.0022) | (0.0027) | (0.0030) | | (0.0050) | (0.0017) | (0.0023) | (0.0027) |
| 21, 30 | 0.0033 | | 0.0035 | 0.0018 | 0.0018 | 0.0022 | 0.0018 | | 0.0035 | 0.0008 | 0.0016 | 0.0012 |
| | | | (0.0033) | (0.0016) | (0.0016) | (0.0027) | (0.0027) | | (0.0044) | (0.0010) | (0.0024) | (0.0023) |
| 22, 30 | 0.0034 | | 0.0029 | 0.0014 | 0.0013 | 0.0017 | 0.0017 | | 0.0037 | 0.0007 | 0.0014 | 0.0014 |
| | | | (0.0031) | (0.0015) | (0.0015) | (0.0024) | (0.0027) | | (0.0051) | (0.0009) | (0.0021) | (0.0022) |
| 24, 28 | 0.0034 | | 0.0042 | 0.0034 | 0.0033 | 0.0036 | 0.0035 | | 0.0064 | 0.0036 | 0.0037 | 0.0040 |
| | | | (0.0048) | (0.0030) | (0.0030) | (0.0032) | (0.0034) | | (0.0072) | (0.0028) | (0.0029) | (0.0036) |
| 28, 32 | 0.0030 | | 0.0034 | 0.0019 | 0.0019 | 0.0022 | 0.0019 | | 0.0033 | 0.0011 | 0.0018 | 0.0018 |
| | | | (0.0033) | (0.0018) | (0.0019) | (0.0028) | (0.0028) | | (0.0037) | (0.0011) | (0.0021) | (0.0026) |
| 4, 7, 24 | 0.0034 | | 0.0028 | 0.0015 | 0.0016 | 0.0013 | 0.0024 | | 0.0028 | 0.0008 | 0.0008 | 0.0019 |
| | | | (0.0025) | (0.0014) | (0.0014) | (0.0022) | (0.0022) | | (0.0027) | (0.0008) | (0.0016) | (0.0019) |
| 7, 22, 24 | 0.0034 | | 0.0042 | 0.0025 | 0.0026 | 0.0019 | 0.0011 | | 0.0029 | 0.0015 | 0.0018 | 0.0012 |
| | | | (0.0036) | (0.0022) | (0.0021) | (0.0028) | (0.0024) | | (0.0034) | (0.0013) | (0.0024) | (0.0025) |
| 7, 24, 30 | 0.0034 | | 0.0034 | 0.0023 | 0.0023 | 0.0019 | 0.0018 | | 0.0033 | 0.0014 | 0.0018 | 0.0018 |
| | | | (0.0032) | (0.0022) | (0.0022) | (0.0031) | (0.0031) | | (0.0035) | (0.0013) | (0.0024) | (0.0029) |
| Sum of remaining | | 0.0828 | 0.0454 | 0.0453 | 0.0178 | 0.0083 | | 0.2194 | 0.0592 | 0.0243 | 0.0160 | |
| haplotype probabilities | | | | | | | | | | | | |
| Sum of probabilities | | 0.0278 | 0.0265 | 0.0271 | 0.0263 | 0.0384 | | 0.0474 | 0.0462 | 0.0222 | 0.0553 | |
| of missed haplotypes | | | | | | | | | | | | |
| Sum of squared errors | | 0.00161 | 0.00451 | 0.00456 | 0.00103 | 0.00100 | | 0.00334 | 0.01152 | 0.00092 | 0.00152 | |
| Length of list | | 62.39 | 159.68 | 62.39 | 30.60 | 23.57 | | 106.91 | 106.91 | 36.11 | 21.92 | |
| SD of length | (4.88) | (27.84) | (4.88) | (6.04) | (4.22) | (6.41) | (6.41) | (7.12) | (5.11) | |||
Average estimates of haplotype frequencies for a 32 loci case based on 100 simulations of n pools of k individuals each using CDMLE (collapsed data MLE), EML (EM with combinatorially determined list), EM-CDL (EM with CD list), EM-ATCDL (augmented and trimmed CD list) and EM-TCDL (CD list with trimming and no augmentation), with standard errors in parentheses.
Figure 1Expected sum of squared errors of various haplotype frequency estimators for a 25 loci case. Expected sum of squared errors of various haplotype frequency estimators (EM-CDL: EM with CD list; EM-ACDL: augmented CD list; EML: EM with combinatorially determined list; CDMLE: collapsed data MLE; EM-TCDL: CD list with trimming and no augmentation; EM-ATCDL: augmented and trimmed CD list; EM-PL: EM with perfect list) based on 100 simulations of (a)n=100 and (b)n=200 pools of k individuals each when the true haplotype distribution over 25 loci is as given in Table 3.
Figure 2Expected sum of squared errors of various haplotype frequency estimators for a 32 loci case. Expected sum of squared errors of various haplotype frequency estimators (EM-CDL: EM with CD list; EM-ACDL: augmented CD list; EML: EM with combinatorially determined list; CDMLE: collapsed data MLE; EM-TCDL: CD list with trimming and no augmentation; EM-ATCDL: augmented and trimmed CD list; EM-PL: EM with perfect list) based on 100 simulations of (a)n=100 and (b)n=200 pools of k individuals each when the true haplotype distribution over 32 loci is as given in Table 4.
Figure 3Expected sum of squared errors of the EM-ATCDL estimator with fixed threshold (25 loci case). Expected sum of squared errors of the EM-ATCDL estimator for various choices of the threshold based on 100 simulations of n=200 pools of (a)k=2 and (b)k=4 individuals each when the true haplotype distribution over 25 loci is as given in Table 3; optimal threshold: the threshold obtained by minimizing the averaged sum of squared errors; average adaptive threshold: adaptively chosen thresholds obtained by minimizing the distance between and f(0) over the grid 0.0001 to 0.002 in steps of 0.0001.
Figure 4Expected sum of squared errors of the EM-ATCDL estimator with fixed threshold (32 loci case). Expected sum of squared errors of the EM-ATCDL estimator for various choices of the threshold based on 100 simulations of n=200 pools of (a)k=2 and (b)k=4 individuals each when the true haplotype distribution over 32 loci is as given in Table 4; optimal threshold: the threshold obtained by minimizing the averaged sum of squared errors; average adaptive threshold: adaptively chosen thresholds obtained by minimizing the distance between and f(0) over the grid 0.0001 to 0.002 in steps of 0.0001.
Sufficient conditions for non-ancestral haplotype frequencies to be increased by collapsing data
| 0.5000 | 0.6300 | 0.6988 | 0.7430 | 0.7743 |
Sufficient conditions for collapsed data frequencies {g(y),y≠0} to be greater than haplotype frequencies {f(y),y≠0} for various choices of pool size k.
Induced collapsed data frequencies
| | | ||||||
|---|---|---|---|---|---|---|---|
| None | | 0.7995 | | 0.6392 | 0.4085 | 0.2611 | 0.1669 |
| 1 | | 0.0509 | | 0.0839 | 0.1143 | 0.1169 | 0.1065 |
| 2 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 3 | | 0.0436 | | 0.0716 | 0.0967 | 0.0980 | 0.0883 |
| 5 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 6 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 9 | | 0.0073 | | 0.0117 | 0.0151 | 0.0146 | 0.0125 |
| 11 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 15 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 19 | | 0.0068 | | 0.0109 | 0.0141 | 0.0136 | 0.0117 |
| 20 | | 0.0068 | | 0.0109 | 0.0141 | 0.0136 | 0.0117 |
| 21 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 22 | | 0.0102 | | 0.0164 | 0.0213 | 0.0206 | 0.0178 |
| 23 | | 0.0034 | | 0.0055 | 0.0070 | 0.0067 | 0.0058 |
| 24 | | 0.0102 | | 0.0164 | 0.0213 | 0.0206 | 0.0178 |
| 1, 3 | | 0.0040 | | 0.0117 | 0.0307 | 0.0482 | 0.0610 |
| 1, 9 | | 0.0029 | | 0.0058 | 0.0105 | 0.0135 | 0.0148 |
| 3, 14 | | 0.0034 | | 0.0057 | 0.0082 | 0.0088 | 0.0084 |
| 6, 7 | | 0.0204 | | 0.0332 | 0.0439 | 0.0435 | 0.0384 |
| 3, 6, 7 | | 0.0034 | | 0.0077 | 0.0164 | 0.0231 | 0.0271 |
| 1, 6, 7, 24 | | 0.0034 | | 0.0060 | 0.0097 | 0.0119 | 0.0132 |
| 1, 12, 13, 22, 25 | | 0.0034 | | 0.0059 | 0.0087 | 0.0097 | 0.0096 |
| Sum of haplotype probabilities | 1.0000 | 0.9751 | 0.8822 | 0.7650 | 0.6462 | ||
Haplotype frequencies f(y) for a 25-loci case and the induced collapsed data frequencies g(y) for various pool sizes k.