| Literature DB >> 15291970 |
Abstract
BACKGROUND: Analyses of genetic data at the level of haplotypes provide increased accuracy and power to infer genotype-phenotype correlations and evolutionary history of a locus. However, empirical determination of haplotypes is expensive and laborious. Therefore, several methods of inferring haplotypes from unphased genotypic data have been proposed, but it is unclear how accurate each of the methods is or which methods are superior. The accuracy of some of the leading methods of computational haplotype inference (PL-EM, Phase, SNPHAP, Haplotyper) are compared using a large set of 308 empirically determined haplotypes based on 15 SNPs, among which 36 haplotypes were observed to occur. This study presents several advantages over many previous comparisons of haplotype inference methods: a large number of subjects are included, the number of known haplotypes is much smaller than the number of chromosomes surveyed, a range in values of linkage disequilibrium, presence of rare SNP alleles, and considerable dispersion in the frequencies of haplotypes.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15291970 PMCID: PMC512280 DOI: 10.1186/1471-2156-5-22
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Linkage disequilibrium (D') among loci with minor allele frequencies ≥ 5%1
| Site | |||||||
| Site | -278 | -75 | -57 | -31 | -6 | -1 | +59 |
| -308 | 1.000 | 0.653 | 0.892 | 0.741 | 0.458 | 0.192 | 0.646 |
| -278 | 0.857 | 0.845 | 0.696 | 0.820 | 0.666 | 0.334 | |
| -75 | 1.000 | 0.358 | 0.708 | 0.561 | 0.172 | ||
| -57 | 1.000 | 0.872 | 1.000 | 1.000 | |||
| -31 | 0.410 | 1.000 | 0.538 | ||||
| -6 | 1.000 | 0.194 | |||||
| -1 | 0.002 | ||||||
1 Numbering relative to the start of transcription for GH1
Inferred frequencies of haplotypes.
| SNP | Haplotype Frequency | ||||||||||||||||||||
| -476 | -339 | -308 | -301 | -278 | -168 | -75 | -57 | -31 | -6 | -1 | 3 | 16 | 25 | 59 | Empirical | Phase, no LD | Phase, with LD | Haplotyper | PL-EM | SNPHAP | |
| Empirical Haplotypes | |||||||||||||||||||||
| 1 | G | G | G | G | G | T | A | T | G | A | A | G | A | A | T | 0.334 | 0.312 | 0.321 | 0.325 | 0.333 | 0.326 |
| 2 | G | G | G | G | T | T | A | G | G | G | A | G | A | A | T | 0.162 | 0.166 | 0.162 | 0.166 | 0.181 | 0.171 |
| 3 | G | G | T | T | G | T | A | G | G | A | A | G | A | A | T | 0.091 | 0.097 | 0.097 | 0.101 | 0.098 | 0.102 |
| 4 | G | G | T | T | G | T | A | G | - | A | A | G | A | A | T | 0.052 | 0.055 | 0.055 | 0.049 | 0.047 | 0.050 |
| 5 | G | G | G | G | T | T | G | G | G | G | A | G | A | A | T | 0.042 | 0.052 | 0.052 | 0.052 | 0.049 | 0.050 |
| 6 | G | G | T | T | G | T | A | G | - | A | A | G | A | A | G | 0.029 | 0.032 | 0.032 | 0.032 | 0.030 | 0.030 |
| 7 | G | G | G | G | T | T | A | G | G | G | T | G | A | A | T | 0.026 | 0.032 | 0.032 | 0.032 | 0.028 | 0.029 |
| 8 | G | G | T | T | G | T | A | G | G | G | A | G | A | A | T | 0.019 | 0.016 | 0.016 | 0.013 | 0.016 | 0.018 |
| 9 | G | G | G | G | T | T | A | T | G | G | A | G | A | A | T | 0.019 | 0.013 | 0.013 | 0.013 | 0.011 | 0.011 |
| 10 | G | G | T | T | G | T | A | G | - | G | A | G | A | A | T | 0.019 | 0.023 | 0.026 | 0.023 | 0.025 | 0.025 |
| 11 | G | G | G | G | T | T | G | G | G | G | A | G | G | C | T | 0.016 | 0.016 | 0.016 | 0.016 | 0.014 | 0.014 |
| 12 | G | G | G | G | T | T | A | G | G | A | A | G | A | A | T | 0.016 | 0.010 | 0.006 | 0.006 | 0.008 | 0.008 |
| 13 | G | - | G | G | T | T | G | G | G | G | A | G | A | A | T | 0.016 | 0.016 | 0.016 | 0.013 | 0.010 | 0.013 |
| 14 | G | G | G | G | T | C | A | G | G | G | T | G | A | A | T | 0.016 | 0.016 | 0.016 | 0.016 | 0.016 | 0.016 |
| 15 | G | G | T | T | G | T | A | G | G | G | T | G | A | A | T | 0.013 | 0.010 | 0.010 | 0.010 | 0.006 | 0.009 |
| 16 | G | G | G | G | T | T | G | G | G | A | A | G | A | A | T | 0.013 | 0.013 | 0.013 | 0.016 | 0.008 | 0.008 |
| 17 | G | - | G | G | T | T | A | G | G | G | A | G | A | A | T | 0.013 | 0.013 | 0.013 | 0.013 | 0.011 | 0.011 |
| 18 | G | G | G | G | T | T | A | G | - | G | A | G | A | A | T | 0.010 | - | 0.006 | 0.010 | 0.007 | 0.008 |
| 19 | A | G | G | G | T | T | A | G | G | G | A | G | A | A | T | 0.010 | 0.013 | 0.013 | 0.010 | 0.005 | 0.010 |
| 20 | G | G | G | G | G | T | A | G | - | A | A | G | A | A | T | 0.010 | - | 0.003 | 0.010 | 0.006 | 0.005 |
| 21 | G | G | G | G | T | T | G | G | G | G | A | G | A | A | G | 0.010 | 0.010 | 0.010 | 0.010 | 0.011 | 0.011 |
| 22 | G | G | T | T | G | T | A | T | G | A | A | G | A | A | T | 0.010 | 0.013 | 0.010 | 0.013 | 0.007 | 0.007 |
| 23 | G | G | G | G | G | T | A | G | G | A | A | G | A | A | T | 0.006 | 0.016 | 0.013 | 0.006 | 0.006 | 0.008 |
| 24 | G | G | T | T | G | T | G | G | - | A | A | G | A | A | T | 0.006 | - | - | - | - | - |
| 25 | G | G | T | T | G | T | A | G | G | A | A | G | A | A | G | 0.003 | - | - | - | 0.004 | 0.004 |
| 26 | G | G | G | G | T | T | G | G | G | G | T | G | A | A | T | 0.003 | 0.006 | 0.006 | 0.006 | 0.007 | 0.007 |
| 27 | G | G | G | G | T | T | A | T | G | A | A | G | A | A | T | 0.003 | 0.003 | 0.003 | - | - | - |
| 28 | G | G | G | G | T | T | A | G | - | A | A | G | A | A | T | 0.003 | - | - | - | - | - |
| 29 | A | G | G | G | T | T | A | G | G | A | A | G | A | A | T | 0.003 | - | - | - | - | - |
| 30 | G | - | G | G | T | T | A | G | G | A | A | G | A | A | T | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 |
| 31 | G | G | G | G | T | T | G | G | - | G | A | G | A | A | T | 0.003 | 0.010 | - | - | - | - |
| 32 | G | G | T | T | G | T | G | G | G | G | A | G | A | A | G | 0.003 | - | - | - | 0.002 | - |
| 33 | G | G | G | G | T | T | A | G | G | G | A | G | G | C | T | 0.003 | 0.003 | 0.003 | 0.003 | 0.004 | 0.004 |
| 34 | G | - | G | G | T | C | A | G | G | G | T | G | A | A | T | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 |
| 35 | G | G | G | G | G | T | A | G | G | A | C | C | A | A | T | 0.003 | - | - | 0.003 | 0.003 | 0.003 |
| 36 | G | G | G | G | T | T | A | G | G | G | T | G | A | A | G | 0.003 | - | - | - | 0.003 | 0.003 |
| Incorrect Haplotypes | |||||||||||||||||||||
| 1 | G | - | G | G | T | T | G | G | G | G | A | G | G | C | T | - | - | - | 0.002 | 0.002 | |
| 2 | G | G | G | G | T | T | G | G | - | A | A | G | A | A | T | - | 0.006 | - | 0.009 | 0.008 | |
| 3 | G | G | G | G | T | T | G | T | - | G | A | G | A | A | T | - | - | - | - | 0.002 | |
| 4 | G | - | G | G | T | T | G | T | G | A | A | G | A | A | T | - | - | 0.003 | - | - | |
| 5 | G | - | G | G | T | T | G | T | G | G | A | G | A | A | T | - | - | - | 0.003 | 0.002 | |
| 6 | G | G | G | G | T | T | G | T | G | G | A | G | A | A | T | - | 0.003 | - | - | - | |
| 7 | G | G | G | G | T | T | G | T | G | A | A | G | A | A | T | - | - | - | - | 0.004 | |
| 8 | A | G | G | G | T | T | A | T | G | A | A | G | A | A | T | - | - | 0.003 | 0.003 | 0.003 | |
| 9 | G | G | G | G | T | T | A | G | G | G | C | C | A | A | T | 0.003 | 0.003 | - | - | - | |
| 10 | A | G | T | T | G | T | A | G | - | A | A | G | A | A | T | - | - | - | 0.002 | - | |
| 11 | A | G | T | T | G | T | A | G | G | G | T | G | A | A | T | - | - | - | 0.003 | - | |
| 12 | G | G | T | T | T | T | G | G | - | G | A | G | A | A | T | - | - | 0.006 | 0.004 | - | |
| 13 | G | G | T | T | T | T | G | G | G | A | A | G | A | A | T | - | - | - | 0.003 | - | |
| 14 | G | G | G | G | G | T | A | T | - | A | A | G | A | A | T | 0.010 | - | - | - | - | |
| 15 | G | G | G | G | G | T | A | T | G | G | A | G | A | A | T | 0.006 | 0.006 | 0.006 | 0.008 | 0.008 | |
| 16 | G | G | G | G | G | T | A | T | G | A | A | G | A | A | G | 0.006 | 0.006 | 0.006 | - | - | |
| Minor Allele Frequency | |||||||||||||||||||||
| 0.013 | 0.036 | 0.247 | 0.247 | 0.399 | 0.019 | 0.114 | 0.367 | 0.133 | 0.412 | 0.065 | 0.003 | 0.019 | 0.019 | 0.049 | |||||||
Accuracy of computational inferences of haplotype structure of the GH1 gene promoter.
| 15 Promoter SNPs | Error Rate Based on # of Heterozygous Sites (N) | |||||||||
| Error Rate | ||||||||||
| Algorithm | MSE | IH # correct/# wrong | IF | Overall | Ambiguous Individuals | 2 (13) | 3 (32) | 4 (24) | 5 (28) | 6 (11) |
| Phase v2, no LD | 3.6 × 10-5 | 0.81 27/4 | 0.91 | 0.11 | 0.15 | 0.08 | 0.09 | 0.25 | 0.11 | 0.27 |
| Phase v2, with LD | 2.2 × 10-5 | 0.81 28/5 | 0.93 | 0.10 | 0.13 | 0 | 0.09 | 0.17 | 0.14 | 0.27 |
| Haplotyper 1.0 | 2.0 × 10-5 | 0.81 28/5 | 0.93 | 0.09 | 0.13 | 0 | 0.06 | 0.13 | 0.18 | 0.27 |
| PL-EM 1.0 | 2.5 × 10-5 | 0.82 31/9 | 0.92 | 0.11 | 0.15 | 0 | 0.09 | 0.13 | 0.21 | 0.36 |
| SNPHAP 1.0 | 2.0 × 10-5 | 0.82 30/7 | 0.93 | 0.09 | 0.13 | 0 | 0.09 | 0.13 | 0.14 | 0.27 |
| 8 SNPs with Minor Allele Frequency ≥ 5% | ||||||||||
| Phase v2, no LD | 4.8 × 10-5 | 0.79 19/3 | 0.92 | 0.11 | 0.15 | |||||
| Phase v2, with LD | 4.1 × 10-5 | 0.80 20/4 | 0.92 | 0.11 | 0.15 | |||||
| Haplotyper 1.0 | 3.8 × 10-5 | 0.81 19/2 | 0.93 | 0.10 | 0.14 | |||||
| PL-EM 1.0 | 2.3 × 10-5 | 0.85 22/4 | 0.94 | 0.08 | 0.11 | |||||
| SNPHAP 1.0 | 3.3 × 10-5 | 0.85 22/4 | 0.93 | 0.08 | 0.11 | |||||