| Literature DB >> 16689984 |
Sandra Barral1, Chad Haynes, Millicent Stone, Derek Gordon.
Abstract
BACKGROUND: In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors. We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors (LRTae) and is freely available via software download.Entities:
Mesh:
Year: 2006 PMID: 16689984 PMCID: PMC1471798 DOI: 10.1186/1471-2156-7-24
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Generating multi-locus genotype frequencies for simulations
| Multi-locus genotypes | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
| Case | 0.00 | 0.02 | 0.00 | 0.01 | 0.01 | 0.00 | 0.20 | 0.00 | 0.02 | 0.12 | 0.00 | 0.06 | 0.58 |
| Control | 0.01 | 0.04 | 0.01 | 0.05 | 0.00 | 0.03 | 0.22 | 0.03 | 0.03 | 0.20 | 0.01 | 0.05 | 0.33 |
Legend for Table 1. In this table we provide the generating frequencies in case and control populations for the thirteen multi-locus genotypes. We number the genotype frequencies 0 – 12. The multi-locus genotype corresponding to each coded genotype (0–12) is given under the heading "Genotype Codings" (see directly below). Note that, at each SNP, we use the code 1, 2, 3 to refer to the less common homozygote, heterozygote, and more common homozygote, respectively. For example, the code "3 : 3 : 3" is the multi-locus genotype consisting of the more common homozygote at each of the three SNP loci. Also note that these generating frequencies suggest a recessive mode of inheritance for AS in our simulations, since greatest risk occurs for individuals who are homozygous for each of the three SNPs (code 3 : 3 : 3).
Genotype Codings:
0 = 1 : 2 : 2
1 = 1 : 3 : 3
2 = 2 : 1 : 1
3 = 2 : 2 : 2
4 = 2 : 2 : 3
5 = 2 : 3 : 2
6 = 2 : 3 : 3
7 = 3 : 1 : 1
8 = 3 : 2 : 1
9 = 3 : 2 : 2
10 = 3 : 3 : 1
11 = 3 : 3 : 2
12 = 3 : 3 : 3
Power for LRTae and LRTstd methods using permutation p-values
| LRTae | LRTstd | ||
| 0.05 | 0.989 | 0.970 | 0.019 |
| 0.01 | 0.953 | 0.905 | 0.048 |
| 0.001 | 0.831 | 0.716 | 0.115 |
| 0.0001 | 0.677 | 0.532 | 0.145 |
Legend for Table 2. In this table, we report the proportion of replicates (out of 10,000) whose permutation p-values for each method (LRTae, LRTstd) are less than the Significance Level (Column labeled Sgn Level) threshold. The thresholds we consider are 0.05, 0.01, 0.001, and 0.0001. We also report the Power Difference (Power(LRTae) – Power(LRTstd)) for each significance level.