| Literature DB >> 16004725 |
Abstract
For population case-control association studies, the false-positive rates can be high due to inappropriate controls, which can occur if there is population admixture or stratification. Moreover, it is not always clear how to choose appropriate controls. Alternatively, the parents or normal sibs can be used as controls of affected sibs. For late-onset complex diseases, parental data are not usually available. One way to study late-onset disorders is to perform sib-pair or sibship analyses. This paper proposes sibship-based Hotelling's T2 test statistics for high-resolution linkage disequilibrium mapping of complex diseases. For a sample of sibships, suppose that each sibship consists of at least one affected sib and at least one normal sib. Assume that genotype data of multiple tightly linked markers/haplotypes are available for each individual in the sample. Paired Hotelling's T2 test statistics are proposed for high-resolution association studies using normal sibs as controls for affected sibs, based on two coding methods: 'haplotype/allele coding' and 'genotype coding'. The paired Hotelling's T2 tests take into account not only the correlation among the markers, but also take the correlation within each sib-pair. The validity of the proposed method is justified by rigorous mathematical and statistical proofs under the large sample theory. The non-centrality parameter approximations of the test statistics are calculated for power and sample size calculations. By carrying out power and simulation studies, it was found that the non-centrality parameter approximations of the test statistics were accurate. By power and type I error analysis, the test statistics based on the 'haplotype/allele coding' method were found to be advantageous in comparison to the test statistics based on the 'genotype coding' method. The test statistics based on multiple markers can have higher power than those based on a single marker. The test statistics can be applied not only for bi-allelic markers, but also for multi-allelic markers. In addition, the test statistics can be applied to analyse the genetic data of multiple markers which contain double heterozygotes--that is, unknown linkage phase data. An SAS macro, Hotel_sibs.sas, is written to implement the method for data analysis.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16004725 PMCID: PMC3530186 DOI: 10.1186/1479-7364-2-2-90
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Type I error rates of N = 200 or 300 sib-pairs at a significance level α = 0.01 using one marker, H1, or two markers, H1 and H2.
| Model | Test | df | # type I error rates | Mean | Std Dev | Minimum | Maximum |
|---|---|---|---|---|---|---|---|
| I | 1 | 100 | 0.010808 | 0.0014264 | 0.0066 | 0.0140 | |
| 2 | 100 | 0.011240 | 0.0013923 | 0.0082 | 0.0152 | ||
| II | 2 | 100 | 0.011286 | 0.0014717 | 0.0070 | 0.0146 | |
| 4 | 100 | 0.012352 | 0.0015899 | 0.0088 | 0.0160 | ||
| III | 3 | 100 | 0.011660 | 0.0014348 | 0.0078 | 0.0146 | |
| 9 | 100 | 0.014352 | 0.0018710 | 0.0102 | 0.0196 | ||
| III | 3 | 100 | 0.011186 | 0.0015669 | 0.0074 | 0.0160 | |
| 9 | 100 | 0.013076 | 0.0017027 | 0.0084 | 0.0176 | ||
In model I, one bi-allelic marker H1 is used, P(H11) = P(H12) = 0.50. In model II, two bi-allelic markers H1 and H2 are used, P(H) = 0.5, i, j = 1, 2, . In model III, one quadric-allelic marker H1 is used, P(H21) = P(H22) = 0.35, P(H23) = P(H24) = 0.15. Abbreviations: df = degrees of freedom; Std Dev = standard deviation.
Type I error rates of N = 200 or 300 sibships at a significance level α = 0.01 using one marker, H1, or two markers, H1 and H2.
| Model | Test | df | # type I error rates | Mean | Std Dev | Minimum | Maximum |
|---|---|---|---|---|---|---|---|
| I | 1 | 100 | 0.010642 | 0.0014406 | 0.0062 | 0.0134 | |
| 2 | 100 | 0.011278 | 0.0015023 | 0.0076 | 0.0154 | ||
| II | 2 | 100 | 0.011096 | 0.0014418 | 0.0078 | 0.0154 | |
| 4 | 100 | 0.012138 | 0.0014825 | 0.0082 | 0.0154 | ||
| III | 3 | 100 | 0.011536 | 0.0014156 | 0.070 | 0.0158 | |
| 9 | 100 | 0.014202 | 0.0016562 | 0.096 | 0.0182 | ||
| III | 3 | 100 | 0.011098 | 0.0015214 | 0.0076 | 0.0152 | |
| 9 | 100 | 0.012790 | 0.0016883 | 0.0086 | 0.0186 | ||
The number of sib-pairs is equal to N/2; in each sib-pair, one sibling is affected and the other is normal. The number of sibships of size 3 is N/2; in each of N/4 sibships of size 3, one is affected and the other two are normal; in the remaining N/4 sibships of size 3, two are affected and the other one is normal. The other parameters of each model are the same as those of Table 1. Abbreviations: df = degrees of freedom; Std Dev = standard deviation.
Type I error rates of N = 200 or 300 sibships at a significance level α = 0.01 using one marker, H1, or two markers, H1 and H2.
| Model | Test | df | # type I error rates | Mean | Std Dev | Minimum | Maximum |
|---|---|---|---|---|---|---|---|
| I | 1 | 100 | 0.010670 | 0.0014040 | 0.0072 | 0.0136 | |
| 2 | 100 | 0.011156 | 0.0015397 | 0.0066 | 0.0142 | ||
| II | 2 | 100 | 0.011218 | 0.0014678 | 0.0078 | 0.0166 | |
| 4 | 100 | 0.012304 | 0.0011921 | 0.0092 | 0.0156 | ||
| III | 3 | 100 | 0.011518 | 0.0014639 | 0.0082 | 0.015 | |
| 9 | 100 | 0.014356 | 0.0015381 | 0.0102 | 0.018 | ||
| III | 3 | 100 | 0.011228 | 0.0013312 | 0.0078 | 0.0160 | |
| 9 | 100 | 0.012544 | 0.0015203 | 0.0086 | 0.0182 | ||
The number of sib-pairs is equal to N/2; the number of sibships of size 3 is N/5; and the number of sibships of size 4 is 3N/10. In each sib-pair, one sibling is affected and the other is normal. In each of N/10 sibships of size 3, one is affected and the other two are normal; in the remaining N/10 sibships of size 3, two are affected and the other is normal. In each of N/10 sibships of size 4, one is affected and the other three are normal; in each of N/10 sibships of size 4, two are affected and the other two are normal; in the remaining N/10 sibships of size 4, three are affected and the other one is normal. The other parameters of each model are the same as those of Table 1. Abbreviations: df = degrees of freedom; Std Dev = standard deviation.
First set of parameters of simulated genetic models.
| Model type | |||
|---|---|---|---|
| Heterogeneous recessive | 1.00 | 0.05 | 0.05 |
| Heterogeneous dominant | 1.00 | 0.95 | 0.05 |
| Additive | 1.00 | 0.50 | 0.0 |
| Multiplicative | 0.81 | 0.045 | 0.0025 |
Second set of parameters of simulated genetic models.
| Model type | |||
|---|---|---|---|
| Heterogeneous recessive | 0.16 | 0.04 | 0.04 |
| Heterogeneous dominant | 0.08 | 0.08 | 0.02 |
| Additive | 0.108 | 0.0675 | 0.027 |
| Multiplicative | 0.12 | 0.06 | 0.03 |
Figure 1Power curves of . The power curves of Tand Tare calculated based on one marker H1. In the graphs, Delta_11 = Δ11 = P(H11D) - P(H11)Pis a measure of linkage disequilibrium (LD) between marker H1 and disease locus D; in addition, the other parameters are given by Δ21 = P(H21D) 2 P(H21)P= Δ11, Δ= P(H11H21) - P(H11)P(H21) = 0.05; and .
Figure 2Power curves of . The power curves of Tand Tare calculated based on one marker H1. In the graphs, Delta_11 = Δ11 = P(H11D) - P(H11)Pis a measure of linkage disequilibrium (LD) between marker H1 and disease locus D; in addition, the other parameters are given by Δ21 = P(H21D) - P(H21)P= Δ11, , and .
Figure 3Power curves of . Delta_11 = Δ11 = P(H11D) - P(H11)Pis a measure of linkage disequilibrium (LD) between marker H1 and disease locus D. In addition, Δ12 = -Δ11, Δ13 = -Δ14 = Δ11/2. The simulated power curves of STand STare calculated using combinations of both sib-pairs and sibships of size 3: the number of sib-pairs is equal to N/2 = 100; the number of sibships of size 3 is N/2 = 100; in each of N/4 = 50 sibships of size 3, one is affected and the other two are normal; in the remaining N/4 = 50 sibships of size 3, two are affected and the other one is normal.
Figure 4Power curves of . Delta_11 = Δ11 = P(H11D) - P(H11)PD is a measure of linkage disequilibrium (LD) between marker H1 and disease locus D. In addition, Δ12 = -Δ11, Δ13 = - Δ14 = Δ11/2. The simulated power curves of STH and STG are calculated using combinations of both sib-pairs and sibships of size 3 and sibships of size 4; the number of sib-pairs is equal to N/2 = 300; the number of sibships of size 3 is N/2 = 120; and the number of sibships of size 4 is 3N/10 = 180; in each of N/10 = 60 sibships of size 3, one is affected and the other two are normal; in the remaining N/10 = 60 sibships of size 3, two are affected and the other one is normal; in each of N/10 = 60 sibships of size 4, one is affected and the other three are normal; in each of N/10 = 60 sibships of size 4, two are affected and the other two are normal; in the remaining N/10 = 60 sibships of size 4, three are affected and the other one is normal.