| Literature DB >> 20011038 |
X Li, B N Thomas, S M Rich, D Ecker, J K Tumwine, A S Foulkes.
Abstract
Malaria is an infectious disease that is caused by a group of parasites of the genus Plasmodium. Characterizing the association between polymorphisms in the parasite genome and measured traits in an infected human host may provide insight into disease aetiology and ultimately inform new strategies for improved treatment and prevention. This, however, presents an analytic challenge since individuals are often multiply infected with a variable and unknown number of genetically diverse parasitic strains. In addition, data on the alignment of nucleotides on a single chromosome, which is commonly referred to as haplotypic phase, is not generally observed. An expectation-maximization algorithm for estimating and testing associations between haplotypes and quantitative traits has been described for diploid (human) populations. We extend this method to account for both the uncertainty in haplotypic phase and the variable and unknown number of infections in the malaria setting. Further extensions are described for the human immunodeficiency virus quasi-species setting. A simulation study is presented to characterize performance of the method. Application of this approach to data arising from a cross-sectional study of n=126 multiply infected children in Uganda reveals some interesting associations requiring further investigation.Entities:
Year: 2009 PMID: 20011038 PMCID: PMC2784907 DOI: 10.1111/j.1467-9876.2009.00673.x
Source DB: PubMed Journal: J R Stat Soc Ser C Appl Stat ISSN: 0035-9254 Impact factor: 1.864
Simulation results for the dominant model under three assumptions†
| ( | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.0 | 0.0038 (0.132) | — | 0.0008 (0.016) | 0.95 | — | 0.95 | 0.05 | 0.06 | |
| 0.2 | 0.0009 (0.138) | — | 0.0005 (0.015) | 0.96 | — | 0.95 | 0.35 | 0.07 | |
| 0.4 | 0.0060 (0.138) | — | 0.0013 (0.015) | 0.96 | — | 0.95 | 0.82 | 0.06 | |
| 0.6 | 0.0002 (0.126) | — | 0.0003 (0.016) | 0.95 | — | 0.95 | 0.99 | 0.06 | |
| 0.8 | 0.0016 (0.122) | — | 0.0008 (0.015) | 0.94 | — | 0.95 | 1.00 | 0.05 | |
| 0.0 | 0.0035 (0.180) | — | 0.0007 (0.018) | 0.94 | — | 0.94 | 0.08 | 0.07 | |
| 0.2 | 0.0122 (0.181) | — | 0.0009 (0.017) | 0.95 | — | 0.95 | 0.22 | 0.08 | |
| 0.4 | 0.0136 (0.187) | — | 0.0006 (0.017) | 0.95 | — | 0.95 | 0.59 | 0.08 | |
| 0.6 | 0.0265 (0.181) | — | 0.0011 (0.017) | 0.95 | — | 0.95 | 0.88 | 0.08 | |
| 0.8 | 0.0291 (0.177) | — | 0.0004 (0.017) | 0.95 | — | 0.94 | 0.97 | 0.07 | |
| 0.0 | 0.0128 (0.206) | — | 0.0066 (0.019) | 0.94 | — | 0.92 | 0.07 | 0.06 | |
| 0.2 | 0.0078 (0.223) | — | 0.0037 (0.019) | 0.97 | — | 0.94 | 0.20 | 0.09 | |
| 0.4 | 0.0443 (0.212) | — | 0.0065 (0.020) | 0.96 | — | 0.94 | 0.38 | 0.06 | |
| 0.6 | 0.0856 (0.185) | — | 0.0048 (0.020) | 0.93 | — | 0.95 | 0.62 | 0.07 | |
| 0.8 | 0.0627 (0.197) | — | 0.0046 (0.018) | 0.92 | — | 0.95 | 0.88 | 0.06 | |
| 0.0 | 0.0098 (0.126) | 0.0022 (0.111) | 0.0025 (0.020) | 0.96 | 0.94 | 0.94 | 0.04 | 0.05 | |
| 0.2 | 0.0011 (0.150) | 0.0093 (0.105) | 0.0011 (0.019) | 0.95 | 0.95 | 0.95 | 0.41 | 0.05 | |
| 0.4 | 0.0001 (0.128) | 0.0101 (0.089) | 0.0013 (0.020) | 0.96 | 0.96 | 0.97 | 0.87 | 0.06 | |
| 0.6 | 0.0240 (0.129) | 0.0042 (0.116) | 0.0018 (0.020) | 0.94 | 0.98 | 0.96 | 1.00 | 0.03 | |
| 0.8 | 0.0160 (0.146) | 0.0091 (0.104) | 0.0012 (0.019) | 0.96 | 0.95 | 0.94 | 0.99 | 0.05 | |
| 0.0 | 0.0022 (0.131) | 0.0087 (0.123) | 0.0017 (0.019) | 0.96 | 0.97 | 0.94 | 0.04 | 0.03 | |
| 0.2 | 0.0312 (0.129) | 0.0372 (0.124) | 0.0027 (0.019) | 0.95 | 0.96 | 0.95 | 0.44 | 0.04 | |
| 0.4 | 0.0002 (0.122) | 0.0043 (0.137) | 0.0017 (0.020) | 0.94 | 0.96 | 0.95 | 0.91 | 0.05 | |
| 0.4 | 0.0055 (0.129) | 0.0216 (0.137) | 0.0009 (0.018) | 0.93 | 0.96 | 0.94 | 0.99 | 0.06 | |
| 0.8 | 0.0120 (0.116) | 0.0067 (0.126) | 0.0024 (0.020) | 0.97 | 0.96 | 0.94 | 1.00 | 0.06 | |
| 0.0 | 0.0034 (0.117) | 0.0112 (0.033) | 0.0024 (0.019) | 0.95 | 0.79 | 0.96 | 0.05 | 0.03 | |
| 0.2 | 0.0082 (0.108) | 0.0119 (0.030) | 0.0027 (0.018) | 0.94 | 0.85 | 0.95 | 0.38 | 0.06 | |
| 0.4 | 0.0024 (0.118) | 0.0119 (0.029) | 0.0018 (0.018) | 0.96 | 0.81 | 0.96 | 0.94 | 0.06 | |
| 0.6 | 0.0321 (0.141) | 0.0132 (0.032) | 0.0027 (0.019) | 0.97 | 0.83 | 0.96 | 1.00 | 0.04 | |
| 0.8 | 0.0015 (0.116) | 0.0119 (0.032) | 0.0007 (0.018) | 0.96 | 0.83 | 0.95 | 1.00 | 0.05 | |
and denote averaging across all s and s respectively. and denote averaging across all θs and qs respectively.
‡β1 is the effect of haplotype h1=(A1,B1) on Y.
§Bias is defined as the absolute difference between the mean of the estimate over the simulations and the true parameter value.
§§Coverage rate is defined as the proportion of simulations for which the true parameter value is within the corresponding 95% confidence interval.
*Power is the specific power for the haplotype effect of the first haplotype h1.
**ER is the type 1 error rate.
Sensitivity analysis to model misspecification
| 0.0 | 0.0016 (0.133) | 0.0332 (0.044) | 0.95 | 0.90 | 0.03 | 0.04 | ||
| 0.2 | 0.0441 (0.165) | 0.0334 (0.045) | 0.93 | 0.92 | 0.22 | 0.04 | ||
| 0.4 | 0.0810 (0.187) | 0.0366 (0.042) | 0.92 | 0.86 | 0.59 | 0.12 | ||
| 0.6 | 0.0761 (0.251) | 0.0303 (0.041) | 0.92 | 0.88 | 0.88 | 0.22 | ||
| 0.8 | 0.1081 (0.329) | 0.0214 (0.044) | 0.93 | 0.93 | 0.95 | 0.30 | ||
| 0.0 | 0.0158 (0.178) | 0.0640 (0.104) | 0.93 | 0.99 | 0.08 | 0.07 | ||
| 0.2 | 0.1112 (0.175) | 0.0850 (0.083) | 0.89 | 0.92 | 0.13 | 0.09 | ||
| 0.4 | 0.1499 (0.187) | 0.0985 (0.065) | 0.91 | 0.64 | 0.30 | 0.16 | ||
| 0.6 | 0.2177 (0.219) | 0.0972 (0.068) | 0.86 | 0.68 | 0.65 | 0.25 | ||
| 0.8 | 0.3546 (0.353) | 0.0722 (0.092) | 0.87 | 0.98 | 0.83 | 0.40 | ||
| 0.0 | 0.0086 (0.115) | 0.0492 (0.009) | 0.0023 (0.022) | 0.97 | 0.15 | 0.95 | 0.02 | 0.04 |
| 0.2 | 0.0110 (0.142) | 0.0491 (0.009) | 0.0019 (0.022) | 0.95 | 0.14 | 0.95 | 0.37 | 0.07 |
| 0.4 | 0.0026 (0.129) | 0.0489 (0.008) | 0.0011 (0.020) | 0.96 | 0.12 | 0.94 | 0.90 | 0.05 |
| 0.6 | 0.0039 (0.141) | 0.0492 (0.008) | 0.0010 (0.021) | 0.94 | 0.13 | 0.96 | 0.99 | 0.05 |
| 0.8 | 0.0134 (0.102) | 0.0492 (0.009) | 0.0010 (0.020) | 0.95 | 0.15 | 0.94 | 1.00 | 0.06 |
| 0.0 | 0.0113 (0.114) | 0.0027 (0.019) | 0.96 | 0.95 | 0.04 | 0.05 | ||
| 0.2 | 0.0166 (0.123) | 0.0025 (0.021) | 0.95 | 0.95 | 0.34 | 0.04 | ||
| 0.4 | 0.0316 (0.147) | 0.0025 (0.020) | 0.97 | 0.96 | 0.81 | 0.04 | ||
| 0.6 | 0.0191 (0.115) | 0.0022 (0.021) | 0.95 | 0.95 | 1.00 | 0.05 | ||
| 0.8 | 0.0233 (0.121) | 0.0010 (0.019) | 0.94 | 0.94 | 1.00 | 0.04 | ||
†The data are simulated assuming between one and five infections with equal probabilities of 0.20 whereas the estimation approach assumes c=2 fixed infections. See the caption for Fig. 1 for definitions of terms.
‡The data are simulated assuming a conditional Poisson distribution with λ=2, whereas the estimation procedure assumes c=2 fixed infections.
§The data are simulated assuming between one and five infections with equal probabilities of 0.20.
§§The data are simulated assuming a conditional Poisson distribution with λ=2. The number of infections is assumed to range from 1 to 10.
Estimated haplotype effects for Uganda†
| ( | |||||
|---|---|---|---|---|---|
| 1 | T G A A C G C C G A G C | 0.328 | −0.108 | 0.099 | 0.278 |
| 2 | T G A A C G C C G A G A | 0.241 | −0.066 | 0.092 | 0.471 |
| 3 | T G A A C G C G A A G A | 0.103 | −0.032 | 0.106 | 0.762 |
| 4 | T G A A C G C G G A G A | 0.057 | −0.148 | 0.150 | 0.324 |
| 5 | T G G G T A C G G A G A | 0.044 | −0.257 | 0.151 | 0.089 |
| 6 | T G G G C G C G G A G C | 0.046 | −0.081 | 0.240 | 0.737 |
| 7 | T G A A C G C C A A G A | 0.046 | −0.023 | 0.165 | 0.891 |
| 8 | T G G A C G C C G A G C | 0.041 | −0.484 | 0.133 | <0.001‡ |
| 9 | T G A A C G C G G A G C | 0.034 | 0.200 | 0.583 | 0.731 |
| 10 | T G G G C A C G G A G A | 0.022 | 0.159 | 0.331 | 0.631 |
| 11 | T G G G T G C G G A G A | 0.011 | 0.344 | 0.008 | <0.001‡ |
| 12 | T G G A C G C C G A A T | 0.005 | −0.137 | 0.000 | <0.001‡ |
| 13 | T G G G C G A G A A G A | 0.011 | 0.292 | 0.806 | 0.717 |
| 14 | T G G A C G C C G A G A | 0.009 | 0.206 | 2.031 | 0.919 |
†The results are based on a sample of size n=126 and assume a Poisson model for the number of strains per individual.
‡The haplotype effect on the RBC count is significantly different from 0 after applying a Bonferroni adjustment for multiple comparisons.