| Literature DB >> 16737543 |
Abstract
BACKGROUND: HIV can evolve drug resistance rapidly in response to new drug treatments, often through a combination of multiple mutations 123. It would be useful to develop automated analyses of HIV sequence polymorphism that are able to predict drug resistance mutations, and to distinguish different types of functional roles among such mutations, for example, those that directly cause drug resistance, versus those that play an accessory role. Detecting functional interactions between mutations is essential for this classification. We have adapted a well-known measure of evolutionary selection pressure (Ka/Ks) and developed a conditional Ka/Ks approach to detect important interactions.Entities:
Year: 2006 PMID: 16737543 PMCID: PMC1523337 DOI: 10.1186/1745-6150-1-14
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Positive conditional selection pressure in HIV-1 protease
| Codon Y | Codon X | NXaYa | NXaYs | ( | NXoYa | NXoYs | ( | ( | LOD | p |
| 90 | 73 | 1879 | 6 | 367.63 | 6842 | 628 | 12.79 | 28.74 | >300 | <0.001 |
| 82 | 24 | 564 | 0 | 277.79 | 3625 | 680 | 2.63 | 105.8 | 98.09 | <0.001 |
| 90 | 95 | 161 | 1 | 189 | 7757 | 595 | 15.3 | 12.35 | 52.35 | <0.001 |
| 82 | 48 | 483 | 2 | 118.95 | 3727 | 663 | 2.77 | 42.96 | 79.90 | <0.001 |
| 82 | 53 | 366 | 2 | 90.13 | 4005 | 694 | 2.84 | 31.71 | 59.79 | <0.001 |
| 82 | 55 | 332 | 2 | 81.76 | 3994 | 701 | 2.81 | 29.14 | 53.96 | <0.001 |
| 30 | 88 | 1730 | 9 | 71.1 | 727 | 677 | 0.4 | 179 | 217.97 | <0.001 |
| 63 | 50 | 132 | 1 | 65.01 | 22576 | 2037 | 5.46 | 11.91 | 21.31 | <0.001 |
| 90 | 89 | 329 | 6 | 64.37 | 7677 | 524 | 17.2 | 3.74 | 100.28 | <0.001 |
| 71 | 11 | 126 | 0 | 62.06 | 8775 | 964 | 4.48 | 13.84 | 21.91 | <0.001 |
| 90 | 92 | 315 | 6 | 61.63 | 7706 | 575 | 15.73 | 3.92 | 95.67 | <0.001 |
| 63 | 73 | 1711 | 14 | 60.2 | 20832 | 2016 | 5.09 | 11.83 | 269.97 | <0.001 |
| 33 | 83 | 49 | 1 | 57.52 | 2436 | 803 | 3.56 | 16.15 | 15.09 | <0.001 |
| 90 | 84 | 2039 | 45 | 53.19 | 6756 | 596 | 13.31 | 4 | >300 | <0.001 |
| 90 | 85 | 395 | 9 | 51.52 | 8361 | 627 | 15.65 | 3.29 | 117.75 | <0.001 |
Due to the long list of positive conditional selection pressure detected in the Specialty dataset, only the 15 positive selections with the strongest increase in (K/K)Y|Xa value after conditioning were showed here. Please see the Additional File for the complete list.
Codon X: the codon site where the first mutation appears.
Codon Y: the codon site where the second mutation appears.
(K/K)Y|Xa: conditional selection pressure of codon Y in the presence of an amino acid mutation at codon X.
(K/K)Y|Xo: conditional selection pressure of codon Y in the absence of any mutation at codon X.
LOD: confidence score for (K/K)Y|Xa > 1.
p: p-value for (K/K)Y|X > 1 to be random.
Figure 1Conditional Ka/Ks measurements for primary vs. accessory drug-resistance mutations. For the two possible pathways from wildtype to a double mutant, we computed the conditional K/Kvalues for each mutation conditioned on the presence or absence of the other mutation (shown as numbers next to each edge in the figure). Primary drug resistance codons are highlighted in bold. a) 24/82; b) 54/89; c, e, g) 10/90; d, f, h) 63/90. a-d are from the Specialty dataset; e-f from the Treated dataset; and g-h from the Untreated dataset (see text).
Significant differences of selection pressure between Treated and Untreated dataset
| Position | Treated | Untreated | p-value | ||||||
| Na | Ns | LOD | Na | Ns | LOD | ||||
| 11 | 0.17 | 15 | 43 | 10.38 | 0.04 | 7 | 90 | 35.47 | 0.0017 |
| 17 | 0.06 | 17 | 142 | 49.46 | 0.16 | 55 | 176 | 40.54 | 0.0006 |
| 19 | 2.9 | 183 | 97 | 17.72 | 1.72 | 278 | 248 | 9.45 | 0.0004 |
| 0.39 | 137 | 131 | 12.56 | 0.07 | 41 | 215 | 80.54 | < 0.0001 | |
| 23 | 0.25 | 27 | 163 | 14.25 | 0.08 | 11 | 222 | 35.57 | 0.0006 |
| 0.53 | 49 | 46 | 3.08 | 0.01 | 2 | 69 | 31.72 | < 0.0001 | |
| 0.71 | 72 | 51 | 1.62 | 0.02 | 3 | 82 | 36.4 | < 0.0001 | |
| 0.9 | 27 | 13 | 0.51 | 0.07 | 2 | 12 | 5.74 | 0.0007 | |
| 58 | 0.1 | 46 | 164 | 53.2 | 0.01 | 3 | 207 | 113.48 | < 0.0001 |
| 61 | 0.1 | 55 | 211 | 70.15 | 0.05 | 63 | 431 | 172.9 | 0.0031 |
| 62 | 13.65 | 419 | 2 | 8.93 | 2.42 | 446 | 12 | 3.38 | 0.0094 |
| 6.1 | 1111 | 91 | 100.36 | 3.23 | 1343 | 208 | 71.62 | < 0.0001 | |
| 66 | 0.01 | 12 | 385 | 181.09 | 0.00 | 1 | 551 | 286.44 | 0.0002 |
| 67 | 0.2 | 64 | 118 | 26.07 | 0.10 | 75 | 290 | 95.99 | 0.0002 |
| 6.99 | 601 | 43 | 58.9 | 1.11 | 195 | 88 | 0.63 | < 0.0001 | |
| 76 | 1.02 | 34 | 40 | 0.28 | 0.16 | 10 | 74 | 11.25 | < 0.0001 |
| 24.42 | 635 | 13 | 91.3 | 9.72 | 700 | 36 | 79.13 | 0.0025 | |
| 83 | 0.15 | 18 | 45 | 13.27 | 0.03 | 10 | 107 | 49.35 | 0.0006 |
| 1.46 | 112 | 5 | 0.56 | 0.00 | 0 | 14 | 300 | < 0.0001 | |
| 85 | 1.39 | 48 | 15 | 0.79 | 0.11 | 3 | 12 | 5.01 | 0.0001 |
| 0.82 | 154 | 70 | 1.11 | 0.1 | 25 | 95 | 32.49 | < 0.0001 | |
| 92 | 0.1 | 34 | 131 | 44.36 | 0.04 | 25 | 211 | 90.47 | 0.0044 |
a Mutations at codon in Italic are known associated with drug resistance. The nine codons that underwent a confident shift from positive selection in Treated to negative selection in Untreated are in bold.
Positive selection pressure in Treated and Untreated datasets
| Selection pressure | Conditional selection pressure | |||||
| Treated | +a | + | - | +a | + | Not + selected, or not present |
| Untreated | + | -b | + | + | Not + selected, or not present | + |
| Codon | 12,13,19,33,35,37,41,57,60,62,63,6477,93 71c | 10,24,30,54,73,74,82,90 53d | None | 19, 63, 71 93e | 10, 24, 30, 35, 48, 54, 73, 82, 88, 90 | None |
a positively selected. For selection pressure, K/K> 1 and LOD > 2; for conditional selection pressure, (K/K)Y|Xa > 1, (K/K)Y|X > 1, LOD > 3 and p < 0.001.
b negatively selected (K/K< 1 and LOD > 2)
c codon 71 is positively selected in Treated samples (K/K> 1 and LOD > 2), and it has a K/Kvalue greater than 1 but a LOD score less than 2 in Untreated samples.
d codon 53 is positively selected in Treated samples (K/K> 1 and LOD > 2), and it has a K/Kvalue less than 1 but a LOD score less than 2 in Untreated samples.
e In Treated dataset, codon 93 is positively selected with (K/K)Y|Xa > 1, (K/K)Y|X > 1, LOD > 3 but p > 0.001.
Significant differences of conditional selection pressure between Treated and Untreated dataset
| Codon X | Codon Y | Treated | Untreated | p-value | ||||
| ( | NXaYa | NXaYs | ( | NXaYa | NXaYs | |||
| 20 | 10 | 5.21 | 74 | 7 | 0.08 | 1 | 6 | 0.000021 |
| 82 | 10 | 10.78 | 197 | 9 | 0.49 | 4 | 4 | 0.000512 |
| 71 | 10 | 4.39 | 285 | 32 | 0.42 | 23 | 27 | <0.000001 |
| 24 | 10 | 25.12 | 51 | 0 | 0.25 | 2 | 4 | 0.000038 |
| 12 | 19 | 18.25 | 35 | 3 | 3.85 | 59 | 24 | 0.006913 |
| 10 | 54 | 7.81 | 164 | 9 | 0.19 | 4 | 9 | <0.000001 |
| 71 | 54 | 3.65 | 162 | 19 | 0.14 | 4 | 12 | <0.000001 |
| 10 | 63 | 12.42 | 353 | 14 | 2.15 | 70 | 16 | 0.000012 |
| 93 | 63 | 21.23 | 431 | 10 | 6.72 | 382 | 28 | 0.000995 |
| 36 | 71 | 24.13 | 196 | 4 | 0.46 | 14 | 15 | <0.000001 |
| 63 | 73 | 2.76 | 112 | 20 | 0.10 | 4 | 19 | <0.000001 |
| 71 | 82 | 10.73 | 196 | 9 | 0.59 | 6 | 5 | 0.000199 |
| 63 | 90 | 18.32 | 359 | 23 | 0.34 | 9 | 31 | <0.000001 |
Due to the long list of significant shift results, only the 13 shifts that involved a positive conditional selection in Treated datasets are listed in this table.
Figure 2Positive selection pressure in the Specialty andTreated datasets. K/Kvalues for positive selection pressures in the Specialty and Treated datasets are plotted. K/Kvalues with a LOD < 2 in the Treated dataset were plotted as empty triangles. Different colors represent codons with different functions on drug resistance: Red: primary drug resistance mutation codon; Blue: accessory drug resistance mutation codon; Light blue: function unknown codon. The correlation coefficient of the selection pressures in these two datasets is 0.887, suggesting a significant quantitative agreement.
Figure 3Positive conditional selection pressure in the Specialty and Treated datasets. (K/K)Y|Xa values for positive conditional selection pressures in the Specialty and Treated datasets are plotted. To filter the dataset to sites where (K/K)Y|Xa could be estimated with accuracy, only selection pressures with NXaYa + NXaYs > 100 in both datasets were showed. An empty triangle indicated that we were not confident about the conditional selection value in the Treated dataset. The correlation coefficient of the conditional selection pressures in these two datasets is 0.495.
Figure 4Independent HIV-1 datasets yield reproducible positive selection results. Our method identified positively selected sites (marked with different symbols depending on whether the site is a known drug resistance mutation, accessory mutation, etc.) in independent analyses of four different datasets: Specialty: 48,387 U.S. patient samples, a mix of treated and untreated patients; Stanford-Treated: 1797 U.S. patient samples with specific drug treatments; Stanford-Untreated: 2628 U.S. untreated patient samples; African: 399 African patient samples, HIV-1 subtype C (predominantly untreated patients).
Figure 5Effects of mixing Treated and Untreated data on selection pressure calculation. We have mixed the Treated (1797 samples) with the Untreated (2628 samples) to assess the effects of a mixed data on selection pressure analysis. This mixing caused up to two-fold decreases in positively selected Ka/Ks values (compared with the Treated dataset), with shifts averaging of 32.7%. Out of the 24 positively selected codons in the Treated dataset, only two became negatively selected in the Mixture dataset. K/Kvalues with a LOD < 2 in the Mixture dataset were plotted as empty triangles.