| Literature DB >> 22962486 |
Nico Pfeifer1, Thomas Lengauer.
Abstract
MOTIVATION: Due to the high mutation rate of human immunodeficiency virus (HIV), drug-resistant-variants emerge frequently. Therefore, researchers are constantly searching for new ways to attack the virus. One new class of anti-HIV drugs is the class of coreceptor antagonists that block cell entry by occupying a coreceptor on CD4 cells. This type of drug just has an effect on the subset of HIVs that use the inhibited coreceptor. A good prediction of whether the viral population inside a patient is susceptible to the treatment is hence very important for therapy decisions and pre-requisite to administering the respective drug. The first prediction models were based on data from Sanger sequencing of the V3 loop of HIV. Recently, a method based on next-generation sequencing (NGS) data was introduced that predicts labels for each read separately and decides on the patient label through a percentage threshold for the resistant viral minority.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22962486 PMCID: PMC3436800 DOI: 10.1093/bioinformatics/bts373
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Choice of representatives: This plot shows information about the reads of a patient. Each cross in the plot at the top corresponds to a read of the patient and the coordinates are computed by projecting the property encoded reads onto the first and second principal components (PC1 and PC2). The chosen representatives in the PC1 dimension are marked by red circles. The plot at the bottom shows the read counts for the corresponding reads
Fig. 2.Fold reduction in plasma viral load (PVL): This plot shows the log10 reduction in PVL at week eight compared with baseline levels measured as copies/ml
log10 reduction in pVL after eight weeks
| no risk | at risk | Δ | |
|---|---|---|---|
| geno2pheno-CSanger | 2.4 | 2.3 | 0.1 |
| geno2pheno-CSanger+ | 2.4 | 2.1 | 0.3 |
| geno2pheno-CNGS–Sanger | 2.4 | 1.9 | 0.5 |
| geno2pheno-CNGS–Sanger+ | 2.4 | 1.7 | 0.7 |
| geno2pheno-CNGS | 2.4 | 1.0 | 1.4 |
| Swenson | (2.4) | (1.4) | (1.0) |
Performance comparison for the different methods regarding the median reduction of log10 pVL in the different patient groups, measured in copies/ml eight weeks after the beginning of the treatment: The first number corresponds to patients that had been classified as no risk of X4 emergence and the second number is for the patients classified as at risk of X4 emergence. The third number corresponds to the difference between the medians of the ‘no risk’ and the ‘at risk’ group. The larger the Δ the better the prediction method is able to distinguish patients with treatment success from patients without treatment success.
aIn contrast to the other approaches, parts of the test data were utilized to optimize model parameters.
Fig. 3.Visualization example of a prediction: This plot shows the visualization of the positional weights as described in the Methods section for one test sequence during one of the nested CV runs of geno2pheno-CNGS–Sanger. The letters are depicted in different colors according to the positional weights. The red color corresponds to the at risk of X4 emergence class. Positions that make the sequence more similar to no risk of X4 emergence sequences are marked in green. The higher the positional weight, the larger is the corresponding letter
Important positions for prediction success
| (pos, aa): | (pos, aa): | (pos, aa): | |
|---|---|---|---|
| run 1 | (9,K): 5.88405e–08 | (11,R): 1.09165e–06 | (27,V): 3.09939e–05 |
| run 2 | (9,K): 3.71415e–07 | (11,R): 2.24278e–05 | (27,V): 2.75498e–05 |
| run 3 | (9,K): 1.23451e–05 | ||
| run 4 | (19,V): 8.1314e–05 | (31,K): 1.25565e–05 | |
| run 5 | (11,R): 1.32773e–07 | (19,V): 1.37038e–05 | (25,A): 3.75772e–05 |
This table shows the (position, amino acid) pairs that were significant after Bonferroni correction and had a smaller p-value in the test without the modal sequence representatives.
Fig. 4.Bound V3 loop with highlighted important positions: This plot shows the V3 loop of the PDB structure 2QAD (Huang ), visualized with BALLView (Hildebrandt ). The important positions are highlighted in yellow
Sustained virologic response
| no risk | at risk | |
|---|---|---|
| geno2pheno-CSanger | 47% | 42% |
| geno2pheno-CSanger+ | 46% | 36% |
| geno2pheno-CNGS–Sanger | 47% | 33% |
| geno2pheno-CNGS–Sanger+ | 47% | 31% |
| geno2pheno-CNGS | 48% | 22% |
| Swenson | (49%) | (26%) |
Performance of the different methods regarding sustained virologic response defined as having a pVL lower than 50 copies/ml at week 48: The first number is the percentage of patients with a sustained virologic response that had been classified as no risk of X4 emergence and the second number gives this percentage for the patients classified as at risk of X4 emergence.
aIn contrast to the other approaches, parts of the test data were utilized to optimize model parameters.