| Literature DB >> 30343664 |
Shrikant D Pawar1,2, Christopher Freas1, Irene T Weber2, Robert W Harrison3,4.
Abstract
BACKGROUND: Drug resistance in HIV is the major problem limiting effective antiviral therapy. Computational techniques for predicting drug resistance profiles from genomic data can accelerate the appropriate choice of therapy. These techniques can also be used to select protease mutants for experimental studies of resistance and thereby assist in the development of next-generation therapies.Entities:
Keywords: Drug resistance; HIV protease; Machine learning; RBM; Structure-based
Mesh:
Substances:
Year: 2018 PMID: 30343664 PMCID: PMC6196403 DOI: 10.1186/s12859-018-2331-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The results of the expansion for each of the HIV-1 PR inhibitors
| Inhibitor | No. isolates | No. sequences | No. resistant | No. sensitive | Fraction resistant |
|---|---|---|---|---|---|
| SQV | 1722 | 10258 | 4206 | 6052 | 41.0 |
| DRV | 607 | 5973 | 1889 | 4084 | 31.6 |
| LPV | 1444 | 10239 | 5095 | 5144 | 49.8 |
| NFV | 1771 | 10911 | 6170 | 4741 | 56.5 |
| IDV | 1730 | 10537 | 5122 | 5415 | 48.6 |
| ATV | 1141 | 8430 | 4237 | 4193 | 50.3 |
| FPV | 1681 | 10521 | 4405 | 6116 | 41.9 |
| TPV | 847 | 7363 | 2062 | 5301 | 28.0 |
The accuracy of the machine learning model is shown here
| Inhibitor | Accuracy | PPV | Recall | F |
|---|---|---|---|---|
| Idv | 0.979 | 0.974 | 0.985 | 0.979 |
| Lpv | 0.984 | 0.977 | 0.992 | 0.984 |
| Sqv | 0.969 | 0.963 | 0.986 | 0.974 |
| Tpv | 0.987 | 0.984 | 0.998 | 0.991 |
| Drv | 0.988 | 0.985 | 0.998 | 0.992 |
| Atv | 0.983 | 0.976 | 0.989 | 0.983 |
| Nfv | 0.978 | 0.974 | 0.975 | 0.975 |
| Fpv | 0.988 | 0.984 | 0.998 | 0.991 |
The estimated standard deviation amongst the five folds is <0.013 for all values
Cross training reveals similarity between the inhibitors
| Compound | Atv | Drv | Fpv | Idv | Lpv | Nfv | Sqv | Tpv |
|---|---|---|---|---|---|---|---|---|
| Atv | 0.990 | 0.868 | 0.880 | 0.955 | 0.946 | 0.914 | 0.893 | 0.819 |
| Drv | 0.767 | 0.996 | 0.818 | 0.786 | 0.785 | 0.718 | 0.792 | 0.925 |
| Fpv | 0.929 | 0.873 | 0.981 | 0.889 | 0.886 | 0.822 | 0.822 | 0.828 |
| Idv | 0.945 | 0.863 | 0.880 | 0.989 | 0.960 | 0.905 | 0.878 | 0.809 |
| Lpv | 0.939 | 0.892 | 0.877 | 0.963 | 0.988 | 0.891 | 0.865 | 0.837 |
| Nfv | 0.923 | 0.853 | 0.824 | 0.918 | 0.901 | 0.987 | 0.837 | 0.758 |
| Sqv | 0.898 | 0.837 | 0.825 | 0.890 | 0.871 | 0.840 | 0.983 | 0.807 |
| Tpv | 0.723 | 0.929 | 0.765 | 0.729 | 0.728 | 0.655 | 0.732 | 0.993 |
These numbers show the accuracy when a model trained on the compound at the start of the row is used to classify the data from the other inhibitors
Fig. 1Principal Component Analysis on the HIV-1 PR Datasets. The similarity in the curves indicates that the datasets have a similar underlying structure
Fig. 2The chemical structures for a sulfonamide-containing (DRV) and non-sulfonamide-containing (ATV) inhibitor are shown here. These demonstrate the variety of chemistry used in inhibitors