| Literature DB >> 30689732 |
Yang Yang1,2, Timothy M Walker3, A Sarah Walker3,4, Daniel J Wilson5, Timothy E A Peto3,4, Derrick W Crook3,4,6, Farah Shamout1, Tingting Zhu1, David A Clifton1,2.
Abstract
MOTIVATION: Resistance co-occurrence within first-line anti-tuberculosis (TB) drugs is a common phenomenon. Existing methods based on genetic data analysis of Mycobacterium tuberculosis (MTB) have been able to predict resistance of MTB to individual drugs, but have not considered the resistance co-occurrence and cannot capture latent structure of genomic data that corresponds to lineages.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30689732 PMCID: PMC6748723 DOI: 10.1093/bioinformatics/btz067
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Comparing performance on F1 feature set for prediction of INH, EMB, RIF, PZA, MDR-TB as defined by WHO and PANS-TB
| Models | |||||||
|---|---|---|---|---|---|---|---|
| Drugs | DA | SVM | RF | MLKNN | ECC | DeepAMR | |
| INH | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
|
| |||||||
| EMB | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
|
| |||||||
| RIF | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
|
| |||||||
| PZA | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
|
| |||||||
| MDR | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
|
| |||||||
| PANS | Sen |
|
|
|
|
|
|
| Spec |
|
|
|
|
|
| |
| AUROC |
|
|
|
|
|
| |
| F1 |
|
|
|
|
|
| |
Note: Sensitivity (sens), specificity (spec), area under receiving operating characteristic (AUROC) and F1 score are reported as mean and SD across 100 bootstrap samples. The P-value of performance measurement of the examined classifier compared to DA was obtained by Wilcoxon signed-rank test. *represents a P-value <0.05.
Fig. 1.Illustration of latent structure using t-SNE: (a) lineage distribution resulted from DeepAMR; (b) phenotype distribution resulted from DeepAMR; (c) lineage distribution resulted from DeepAMR_cluster and (d) predicted clusters resulted from DeepAMR_cluster
Fig. 2.Overview of phenotype of the examined 13 403 MTB isolates. (a) Histogram showing the phenotype of the MTB isolates for each individual anti-TB drug obtained by the drug susceptibility test (up to 11 anti-TB drugs were tested for all isolates). For each drug, the isolates with missing phenotype were excluded. (b) Heatmap visualizing the proportion of pair-wise resistance co-occurrence (non-diagonal) and mono-resistance (diagonal) across anti-TB drugs. The non-diagonal elements correspond to poly-resistant isolates that were resistant to at least two anti-TB drugs. The co-occurrence matrix is symmetric so the upper right half of the graph shows all pair-wise co-occurrence cases
Fig. 3.Ranked SNPs based on permutation feature importance resulting in positive metric with respect to INH, EMB, RIF and PZA, respectively