| Literature DB >> 31013272 |
Hayley Warsinske1,2, Rohit Vashisht1,2, Purvesh Khatri1,2.
Abstract
BACKGROUND: The World Health Organization (WHO) and Foundation for Innovative New Diagnostics (FIND) have published target product profiles (TPPs) calling for non-sputum-based diagnostic tests for the diagnosis of active tuberculosis (ATB) disease and for predicting the progression from latent tuberculosis infection (LTBI) to ATB. A large number of host-derived blood-based gene-expression biomarkers for diagnosis of patients with ATB have been proposed to date, but none have been implemented in clinical settings. The focus of this study is to directly compare published gene signatures for diagnosis of patients with ATB across a large, diverse list of publicly available gene expression datasets, and evaluate their performance against the WHO/FIND TPPs. METHODS ANDEntities:
Mesh:
Substances:
Year: 2019 PMID: 31013272 PMCID: PMC6478271 DOI: 10.1371/journal.pmed.1002786
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Gene expression signatures compared within this study.
| Citation | PubMed PMID | GEO discovery dataset | Signature name | Indication | Number of genes | Statistical model | Retraining required |
|---|---|---|---|---|---|---|---|
| Anderson et al. [ | 24785206 | GSE39940 | Anderson42 | ATB vs LTBI | 42 | Difference of sums | No |
| Anderson51 | ATB vs ODs | 51 | Difference of sums | No | |||
| Berry et al. [ | 20725040 | GSE19491 | Berry393 | ATB vs (LTBI & HCs) | 393 | Yes | |
| Berry86 | ATB vs ODs | 86 | Yes | ||||
| Bloom et al. [ | 23940611 | GSE42834 | Bloom144 | ATB vs (ODs & HCs) | 144 | Support vector machine | Yes |
| Laux da Costa et al. [ | 26025597 | GSE42834 | daCosta3 | ATB vs ODs | 3 | Random forest | Yes |
| Jacobsen et al. [ | 17318616 | GSE6112 | Jacobsen3 | ATB vs LTBI | 3 | Linear discriminant analysis | Yes |
| Kaforou et al. [ | 24167453 | GSE37250 | Kaforou27 | ATB vs LTBI | 27 | Difference of means | No |
| Kaforou44 | ATB vs ODs | 44 | Difference of means | No | |||
| Kaforou52 | ATB vs (LTBI & ODs) | 52 | Difference of means | No | |||
| Leong et al. [ | 29559120 | GSE101705 | Leong24 | ATB vs LTBI | 24 | Rigid logistic regression | Yes |
| Maertzdorf et al. [ | 26682570 | GSE74092 | Maertzdorf15 | ATB vs (LTBI & HCs) | 15 | Random forest | Yes |
| Maertzdorf4 | ATB vs (LTBI & HCs) | 4 | Random forest | Yes | |||
| Sambarey et al. [ | 28065665 | GSE37250 | Sambarey10 | ATB vs (LTBI & HCs & ODs) | 10 | Linear discriminant analysis | Yes |
| Sweeney et al. [ | 26907218 | GSE19491, GSE37250, GSE42834 | Sweeney3 | ATB vs (LTBI & ODs & HCs) | 3 | Difference of geometric means | No |
| Verhagen et al. [ | 23375113 | GSE41055 | Verhagen10 | ATB vs (LTBI & HCs) | 10 | Random forest | Yes |
Gene signatures are named by combining the last name of the first author followed by the number of genes in the signature.
*The diagnostic model for the signature was created using this dataset as the original training dataset was not available.
ATB, active tuberculosis; GEO, Gene Expression Omnibus; HC, healthy control; LTBI, latent tuberculosis infection; OD, other disease.
Transcriptome datasets used for comparison of 16 gene signatures for diagnosis of ATB.
| GEO accession | GEO platform | Country | Tissue | Age (years) | HIV status | Diagnosis method | HCs | LTBI | ATB | ODs | Total | Notes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GSE19491 | GPL6947 | UK, South Africa | Whole blood | >17 | Negative | Sputum culture | 117 | 69 | 61 | 193 | 440 | ODs included staph, strep, Still disease, systemic lupus erythematosus, and pediatric systemic lupus erythematosus |
| GSE28623 | GPL4133 | The Gambia | Whole blood | 16–53 | Negative | Sputum microscopy, chest X-ray | 37 | 25 | 46 | 108 | ||
| GSE29536 | GPL6102 | UK | Whole blood | 11–88 | Negative | 6 | 9 | 15 | Only the TB dataset within this series was used | |||
| GSE34608 | GPL6480 | Germany | Whole blood | 17–73 | Negative | 18 | 8 | 18 | 108 | OD samples were sarcoidosis samples; controls may have included some IGRA-positive individuals | ||
| GSE37250 | GPL10558 | Malawi | Whole blood | >17 | Some positive | Sputum culture | 167 | 195 | 175 | 537 | ||
| GSE39939 | GPL10558 | Kenya | Whole blood | <15 | Some positive | Sputum culture | 14 | 79 | 64 | 157 | ||
| GSE39940 | GPL10558 | Malawi | Whole blood | <15 | Some positive | Sputum culture | 54 | 111 | 169 | 334 | ||
| GSE41055 | GPL5175 | Venezuela | Whole blood | <15 | Negative | 9 | 9 | 9 | 27 | |||
| GSE42834 | GPL10558 | Germany | Whole blood | >17 | Negative | Sputum culture | 118 | 40 | 123 | 281 | ODs included sarcoidosis, pneumonia, and lung cancer | |
| GSE50834 | GPL10558 | South Africa | PBMCs | 30–40 | Positive | 23 | 21 | 44 | ||||
| GSE56153 | GPL6883 | Indonesia | Whole blood | >15 | Negative | Sputum microscopy, chest X-ray, clinical presentation | 18 | 18 | 36 | HIV status not measured in controls, but HIV has very low prevalence in Indonesia | ||
| GSE54992 | GPL570 | China | PBMCs | 18–68 | Negative | Sputum microscopy, chest X-ray, clinical presentation, sputum culture | 6 | 6 | 9 | 21 | Dataset also included 18 treated samples not used in this analysis; samples confirmed with sputum culture were not identified | |
| GSE62147 | GPL6480 | Germany | Whole blood | 15–79 | Negative | Sputum culture | 14 | 12 | 26 | OD was | ||
| GSE62525 | GPL16951 | Taiwan | PBMCs | Negative | 14 | 14 | 14 | 42 | ||||
| GSE69581 | GPL10558 | South Africa | Whole blood | >17 | Positive | Microbiologically confirmed (method not specified) | 25 | 15 | 40 | Dataset also included samples with subclinical TB | ||
| GSE73408 | GPL11532 | US | Whole blood | >17 | Negative | Sputum culture | 39 | 35 | 35 | 109 | ||
| GSE79362 | GPL11154 | South Africa | Whole blood | 12–18 | Negative | Sputum smear, sputum culture | 101 | 19 | 120 | Longitudinal samples were collected | ||
| GSE81746 | GPL17077 | India | Whole blood | 25–65 | Unknown | 2 | 4 | 6 | ||||
| GSE83456 | GPL10558 | UK | Whole blood | Negative | Granuloma biopsy, clinical presentation with response to therapy, radiology with response to treatment, sputum culture | 61 | 45 | 49 | 155 | ODs included sarcoidosis; there were also extrapulmonary TB samples available from this dataset. | ||
| GSE83892 | GPL10559 | UK | Whole blood | >17 | Positive | Cerebral spinal fluid smear, cerebral spinal fluid culture, conventional and real-time PCR of cerebral spinal fluid | 17 | 99 | 116 | ATB patients in this cohort included patients with complications from immune reconstitution inflammatory syndrome and TB meningitis | ||
| GSE84076 | GPL16791 | Brazil | Whole blood | >18 | Negative | Sputum microscopy, clinical presentation, sputum culture | 6 | 6 | 9 | 21 | This dataset also included some samples taken from individuals after treatment | |
| GSE101705 | GPL18573 | India | Whole blood | >6 | Negative | Sputum culture | 16 | 28 | 44 | |||
| GSE107731 | GPL15207 | Mongolia | Whole blood | Unknown | 3 | 3 | 6 | |||||
| GSE107994 | GPL20301 | UK | Whole blood | 16–84 | Negative | Sputum culture, sputum PCR | 119 | 118 | 53 | 290 | Dataset included samples from latent TB progressors | |
ATB, active tuberculosis; GEO, Gene Expression Omnibus; HC, healthy control; IGRA, Interferon Gamma Release Assay; LTBI, latent tuberculosis infection; OD, other disease; PBMC, peripheral blood mononuclear cell; TB, tuberculosis.
Fig 1Distribution of genes across the signatures included in this study.
Each row represents a gene signature for active tuberculosis diagnosis. Each column represents 1 gene. The number at the end of a signature name represents the number of genes in the given signature. Genes present in only 1 signature are red; those in 2 or more signatures are blue.
Weighted mean AUROC, specificity at 90% sensitivity, and NPV at 2% prevalence for ATB versus all other conditions across all datasets and across only culture-confirmed datasets for each of the 16 gene signatures.
| Signature | Culture-confirmed datasets | All datasets | ||||
|---|---|---|---|---|---|---|
| AUROC (95% CI) | Specificity (95% CI) | NPV | AUROC (95% CI) | Specificity (95% CI) | NPV | |
| Sweeney3 | 0.89 (0.82–0.96) | 0.74 (0.40–0.89) | 0.99 | 0.85 (0.72–0.99) | 0.66 (0.23–0.93) | 0.98 |
| Jacobsen3 | 0.86 (0.72–1.00) | 0.68 (0.37–0.93) | 0.99 | 0.83 (0.69–0.98) | 0.59 (0.21–0.92) | 0.99 |
| daCosta3 | 0.83 (0.60–1.00) | 0.65 (0.31–0.88) | 0.99 | 0.76 (0.45–1.00) | 0.50 (0.00–0.95) | 0.94 |
| Maertzdorf4 | 0.83 (0.74–0.91) | 0.58 (0.28–0.82) | 0.99 | 0.79 (0.64–0.95) | 0.54 (0.24–0.79) | 0.99 |
| Sambarey10 | 0.90 (0.83–0.97) | 0.74 (0.36–0.94) | 0.99 | 0.82 (0.57–1.00) | 0.59 (0.18–0.94) | 0.99 |
| Verhagen10 | 0.53 (0.46–0.60) | 0.13 (0.11–0.19) | 0.98 | 0.54 (0.41–0.68) | 0.14 (0.00–0.32) | 0.92 |
| Maertzdorf15 | 0.82 (0.71–0.92) | 0.58 (0.30–0.82) | 0.99 | 0.79 (0.66–0.92) | 0.54 (0.23–0.83) | 0.99 |
| Leong24 | 0.74 (0.53–0.95) | 0.41 (0.12–0.63) | 0.99 | 0.75 (0.54–0.95) | 0.43 (0.04–0.77) | 0.99 |
| Kaforou27 | 0.86 (0.71–0.92) | 0.66 (0.40–0.92) | 0.99 | 0.83 (0.64–1.00) | 0.62 (0.28–0.94) | 0.99 |
| Anderson42 | 0.84 (0.75–0.93) | 0.61 (0.39–0.82) | 0.99 | 0.82 (0.66–0.97) | 0.58 (0.27–0.87) | 1.00 |
| Kaforou44 | 0.82 (0.67–0.97) | 0.61 (0.27–0.80) | 0.99 | 0.78 (0.56–1.00) | 0.54 (0.12–0.85) | 0.99 |
| Anderson51 | 0.60 (0.42–0.79) | 0.22 (0.00–0.44) | 0.99 | 0.58 (0.33–0.82) | 0.21 (0.00–0.52) | 0.96 |
| Kaforou52 | 0.87 (0.77–0.97) | 0.67 (0.45–0.87) | 0.99 | 0.84 (0.70–0.99) | 0.62 (0.29–0.92) | 0.99 |
| Berry86 | 0.67 (0.44–0.90) | 0.21 (0.00–0.76) | 0.47 | 0.69 (0.36–1.00) | 0.29 (0.00–0.65) | 0.47 |
| Bloom144 | 0.81 (0.61–1.00) | 0.50 (0.10–0.64) | 0.99 | 0.74 (0.52–0.96) | 0.33 (0.00–0.65) | 0.98 |
| Berry393 | 0.72 (0.43–1.00) | 0.40 (0.00–1.00) | 0.74 | 0.71 (0.43–0.99) | 0.34 (0.00–0.98) | 0.66 |
ATB, active tuberculosis; AUROC, area under the receiver operating characteristic curve; NPV, negative predictive value.
Comparison of 16 gene signatures for diagnosis of ATB using bivariate meta-analysis.
| Signature | Culture-confirmed datasets | All datasets | ||||
|---|---|---|---|---|---|---|
| DOR (95% CI) | Heterogeneity | FPR (95% CI) | DOR (95% CI) | Heterogeneity | FPR (95% CI) | |
| Sweeney3 | 30.50 (14.95–62.24) | 0 | 0.18 (0.13–0.26) | 16.66 (11.56–24.00) | 0 | 0.20 (0.16–0.24) |
| Kaforou52 | 21.05 (12.20–36.34) | 0 | 0.23 (0.16–0.33) | 14.05 (10.10–19.54) | 0 | 0.23 (0.18–0.28) |
| Kaforou44 | 12.22 (6.04–24.71) | 0 | 0.22 (0.16–0.30) | 9.05 (6.42–12.74) | 0 | 0.22 (0.17–0.29) |
| Anderson51 | 4.96 (2.86–8.59) | 0 | 0.26 (0.09–0.53) | 3.91 (2.90–5.26) | 0 | 0.32 (0.20–0.46) |
| Jacobsen3 | 19.89 (10.72–36.89) | 4.03 | 0.22 (0.16–0.30) | 13.04 (9.54–17.82) | 0 | 0.21 (0.17–0.26) |
| Kaforou27 | 17.21 (11.08–26.74) | 1.13 | 0.25 (0.17–0.34) | 13.85 (10.32–18.59) | 0 | 0.23 (0.18–0.29) |
| Maertzdorf15 | 14.38 (8.04–25.70) | 19.07 | 0.27 (0.21–0.34) | 11.65 (8.37–16.22) | 0 | 0.26 (0.20–0.31) |
| Maertzdorf4 | 13.82 (8.75–21.83) | 0.73 | 0.24 (0.18–0.31) | 9.69 (7.39–12.71) | 3.06 | 0.28 (0.23–0.33) |
| Anderson42 | 11.26 (7.50–16.92) | 4.8 | 0.28 (0.24–0.33) | 10.65 (7.87–14.42) | 8.39 | 0.26 (0.21–0.31) |
| Sambarey10 | 19.13 (10.38–35.25) | 16.87 | 0.19 (0.13–0.28) | 12.18 (8.54–17.37) | 11.61 | 0.20 (0.17–0.24) |
| daCosta3 | 32.44 (14.90–70.63) | 0 | 0.26 (0.13–0.44) | 13.89 (8.14–23.71) | 13.63 | 0.45 (0.28–0.64) |
| Verhagen10 | 1.85 (1.30–2.63) | 21.43 | 0.47 (0.28–0.67) | 2.90 (2.03–4.15) | 21.02 | 0.47 (0.31–0.65) |
| Bloom144 | 9.94 (5.49–17.99) | 50.55 | 0.21 (0.13–0.32) | 6.69 (4.71–9.49) | 23.29 | 0.24 (0.16–0.34) |
| Leong24 | 8.20 (4.75–14.16) | 46.33 | 0.27 (0.18–0.39) | 8.48 (5.96–12.06) | 23.93 | 0.26 (0.19–0.34) |
| Berry393 | 17.72 (7.41–42.35) | 33.39 | 0.16 (0.09–0.27) | 9.26 (5.90–14.53) | 25.48 | 0.45 (0.25–0.66) |
| Berry86 | 12.62 (4.98–31.99) | 42.89 | 0.19 (0.04–0.57) | 6.72 (3.81–11.85) | 27.48 | 0.66 (0.35–0.87) |
ATB, active tuberculosis; DOR, diagnostic odds ratio; FPR, false positive rate.
AUROC, PPV, and NPV for progression from LTBI to ATB in the ACS cohort up to 180 days prior to diagnosis.
| Signature | AUROC (95% CI) | PPV at 2% prevalence | NPV at 2% prevalence |
|---|---|---|---|
| daCosta3 | 0.56 (0.50–0.62) | 14.60 | 98.2 |
| Sweeney3 | 0.86 (0.78–0.94) | 13.60 | 99.4 |
| Kaforou27 | 0.86 (0.78–0.94) | 13.60 | 99.4 |
| Kaforou52 | 0.87 (0.80–0.95) | 13.20 | 100 |
| Leong24 | 0.73 (0.62–0.83) | 11.00 | 99.3 |
| Jacobsen3 | 0.85 (0.76–0.93) | 10.80 | NC |
| Anderson42 | 0.85 (0.77–0.92) | 8.80 | 99.5 |
| Bloom144 | 0.68 (0.56–0.79) | 8.30 | 98.9 |
| Sambarey10 | 0.80 (0.72–0.89) | 6.40 | 99.4 |
| Kaforou44 | 0.83 (0.76–0.90) | 6.20 | 99.4 |
| Maertzdorf15 | 0.63 (0.56–0.69) | 2.80 | 99.5 |
| Anderson51 | 0.46 (0.34–0.58) | 2.70 | 98.2 |
| Maertzdorf4 | 0.51 (0.50–0.52) | 2.00 | 99.5 |
| Verhagen10 | 0.47 (0.45–0.49) | 2.00 | NC |
| Berry86 | 0.50 (0.50–0.50) | 2.00 | 98.9 |
| Berry393 | 0.50 (0.50–0.50) | 2.00 | NC |
ACS, Adolescent Cohort Study; ATB, active tuberculosis; AUROC, area under the receiver operating characteristic curve; LTBI, latent tuberculosis infection; NC, not calculated; NPV, negative predictive value; PPV, positive predictive value.