| Literature DB >> 32178775 |
Carolin T Turner1, Rishi K Gupta2, Evdokia Tsaliki1, Jennifer K Roe1, Prasenjit Mondal1, Georgina R Nyawo3, Zaida Palmer3, Robert F Miller2, Byron Wp Reeve3, Grant Theron3, Mahdad Noursadeghi4.
Abstract
BACKGROUND: Blood transcriptional signatures are candidates for non-sputum triage or confirmatory tests of tuberculosis. Prospective head-to-head comparisons of their diagnostic accuracy in real-world settings are necessary to assess their clinical use. We aimed to compare the diagnostic accuracy of candidate transcriptional signatures identified by systematic review, in a setting with a high burden of tuberculosis and HIV.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32178775 PMCID: PMC7113842 DOI: 10.1016/S2213-2600(19)30469-2
Source DB: PubMed Journal: Lancet Respir Med ISSN: 2213-2600 Impact factor: 30.700
Figure 1Study flowchart
Baseline characteristics of study cohort
| Age, years | 35 (27–48) | 36 (28–49) | 34 (24–43) | |
| Sex | ||||
| Male | 94 (52%) | 66 (52%) | 28 (52%) | |
| Female | 87 (48%) | 61 (48%) | 26 (48%) | |
| Ethnicity | ||||
| Black | 28 (15%) | 14 (11%) | 14 (26%) | |
| Mixed ancestry | 153 (85%) | 113 (89%) | 40 (74%) | |
| HIV status | ||||
| Unknown | 1 (1%) | 1 (1%) | 0 | |
| Uninfected | 136 (75%) | 99 (78%) | 37 (69%) | |
| Infected | 44 (24%) | 27 (21%) | 17 (31%) | |
| Antiretroviral therapy | ||||
| No | 24 (55%) | 14 (52%) | 10 (59%) | |
| Yes | 15 (34%) | 12 (44%) | 3 (18%) | |
| Unknown | 5 (11%) | 1 (4%) | 4 (24%) | |
| CD4 count | 334 (192–606) | 354 (207–707) | 326 (128–484) | |
| Haemoglobin concentration, g/dL | 13·7 (12·4–14·8) | 14·2 (13·2–15·4) | 12·6 (11·3–13·6) | |
| Leucocyte count, × 109 cells per L | 8 (6·1–10·2) | 7·6 (6–9·8) | 9·1 (6·8–11) | |
| BMI, kg/m2 | 19·9 (17·8–22·5) | 20·5 (18·4–23·2) | 19·1 (16·8–21·5) | |
| Tuberculosis symptom score | 2 (2–3) | 2 (1–3) | 3 (2–5) | |
| Previous tuberculosis | ||||
| No | 115 (64%) | 81 (64%) | 34 (63%) | |
| Yes | 66 (36%) | 46 (36%) | 20 (37%) | |
| Liquid culture | ||||
| Positive | 53 (29%) | NA | 53 (98%) | |
| Negative | 128 (71%) | 128 (100%) | 1 (2%) | |
| Sputum smear | ||||
| Positive | 15 (8%) | 1 (1%) | 14 (26%) | |
| Negative | 157 (87%) | 120 (94%) | 37 (69%) | |
| Not done | 9 (5%) | 6 (5%) | 3 (6%) | |
| Xpert | ||||
| Positive | 44 (24%) | NA | 44 (81%) | |
| Negative | 134 (74%) | 124 (98%) | 10 (19%) | |
| No result | 2 (2%) | 2 (2%) | NA | |
| Not done | 1 (1%) | 1 (1%) | NA | |
| Ultra | ||||
| Positive | 51 (28%) | 10 (8%) | 41 (76%) | |
| Negative | 103 (57%) | 94 (74%) | 9 (17%) | |
| No result | 10 (6%) | 8 (6%) | 2 (4%) | |
| Not done | 17 (9%) | 15 (12%) | 2 (4%) | |
Data are n (%) or median (IQR). Individuals positive for pulmonary tuberculosis were defined as those with either a positive liquid culture or a positive Xpert MTB/RIF result, or both. Individuals with missing data: CD4 cell counts (n=1), haemoglobin concentration (n=3), leucocytes (n=3), BMI (n=1), symptom score (n=3). BMI=body-mass index. NA=not applicable.
Category excluded for χ2 statistical test.
Antiretroviral therapy and CD4 cell counts for HIV-infected patients only.
Positive Ultra results include tests where traces of Mycobacterium tuberculosis were detected.
Description of candidate blood transcriptional signatures for tuberculosis
| Population | HIV status | Setting | Approach | Tuberculosis cases | Controls | Total | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Anderson39.LTBI | 42 | Disease risk score | Tuberculosis | Children | Positive or negative | South Africa, Malawi | Elastic net using genome-wide data | 87 | 43 | 130 |
| Anderson39.OD | 51 | Disease risk score | Tuberculosis | Children | Positive or negative | South Africa, Malawi | Elastic net using genome-wide data | 87 | 134 | 221 |
| BATF2 | 1 | NA | Tuberculosis | Adults | Negative | UK | SVM using genome-wide data | 46 | 31 | 77 |
| Duffy10 | 10 | SVM (linear kernel) | Tuberculosis | Adults | Positive or negative | South Africa | Multinomial random forest using genome-wide data | 93 | 207 | 300 |
| Gjoen8 | 7 | LASSO regression | Tuberculosis | Children | Negative | India | LASSO using 198 pre-selected genes | 47 | 36 | 83 |
| Gliddon3 | 3 | (FCGR1A + C1QB) − (ZNF296) | Tuberculosis | Adults | Positive or negative | South Africa, Malawi | FS-PLS using genome-wide data | NS | NS | 285 |
| Gliddon4 | 4 | (GBP6 + PRDM1) − (TMCC1 + ARG1) | Tuberculosis | Adults | Positive or negative | South Africa, Malawi | FS-PLS using genome-wide data | NS | NS | 293 |
| Huang11 | 13 | SVM (linear kernel) | Tuberculosis | Adults | Negative | UK | Common genes from elastic net, L1/2 and LASSO models, using genome-wide data | 16 | 79 | 95 |
| Kaforou25 | 27 | Disease risk score | Tuberculosis | Adults | Positive or negative | South Africa, Malawi | Elastic net using genome-wide data | NS | NS | 285 |
| Kaforou39 | 44 | Disease risk score | Tuberculosis | Adults | Positive or negative | South Africa, Malawi | Elastic net using genome-wide data | NS | NS | 293 |
| Kaforou45 | 53 | Disease risk score | Tuberculosis | Adults | Positive or negative | South Africa, Malawi | Elastic net using genome-wide data | NS | NS | NS |
| Maertzdorf4 | 4 | Random forest | Tuberculosis | Adults | Negative | India | Random forest using 360 selected target genes | 113 | 76 | 189 |
| NPC2 | 1 | NA | Tuberculosis | Adults | NS | Brazil | Differential expression using genome-wide data | 6 | 28 | 34 |
| Penn-Nicholson6 | 6 | Difference of means | Incipient tuberculosis | Adolescents | Negative | South Africa | SVM-based gene pair models using genome-wide data | 46 | 107 | 153 |
| Qian17 | 17 | Sum of standardised expression | Tuberculosis | Adults | Negative | UK | Differential expression of Nrf2-mediated genes | 16 | 69 | 85 |
| Rajan5 | 5 | Unsigned sums | Tuberculosis | Adults | Positive | Uganda | Differential expression using genome-wide data | NS | NS | 80 (1:2 cases:controls) |
| Roe3 | 3 | SVM (linear kernel) | Incipient tuberculosis | Adults | Negative | UK | Stability selection using genome-wide data | 46 | 31 | 77 |
| Roe4 | 4 | SVM (linear kernel) | Tuberculosis | Adults | Negative | UK | SVM using genome-wide data | 23 | 35 | 58 |
| Roe5 | 5 | SVM (linear kernel) | Tuberculosis | Adults | Negative | UK | SVM using genome-wide data | 23 | 50 | 73 |
| Singhania20 | 20 | Modified disease risk score | Tuberculosis | Adults | Negative | UK, South Africa | Random forest using modular approach | NS | NS | NS |
| Suliman2 | 2 | ANKRD22 −OSBPL10 | Incipient tuberculosis | Adults | Negative | The Gambia, South Africa | Pair ratios algorithm using genome-wide data | 79 | 328 | 407 |
| Suliman4 | 4 | (GAS6 + SEPT4) – (CD1C + BLK) | Incipient tuberculosis | Adults | Negative | The Gambia, South Africa, Ethiopia | Pair ratios algorithm using genome-wide data | 45 | 141 | 186 |
| Sweeney3 | 3 | (GBP5 + DUSP3)/2 −KLF2 | Tuberculosis | Adults | Positive or negative | Meta-analysis of South Africa, Malawi, UK, France, USA | Significance thresholding and forward search in genome-wide data | 296 | 727 | 1023 |
| Walter46 | 51 | SVM (linear kernel) | Tuberculosis | Adults | Negative | USA | SVM using genome-wide data | 24 | 24 | 48 |
| Walter32 | 47 | SVM (linear kernel) | Tuberculosis | Adults | Negative | USA | SVM using genome-wide data | 24 | 24 | 48 |
| Walter101 | 119 | SVM (linear kernel) | Tuberculosis | Adults | Negative | USA | SVM using genome-wide data | 24 | 48 | 72 |
| Zak16 | 16 | SVM (linear kernel) | Incipient tuberculosis | Adolescents | Negative | South Africa | SVM-based gene pair models using genome-wide data | 37 | 77 | 114 |
Signatures were identified by systematic literature review and included for analysis. Signature names represent the first author's name of the corresponding publication, suffixed with the number of constituent genes that are present in the current RNAseq dataset. Both Anderson signatures resulted in the same number of final genes; these signatures were therefore additionally appended with the comparator control group. Details on how models were recreated are in appendix 1 (pp 2-4). LTBI=latent tuberculosis infection. OD=other diseases. NA=not applicable. HC=healthy controls. SVM=support vector machine. LASSO=least absolute shrinkage and selection operator. FS-PLS=forward selection-partial least squares. NS=not specified. Nrf2=nuclear factor, erythroid 2-like 2. PLHIV=people living with HIV.
Figure 2Tuberculosis scores of the four transcriptional signatures with the highest diagnostic accuracy overall and stratified by HIV status
Red lines represent the score threshold of the maximal Youden index, identified from analysis of all patients. The score difference between individuals with and without tuberculosis was significant for all four signatures in both the total cohort and in HIV-stratified cohort subsets (Mann-Whitney test p<0·0001).
Figure 3ROC curves for the four transcriptional signatures with the highest diagnostic accuracy in HIV-infected versus HIV-uninfected patients
Shaded areas represent the 95% CI of the ROC curve sensitivities, plotted at 1% specificity intervals (red shading represents HIV-infected patients and blue shading represents HIV-uninfected patients). AUROC values are reported with 95% CIs in brackets. p values are derived from pairwise comparison of ROC curves, using DeLong tests. AUROC values and CIs are also in appendix 1 (p 10). ROC=receiver operating characteristic. AUROC=area under the ROC curve.
Performance metrics of the four candidate blood transcriptional signatures with the highest diagnostic accuracy
| BATF2 | 87·0% (75·6–93·6) | 79·5% (71·7–85·6) | 64·4% (52·9–74·4) | 93·5% (87·2–96·8) |
| Kaforou25 | 74·1% (61·1–83·9) | 89·8% (83·3–93·9) | 75·5% (62·4–85·1) | 89·1% (82·5–93·4) |
| Roe3 | 90·7% (80·1–96·0) | 74·0% (65·8–80·9) | 59·8% (48·9–69·7) | 94·9% (88·7–97·8) |
| Sweeney3 | 87·0% (75·6–93·6) | 85·0% (77·8–90·2) | 71·2% (59·4–80·7) | 93·9% (88·0–97·0) |
| BATF2 | 90% | 59·8% (51·1–68·0) | 48·8% (39·2–58·5) | 93·4% (85·8–97·0) |
| Kaforou25 | .. | 62·2% (53·5–70·2) | 50·3% (40·5–60·1) | 93·6% (86·3–97·2) |
| Roe3 | .. | 74·0% (65·8–80·9) | 59·6% (48·7–69·5) | 94·6% (88·2–97·6) |
| Sweeney3 | .. | 75·6% (67·4–82·2) | 61·1% (50·1–71·0) | 94·7% (88·5–97·6) |
| BATF2 | 88·9% (77·8–94·8) | 70% | 55·7% (45·2–65·8) | 93·7% (86·9–97·1) |
| Kaforou25 | 83·3% (71·3–91·0) | .. | 54·2% (43·5–64·4) | 90·8% (83·4–95·1) |
| Roe3 | 90·7% (80·1–96·0) | .. | 56·3% (45·8–66·2) | 94·7% (88·1–97·7) |
| Sweeney3 | 90·7% (80·1–96·0) | .. | 56·3% (45·8–66·2) | 94·7% (88·1–97·7) |
| BATF2 | 95% | 25·2% (18·5–33·4) | 35·1% (27·8–43·1) | 92·2% (78·6–97·5) |
| Kaforou25 | .. | 28·3% (21·2–36·7) | 36·1% (28·6–44·2) | 93·0% (80·6–97·7) |
| Roe3 | .. | 13·4% (8·5–20·4) | 31·8% (25·1–39·3) | 86·3% (65·3–95·5) |
| Sweeney3 | .. | 54·3% (45·7–62·7) | 46·9% (37·8–56·2) | 96·2% (89·0–98·8) |
| BATF2 | 85·2% (73·4–92·3) | 80% | 64·4% (52·8–74·5) | 92·7% (86·3–96·3) |
| Kaforou25 | 81·5% (69·2–89·6) | .. | 63·4% (51·6–73·8) | 91·0% (84·3–95·1) |
| Roe3 | 79·6% (67·1–88·2) | .. | 62·9% (51·0–73·3) | 90·2% (83·4–94·5) |
| Sweeney3 | 88·9% (77·8–94·8) | .. | 65·4% (54·0–75·3) | 94·4% (88·4–97·4) |
| BATF2 | 65% | 85·8% (78·7–90·8) | 66·1% (52·7–77·4) | 85·2% (78·0–90·3) |
| Kaforou25 | .. | 92·1% (86·1–95·7) | 77·8% (63·8–87·5) | 86·1% (79·3–90·9) |
| Roe3 | .. | 92·1% (86·1–95·7) | 77·8% (63·8–87·5) | 86·1% (79·3–90·9) |
| Sweeney3 | .. | 93·7% (88·1–96·8) | 81·4% (67·4–90·3) | 86·3% (79·6–91·1) |
| BATF2 | 53·7% (40·6–66·3) | 98% | 91·9% (77·3–97·4) | 83·3% (76·5–88·4) |
| Kaforou25 | 31·5% (20·7–44·7) | .. | 87·0% (66·0–95·8) | 77·1% (70·0–82·9) |
| Roe3 | 33·3% (22·2–46·6) | .. | 87·6% (67·4–96·1) | 77·6% (70·5–83·3) |
| Sweeney3 | 44·4% (32–57·6) | .. | 90·4% (73·7–97·0) | 80·6% (73·6–86·0) |
Data are % (95% CI). WHO defines target product profile criteria for a tuberculosis triage test as minimum 90% sensitivity and 70% specificity, optimum 95% sensitivity and 80% specificity, and for a confirmatory test as minimum 98% specificity and 65% sensitivity. PPV=positive predictive value. NPV=negative predictive value.
Figure 4ROC curves of the four transcriptional signatures with the highest diagnostic accuracy benchmarked against WHO target product profile criteria
(A) Blue shaded areas represent the 95% CIs of the ROC curve sensitivities, plotted at 1% specificity intervals. AUROC values are reported with 95% CIs in brackets. (B) ROC curves are replicated with restricted y axes, and benchmarked against target criteria for a tuberculosis triage test. Minimum criteria (90% sensitivity, 70% specificity) are indicated by the dashed black boxes, optimum criteria (95% sensitivity, 80% specificity) are indicated by the blue boxes. Light blue shaded areas represent the 95% CIs. (C) ROC curves are replicated with restricted x axes and benchmarked against minimum criteria for a confirmatory test (dashed black box: 65% sensitivity, 98% specificity). Light blue shaded areas represent the 95% CIs. ROC=receiver operating characteristic. AUROC=area under the ROC curve.
Figure 5Tuberculosis scores of the four transcriptional signatures with the highest diagnostic accuracy in patients with Ultra-positive results
Patients with Ultra-positive results were grouped as true-positive tuberculosis (culture-positive or Xpert-positive) and false-positive non-tuberculosis (culture-negative or Xpert-negative). (A) Pie charts representing the proportion of Ultra-positive patients with tuberculosis and individuals without tuberculosis with a history of previous tuberculosis disease. (B) Scores of the four transcriptional signatures with the highest diagnostic accuracy in patients with Ultra-positive results. Red dots indicate patients for whom only traces of Mycobacterium tuberculosis were detected by Ultra analysis. Dashed lines represent the score thresholds of the maximum Youden index, identified from analysis of all patients. The score difference between patients with and without tuberculosis was significant for all four signatures (Mann-Whitney test p<0·0001).
Sensitivity and specificity of a diagnostic algorithm combining the sputum Ultra test with blood transcriptional signature analysis
| All Ultra-positive individuals | Ultra trace-positive individuals | Ultra-positive individuals with previous tuberculosis | ||
|---|---|---|---|---|
| BATF2 | 82% (69–90) | 76% (63–86) | 82% (69–90) | 80% (67–89) |
| Kaforou25 | .. | 66% (52–78); p=0·013 | 82% (69–90) | 74% (60–84) |
| Roe3 | .. | 78% (65–87) | 82% (69–90) | 80% (67–89) |
| Sweeney3 | .. | 76% (63–86) | 80% (67–89) | 78% (65–87) |
| BATF2 | 90% (83–95) | 99% (95–100); p=0·0077 | 95% (89–98) | 96% (91–98); p=0·041 |
| Kaforou25 | .. | 100% (96–100); p=0·0044 | 96% (91–98); p=0·041 | 97% (92–99); p=0·023 |
| Roe3 | .. | 99% (95–100); p=0·0077 | 95% (89–98) | 96% (91–98); p=0·041 |
| Sweeney3 | .. | 100% (96–100); p=0·0044 | 96% (91–98); p=0·041 | 97% (92–99); p=0·023 |
Data are % (95% CI). Only significant p values are shown. Sensitivity and specificity were calculated for 154 patients with Ultra results, with or without reclassification of selected Ultra-positive tests by transcriptional signatures. p value of comparison with Ultra alone using McNemar's test.