| Literature DB >> 31921195 |
Sen Wang1, Lei He1, Jing Wu1, Zumo Zhou2, Yan Gao1, Jiazhen Chen1, Lingyun Shao1, Ying Zhang3, Wenhong Zhang1.
Abstract
Mycobacterium tuberculosis (M. tuberculosis) infection in humans can cause active disease or latent infection. However, the factors contributing to the maintenance of latent infection vs. disease progression are poorly understood. In this study, we used a genome-wide RNA sequencing (RNA-seq) approach to identify host factors associated with M. tuberculosis infection status and a novel gene signature that can distinguish active disease from latent infection. By RNA-seq, we characterized transcriptional differences in purified protein derivative (PPD)-stimulated peripheral blood mononuclear cells (PBMCs) among three groups: patients with active tuberculosis (ATB), individuals with latent TB infection (LTBI), and TB-uninfected controls (CON). A total of 401 differentially expressed genes enabled grouping of individuals into three clusters. A validation study by quantitative real-time PCR (qRT-PCR) confirmed the differential expression of TNFRSF10C, IFNG, PGM5, EBF3, and A2ML1 between the ATB and LTBI groups. Additional clinical validation was performed to evaluate the diagnostic performance of these five biomarkers using 130 subjects. The 3-gene signature set of TNFRSF10C, EBF3, and A2ML1 enabled correct classification of 91.5% of individuals, with a high sensitivity of 86.2% and specificity of 94.9%. Diagnostic performance of the 3-gene signature set was validated using a clinical cohort of 147 subjects with suspected ATB. The sensitivity and specificity of the 3-gene set for ATB were 82.4 and 92.4%, respectively. In conclusion, we detected distinct gene expression patterns in PBMCs stimulated by PPD depending on the status of M. tuberculosis infection. Furthermore, we identified a 3-gene signature set that could distinguish ATB from LTBI, which may facilitate rapid diagnosis and treatment for more effective disease control.Entities:
Keywords: RNA sequence; TNFRSF10C; biomarker; latent tuberculosis infection; peripheral blood mononuclear cell; tuberculosis
Year: 2019 PMID: 31921195 PMCID: PMC6930242 DOI: 10.3389/fimmu.2019.02948
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Overall study design and subjects in the Biomarker Identification, Biomarker Validation and Biomarker Application cohorts. The subjects in LTBI and CON group from Biomarker Identification Cohort were recruited from household contacts of ATB patients from a prospective study during 6-year follow up. Subjects in this cohort were assigned to either the training set or the test set randomly. Subjects in training set were selected for RNA-seq. The differentially expressed genes found by RNA-seq were tested in the test set and then validated in an independent Biomarker Validation Cohort by qRT-PCR. The identified diagnostic gene signature was then applied in the clinical-based Biomarker Application Cohort. ATB group, active TB patients; LTBI group, subjects with latent tuberculosis infection; CON group, TB-uninfected controls; NTB group, patients without ATB; qRT-PCR, quantitative real-time PCR; T-SPOT, T-SPOT®. TB test (Oxford Immunotec Ltd, Oxford, UK); AFB, acid-fast bacilli.
General information of participants in the Biomarker Identification, Biomarker Validation, and Biomarker Application Cohorts.
| 28 | 25 | 31 | 51 | 44 | 35 | 147 | |
| Median age (range) | 41 (21–62) | 43 (23–65) | 39 (32–58) | 45 (18–71) | 43 (23–67) | 41 (21–64) | 44 (28–71) |
| Men, | 16 (57.1%) | 13 (52.0%) | 16 (51.6%) | 31 (60.8%) | 25 (56.8%) | 19 (54.3%) | 83 (56.5%) |
| HIV infected, | 0 (0%) | 0 (0%) | 0 (0%) | 1 (2.0%) | 0 (0%) | 0 (0%) | 1 (0.6%) |
| BCG Status | |||||||
| Vaccinated | 19 (67.9%) | 18 (72.0%) | 24 (77.4%) | 36 (70.6%) | 35 (79.5%) | 27 (77.1%) | 109 (74.1%) |
| Unvaccinated | 9 (32.1%) | 7 (28.0%) | 7 (22.6%) | 15 (29.4%) | 9 (20.5%) | 8 (22.9%) | 38 (25.9%) |
| T-SPOT results | |||||||
| Negative | 3 (10.7%) | 0 (0%) | 31 (100%) | 6 (11.8%) | 0 (0%) | 35 (100%) | 24 (35.8%) |
| Positive | 25 (89.3%) | 25 (100%) | 0 (0%) | 45 (88.2%) | 44 (100%) | 0 (0%) | 43 (64.2%) |
| Extrapulmonary TB | 6 (21.4%) | – | – | 16 (31.3%) | – | – | 12 (8.2%) |
| Microbiologic test | |||||||
| AFB positive | 23 (82.1%) | 0 (0%) | 0 (0%) | 41 (80.4%) | 0 (0%) | 0 (0%) | 32 (21.8%) |
| Culture positive | 20 (71.4%) | – | – | 38 (74.5%) | – | 46 (31.3%) | |
ATB, active TB patients; LTBI, subjects with latent TB infection; CON, TB-uninfected controls.
Figure 2RNA sequencing (RNA-seq) analysis results and function enrichment analysis of differentially expressed genes. (A) Unsupervised hierarchical cluster analysis of 401 differentially expressed genes in the pair-wise comparisons. There are 4 samples in each group. Pseudocolors indicate differential expression (red, up-regulation; green, down-regulation; black, no change in expression). (B) Venn diagram of differentially expressed genes in PBMC samples following PPD stimulation with P < 0.05 by Students' t-test and ratio >2.0. (C) Regulatory network built from the differentially expressed genes between ATB and LTBI group. Circle nodes represents genes, while Gray filled rectangle nodes with yellow border color indicate biological processes. For genes, borders of the nodes represent the type of the gene (up regulated in red, down regulated in green), centers of the nodes indicate the gene expression changes, color intensity is proportional to the level of regulation. Genes that not quantified are shown in gray. Protein-protein interactions are depicted as gray solid line, dashed lines show the linkage of gene to related biological processes. Big gray circles indicate main module of biological processes. (D) Gene Ontology (GO) enrichment analysis for differentially regulated genes between ATB and LTBI group. Only the top false discovery rate (FDR) ranked 10 enrichment of GO terms from “biological process” category were listed.
Significantly regulated genes in PPD-stimulated PBMCs in pair-wise comparisons validated in test set from Biomarker Identification Cohort by qRT-PCR.
| IFNG | 6.63 | 2.07 | 3.20 | 0.0341 |
| PGM5 | 3.71 | 1.03 | 3.60 | 0.0213 |
| EBF3 | 3.10 | 0.71 | 4.37 | 0.0151 |
| TNFRSF10C | 0.15 | 1.49 | 0.10 | <0.0001 |
| A2ML1 | 1.82 | 0.52 | 3.50 | 0.0434 |
| IFNG | 2.07 | 0.71 | 2.91 | 0.0115 |
| CXCL10 | 4.43 | 0.55 | 7.99 | 0.0185 |
| TNFRSF10C | 0.15 | 0.47 | 0.33 | 0.0420 |
| ENPP3 | 2.54 | 0.53 | 4.75 | 0.0321 |
| MYBPH | 4.97 | 0.78 | 6.39 | 0.0124 |
| IL26 | 0.73 | 0.22 | 3.38 | 0.0403 |
| GPR146 | 0.26 | 1.25 | 0.21 | 0.0321 |
| VCAN | 0.32 | 0.87 | 0.37 | 0.0476 |
| GPRC5A | 4.61 | 1.05 | 4.39 | 0.0436 |
| GPR64 | 0.81 | 2.45 | 0.33 | 0.0414 |
| A2ML1 | 1.82 | 0.58 | 3.13 | 0.0488 |
| EBF3 | 3.1 | 0.64 | 4.84 | 0.0231 |
| CD1A | 3.21 | 0.85 | 3.78 | 0.0325 |
| HBEGF | 10.93 | 1.74 | 6.30 | 0.0021 |
| ZBED2 | 3.80 | 1.15 | 3.32 | 0.0263 |
| HCAR2 | 5.44 | 1.39 | 3.92 | 0.0302 |
| PDSS1 | 6.73 | 2.35 | 2.86 | 0.0412 |
| KCNJ10 | 0.21 | 0.72 | 0.30 | 0.0485 |
| IFNG | 6.63 | 0.71 | 9.32 | 0.0051 |
| CXCL10 | 4.36 | 0.55 | 7.87 | 0.0134 |
| TNFRSF10C | 1.49 | 0.47 | 3.20 | 0.0355 |
Fold change: the fold change was calculated by dividing gene expression level of PPD-stimulated sample by gene expression level of unstimulated sample by qRT-PCR.
Ratio: the ratio was calculated by dividing mean fold change of one group by mean fold change of another group.
Figure 3Validation of differentially expressed genes between ATB and LTBI group by qRT-PCR in the Biomarker Validation Cohort. (A–E) Representative scatter plots of five discriminatively expressed genes (TNFRSF10C, IFNG, PGM5, EBF3, and A2ML1) between ATB and LTBI were shown detected by qRT-PCR. Error bars in the scatter-dot plots indicate the medians and IQRs fold change of each group following PPD stimulation. Kruskal–Wallis tests with Dunn's post tests were used to compare the differences among three groups. *Significant difference: 0.01< P < 0.05; **Significant difference: 0.001< P < 0.01; ***Significant difference: P < 0.0001. (F) The receiver operating characteristics (ROC) curves for the five genes in discriminating between ATB and LTBI group. The ROC curves were constructed using data from subjects in ATB group as patients and subjects in LTBI group as controls. (G) The diagnostic performance of the 3-gene signature set in discriminating between ATB and LTBI, and between ATB and LTBI+CON in the Biomarker Validation Cohort.
Diagnostic performances of the 3-gene signature set, AFB and T-SPOT in the Biomarker Application Cohort.
| AFB | 44.1 (30/68) | 97.5 (77/79) | 93.8 (30/32) | 67.0 (77/115) | 73.5 (108/147) |
| 32.9–55.9 | 90.7–99.8 | 78.8–99.3 | 57.9–74.9 | 65.8–79.9 | |
| T-SPOT | 86.8 (59/68) | 72.2 (57/79) | 72.8 (59/81) | 86.3 (57/66) | 78.9 (116/147) |
| 76.5–93.1 | 61.4–80.9 | 62.2–81.4 | 75.9–92.9 | 71.6–84.8 | |
| 3-gene set | 82.4 (56/68) | 92.4 (73/79) | 90.3 (56/62) | 85.9 (73/85) | 87.8 (129/147) |
| 71.5–89.8 | 84.1–96.8 | 80.1–95.8 | 76.8–91.9 | 81.4–92.2 | |
| 3-gene set+AFB | 89.7 (61/68) | 91.1 (72/79) | 89.7 (61/68) | 91.1 (72/79) | 90.5 (133/147) |
| 80.0–95.2 | 82.6–95.9 | 79.9–95.2 | 82.6–95.9 | 84.5–94.4 |
PPV, positive predictive value; NPV, negative predictive value.
3-gene set: 3-gene signature set including TNFRSF10C, A2ML1, and EBF3.
3-gene set test was combined with AFB in a serial manner such that a positive result was obtained when either of the test results was positive and a negative result was assigned when both test results were negative.