| Literature DB >> 35456421 |
Sudhakar Natarajan1, Mohan Ranganathan1, Luke Elizabeth Hanna1, Srikanth Tripathy1,2.
Abstract
Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (M.tb.). Our integrative analysis aims to identify the transcriptional profiling and gene expression signature that distinguish individuals with active TB (ATB) disease, and those with latent tuberculosis infection (LTBI). In the present study, we reanalyzed a microarray dataset (GSE37250) from GEO database and explored the data for differential gene expression analysis between those with ATB and LTBI derived from Malawi and South African cohorts. We used BRB array tool to distinguish DEGs (differentially expressed genes) between ATB and LTBI. Pathway enrichment analysis of DEGs was performed using DAVID bioinformatics tool. The protein-protein interaction (PPI) network of most upregulated genes was constructed using STRING analysis. We have identified 375 upregulated genes and 152 downregulated genes differentially expressed between ATB and LTBI samples commonly shared among Malawi and South African cohorts. The constructed PPI network was significantly enriched with 76 nodes connected to 151 edges. The enriched GO term/pathways were mainly related to expression of IFN stimulated genes, interleukin-1 production, and NOD-like receptor signaling pathway. Downregulated genes were significantly enriched in the Wnt signaling, B cell development, and B cell receptor signaling pathways. The short-listed DEGs were validated in a microarray data from an independent cohort (GSE19491). ROC curve analysis was done to assess the diagnostic accuracy of the gene signature in discrimination of active and latent tuberculosis. Thus, we have derived a seven-gene signature, which included five upregulated genes FCGR1B, ANKRD22, CARD17, IFITM3, TNFAIP6 and two downregulated genes FCGBP and KLF12, as a biomarker for discrimination of active and latent tuberculosis. The identified genes have a sensitivity of 80-100% and specificity of 80-95%. Area under the curve (AUC) value of the genes ranged from 0.84 to 1. This seven-gene signature has a high diagnostic accuracy in discrimination of active and latent tuberculosis.Entities:
Keywords: active TB; bioinformatics; biomarkers; differentially expressed genes; latent TB infection; tuberculosis
Mesh:
Substances:
Year: 2022 PMID: 35456421 PMCID: PMC9032611 DOI: 10.3390/genes13040616
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1(A) Volcano plot illustrating the identification of differentially expressed genes between ATB and LTBI in the Malawi and South African cohort (differentially expressed genes were distinguished with parameter −log10 p value in y axis and log2 fold change in x axis). Significant results were determined based on cut off range, p value < 0.01 and >1.5-fold change (B) Venn diagram demonstrating the intersection of differentially expressed overlapped or common genes, in Malawi and South African cohort derived from GSE37250 (Figure S1).
(A) Top 10 upregulated genes in ATB vs. LTBI. (B) Top 10 downregulated genes in ATB vs. LTBI.
|
| ||||
|
|
|
|
|
|
| 1 |
| ILMN_2176063 | 8.20 ± 1.14 | 1 × 10−7 |
| 2 |
| ILMN_2391051 | 9.80 ± 2.80 | 1 × 10−7 |
| 3 |
| ILMN_3247506 | 6.11 ± 2.24 | 1 × 10−7 |
| 4 |
| ILMN_1690241 | 12.4 ± 2.39 | 1 × 10−7 |
| 5 |
| ILMN_2148785 | 3.63 ± 0.32 | 1 × 10−7 |
| 6 |
| ILMN_1799848 | 10.25 ± 3.34 | 1 × 10−7 |
| 7 |
| ILMN_2114568 | 3.53 ± 1.03 | 1 × 10−7 |
| 8 |
| ILMN_1681301 | 4.10 ± 0.22 | 1 × 10−7 |
| 9 |
| ILMN_1756953 | 7.43 ± 2.82 | 1 × 10−7 |
| 10 |
| ILMN_1778059 | 4.40 ± 1.89 | 1 × 10−7 |
|
| ||||
|
|
|
|
|
|
| 1 |
| ILMN_2361603 | 2.17 ± 0.00 | 1 × 10−7 |
| 2 |
| ILMN_1762801 | 2.77 ± 0.00 | 1.4 × 10−6 |
| 3 |
| ILMN_1798679 | 4.05 ± 0.68 | 1 × 10−7 |
| 4 |
| ILMN_1659227 | 3.80 ± 0.50 | 1 × 10−7 |
| 5 |
| ILMN_2302757 | 3.64 ± 0.28 | 1 × 10−7 |
| 6 |
| ILMN_1710734 | 2.22 ± 0.13 | 1 × 10−7 |
| 7 |
| ILMN_1797975 | 2.27 ± 0.07 | 1 × 10−7 |
| 8 |
| ILMN_1815923 | 2.06 ± 0.44 | 4.2 × 10−6 |
| 9 |
| ILMN_2337928 | 2.86 ± 0.23 | 1 × 10−7 |
| 10 |
| ILMN_1688959 | 2.15 ± 0.32 | 1 × 10−7 |
Gene ontology and pathway enrichment analysis of upregulated genes.
| S.No | Term/Pathway | Genes | |
|---|---|---|---|
| GENE ONTOLOGY | |||
| 1 | GO:0045087 ~ innate immune response | 1.6 × 10−10 | |
| 2 | GO:0006955 ~ immune response | 1 × 10−6 | |
| 3 | GO:0050900 ~ leukocyte migration | 1 × 10−5 | |
| 4 | GO:0019731 ~ antibacterial humoral response | 3 × 10−5 | |
| 5 | GO:0042742 ~ defense response to bacterium | 3.2 × 10−4 | |
| 6 | GO:0050830 ~ defense response to Gram-positive bacterium | 4.1 × 10−4 | |
| 7 | GO:0002576 ~ platelet degranulation | 8.5 × 10−4 | |
| 8 | GO:0006954 ~ inflammatory response | 9.1 × 10−4 | |
| 9 | GO:0031640 ~ killing of cells of other organism | 0.0014 | |
| 10 | GO:0042981 ~ regulation of apoptotic process | 0.0018 | |
| KEGG PATHWAY | |||
| 1 | has05150:Staphylococcus aureus infection | 3 × 10−6 | |
| 2 | has05140:Leishmaniasis | 1 × 10−5 | |
| 3 | has05152:Tuberculosis | 9.4 × 10−5 | |
| 4 | has05322:Systemic lupus erythematosus | 0.0025 | |
| 5 | has04610:Complement and coagulation cascades | 0.0032 | |
| 6 | has04145:Phagosome | 0.0038 | |
| 7 | has05133:Pertussis | 0.0040 | |
| REACTOME PATHWAY | |||
| 1 | Neutrophil degranulation | 2 × 10−14 | |
| 2 | Immune System | 5.3 × 10−12 | |
| 3 | Innate Immune System | 6.8 × 10−10 | |
| 4 | Interferon Signaling | 8.8 × 10−8 | |
| 5 | Interferon γ signaling | 1.4 × 10−6 | |
| 6 | Fibronectin matrix formation | 1.7 × 10−5 | |
| 7 | RMTs methylate histone arginines | 3.7 × 10−5 | |
| 8 | α-defensins | 6.7 × 10−5 | |
| 9 | Cell surface interactions at the vascular wall | 7.5 × 10−5 | |
| 10 | Cytokine signaling in immune system | 3 × 10−4 | |
Gene ontology and pathway enrichment analysis of downregulated genes.
| Scheme | Term/Pathway | Genes | |
|---|---|---|---|
| GENE ONTOLOGY | |||
| 1 | GO:0016055 ~ Wnt signaling pathway | 0.0012 | |
| 2 | GO:0030154 ~ cell differentiation | 0.0035 | |
| 3 | GO:0006954 ~ inflammatory response | 0.0056 | |
| 4 | GO:0042127 ~ regulation of cell proliferation | 0.0081 |
|
| 5 | GO:0006955 ~ immune response | 0.0093 | |
| 6 | GO:0050853 ~ B cell receptor signaling pathway | 0.022 | |
| 7 | GO:0007275 ~ multicellular organism development | 0.024 | |
| KEGG PATHWAY | |||
| 1 | has05340: primary immunodeficiency | 4.9 × 10−4 | |
| 2 | hsa04060: cytokine-cytokine receptor interaction | 7.6 × 10−4 | |
| 3 | hsa04640: hematopoietic cell lineage | 0.0074 | |
| 4 | has04360: Axon guidance | 0.020 | |
| 5 | hsa04310: Wnt signaling pathway | 0.025 | |
| 6 | hsa04662: B cell receptor signaling pathway | 0.040 | |
| REACTOME PATHWAY | |||
| 1 | Repression of WNT target genes | 0.0062 |
|
| 2 | Loss of proteins required for interphase microtubule organization from the centrosome | 0.0153 | |
| 3 | Loss of Nlp from mitotic centrosomes | 0.0153 | |
| 4 | AURKA Activation by TPX2 | 0.0170 | |
| 5 | TNFs bind their physiological receptors | 0.0204 |
|
| 6 | Recruitment of mitotic centrosome proteins and complexes | 0.0216 | |
| 7 | Centrosome maturation | 0.0230 | |
| 8 | Regulation of PLK1 Activity at G2/M Transition | 0.0299 | |
| 9 | Recruitment of NuMA to mitotic centrosomes | 0.0342 | |
| 10 | Antigen activates B Cell Receptor (BCR) leading to generation of second messengers | 0.0397 | |
Figure 2PPI network displaying the interaction of proteins coded by the upregulated genes derived from ATB vs. LTBI data. (A) Results of STRING analysis (p value < 1 × 10−16). PPI network with 76 nodes connected to 151 edges. (B) Closely connected subnetworks identified by MCODE analysis plugin of cytoscape. Two clusters enriched in top with cluster score above 3 are shown. (C) Functionally enriched edges identified from PPI network using clueGO/cluepedia of cytoscape software. Network connectivity among GO term and pathway determined based on the interaction of functional cluster, edges (kappa score > 0.4), and enriched terms/pathway with p value < 0.05. Functional groups are denoted in different color codes; the most enriched functional term is indicated in bold color. (D) Cytohubba (MCC method) analysis explored the most important hub nodes; nodes in red color indicate a high MCC score, and yellow color node represents a low MCC score.
(A) Closely interlinked regions in the PPI network are clustered by MCODE analysis. (B) Top-ranked hub nodes are categorized based on MCC Method.
|
| ||||
|
|
|
|
|
|
| 1 | 7 | 7 | 21 | |
| 2 | 5 | 13 | 30 | |
|
| ||||
|
|
|
| ||
| 1 |
| 770 | ||
| 2 |
| 750 | ||
| 3 |
| 730 | ||
| 4 |
| 723 | ||
| 5 |
| 722 | ||
| 6 |
| 720 | ||
| 7 |
| 720 | ||
| 8 |
| 76 | ||
| 9 |
| 70 | ||
| 10 |
| 62 | ||
| 11 |
| 52 | ||
| 12 |
| 50 | ||
| 13 |
| 48 | ||
| 14 |
| 43 | ||
| 15 |
| 38 | ||
| 16 |
| 38 | ||
| 17 |
| 33 | ||
| 18 |
| 31 | ||
| 19 |
| 30 | ||
| 20 |
| 24 | ||
Figure 3Validation of DEG in an independent cohort using bioinformatics analysis. *** p < 0.001.
Figure 4Receiver operating characteristic curve (ROC) analysis. ROC curves for upregulated genes FCGR1B, ANKRD22, CARD17, IFITM3, and TNFAIP6 and downregulated genes FCGBP and KLF12.
Sensitivity, specificity, and AUC of seven-gene signature in discriminating active and latent tuberculosis.
| S.No. | Gene | Sensitivity | Specificity (95% CI) | AUC | 95% CI |
|---|---|---|---|---|---|
| 1 |
| 100% | 95% | 1 | 1.000 to 1.000 |
| 2 |
| 90% | 85% | 0.94 | 0.8689 to 1.000 |
| 3 |
| 85% | 90% | 0.96 | 0.9152 to 1.000 |
| 4 |
| 85% | 85% | 0.85 | 0.7261 to 0.9789 |
| 5 |
| 80% | 90% | 0.89 | 0.7831 to 1.000 |
| 6 |
| 75% | 80% | 0.84 | 0.7222 to 0.9678 |
| 7 |
| 80% | 80% | 0.9 | 0.8103 to 0.9947 |
AUC: Area under the ROC curve; CI: Confidence interval.