| Literature DB >> 31364746 |
Yu-Wei Zhang1, Yan Lin1, Hui-Yuan Yu2, Ruo-Nan Tian1, Fan Li1.
Abstract
Mycobacterium tuberculosis (M. tb) is a highly successful pathogen that has co‑existed with humans for 1,000's of years. As the cornerstone of the immune system, macrophages are a key part of innate immunity. They ingest and degrade foreign substances including aging cells and microorganisms, coordinate the inflammatory process, and are the first line of defense against M. tb infection. Recent advances in cellular mycobacteriology have indicated that M. tb uses an remarkably complex strategy to disrupt macrophage function, in order to counteract the antimicrobial mechanisms of the innate and adaptive immune responses, thereby achieving immune escape. With the popularity of microarray technology, a variety of public platforms have provided a variety of gene expression data associated with physiological and disease conditions. Meta‑analysis can systematically and quantitatively analyze multiple independent data concerning the same disease, greatly improving the statistical significance and credibility of the gene expression data analysis performed. In the present study, 6 microarray expression datasets of human acute monocytic leukemia THP‑1 cell line infected by M. tb H37Rv strain were collected from the GEO database. A total of 4 high‑quality datasets were identified using meta‑analysis methods in R language, and 306 differentially expressed genes with statistical significance were obtained. Then, a protein‑protein interaction (PPI) network of these differentially expressed genes was constructed on the Search Tool for the Retrieval of Interacting Genes/Proteins Database online tool and visualized by Cytoscape v. 3.6.1 software. Using CentiScape and MCODE plugin in the Cytoscape software to mine the functional modules associated with M. tb infection process, 32 characteristic genes were identified. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analysis was performed on the 32 characteristic genes, and it was demonstrated that these genes were primarily associated with the type I interferon (IFN) pathway. In the established model of THP‑1‑derived macrophages infected by M. tb, the actual differential expression levels of IFN‑stimulated gene 15 (ISG15), 2'‑5‑oligoadenylate synthetase like (OASL), IFN regulatory factor 7 (IRF7) and DExD/H‑box helicase 58 (DDX58), the first 4 genes of the 32 characteristic genes, were verified by reverse transcription quantitative polymerase chain reaction. The results were consistent with the results of microarray analysis. The association between ISG15, OASL and IRF7 and TB infection was also verified. Although a number of studies have identified that the type I IFN pathway may assist M. tb to achieve immune escape, the present study used a meta‑analysis of microarray data and PPI network analysis to examine some of the novel genes identified in the IFN pathway. The results furthered the understanding of the molecular mechanisms of the TB immune response and provided a novel perspective for future therapeutic goals.Entities:
Mesh:
Year: 2019 PMID: 31364746 PMCID: PMC6713430 DOI: 10.3892/ijmm.2019.4293
Source DB: PubMed Journal: Int J Mol Med ISSN: 1107-3756 Impact factor: 4.101
Characteristics of gene expression datasets from Gene Expression Omnibus database.
| ID | Platform | H37Rv infection group | Control group |
|---|---|---|---|
| GSE17477 | GPL571 | 4 | 4 |
| GSE51029 | GPL4133 | 9 | 9 |
| GSE52819 | GPL6244 | 3 | 3 |
| GSE57028 | GPL16686 | 3 | 6 |
| GSE65714 | GPL10558 | 3 | 3 |
| GSE89391 | GPL6480 | 8 | 21 |
Quality control test of the gene expression datasets.
| Study | IQC | EQC C | QCg C | QCp | AQCg | AQCp | Rank |
|---|---|---|---|---|---|---|---|
| GSE17477 | 4.4 | 3.3 | 25.52 | 146.88 | 6.08 | 94.1 | 1.67 |
| GSE51029 | 4.84 | 3.19 | 21.13 | 127.06 | 2.88 | 79.93 | 2.83 |
| GSE57028 | 4.4 | 3.4 | 17.81 | 57.94 | 4.66 | 29.26 | 3.33 |
| GSE89391 | 4.4 | 2.77 | 19.03 | 106.29 | 3.9 | 64.74 | 3.5 |
| GSE52819 | 3.97 | 0.4 | 16.34 | 151.75 | 3.52 | 85.18 | 3.83 |
| GSE65714 | 0.26 | 0.75 | 0.58 | 0.96 | 0.26 | 0.49 | 5.83 |
IQC, internal quality control; EQC, external quality control; CQCg, consistency quality control of genes; CQCp, consistency quality control of pathways; AQCg, accuracy quality control of genes; AQCp, accuracy quality control of pathways.
Figure 1Principal component analysis results of the 6 gene expression datasets. IQC, internal quality control; EQC, external quality control; CQCg, consistency quality control of genes; CQCp, consistency quality control of pathways; AQCg, accuracy quality control of genes; AQCp, accuracy quality control of pathways.
Figure 2Results of the (A) Fisher, (B) rth-ordered p-value and (C) random effects model analyses in MetaDE software package of the 4 high quality datasets.
Figure 3Venn diagram of the Fisher, roP and REM analysis results. FDR, false discovery rate; roP, rth-ordered p-value; REM, random effects model.
Figure 4Hierarchical cluster analysis of 306 differentially expressed genes in four high quality datasets. (A) GSE17477 dataset. (B) GSE51029 dataset. (C) GSE57028 dataset. (D) GSE89391 dataset. Each row represents mRNA and each column represents a sample. Red indicates higher expression in TB as compared with that in normal subjects. Green indicates low expression in TB compared to that in normal subjects. TB, THP-1 derived macrophages infected by Mycobacterium tuberculosis H37Rv strain; control, normal THP-1 derived macrophages.
Top 20 shared differentially expressed genes identified in the meta-analysis, ranked by average Log2FC.
| A, Top 10 upregulated genes
| ||||
|---|---|---|---|---|
| Symbol | FDR.Fisher | FDR.roP | FDR.REM | Average Log2FC |
| CXCL10 | 1.62×10−19 | 7.52×10−04 | 1.86×10−02 | 4.46 |
| IFI44L | 1.62×10−19 | 5.41×10−19 | 5.74×10−03 | 4.00 |
| RSAD2 | 1.62×10−19 | 5.41×10−19 | 4.35×10−03 | 3.89 |
| ISG20 | 1.62×10−19 | 4.63×10−05 | 2.43×10−19 | 3.59 |
| IFIT2 | 1.62×10−19 | 5.41×10−19 | 2.43×10−19 | 3.58328139 |
| IFIT1 | 1.62×10−19 | 5.41×10−19 | 2.43×10−19 | 3.58 |
| IFIT3 | 1.62×10−19 | 5.41×10−19 | 2.43×10−19 | 3.22 |
| OASL | 1.62×10−19 | 5.41×10−19 | 5.65×10−05 | 3.15 |
| HERC5 | 1.62×10−19 | 5.41×10−19 | 3.74×10−04 | 3.01 |
| IFITM1 | 1.62×10−19 | 5.41×10−19 | 1.96×10−05 | 2.78 |
FDR, false discovery rate; FC, fold change.
Figure 5PPI networks of the DEGs in THP-1-derived macrophages infected by Mycobacterium tuberculosis H37Rv. (A) The PPI network of all DEGs in the meta-analysis, consisting of 275 nodes and 2,038 edges. (B) PPI network extracted from A by Degree Centrality, consisting of 56 nodes and 644 edges. (C) The PPI network extracted from B by MCODE, consisting of 32 nodes and 446 edges. PPI, protein-protein interaction; DEGs, differentially expressed genes.
Results of Degree Centrality and MCODE analysis.
| Symbol | BC | Degree | MCODE_Score | MCODE_Cluster |
|---|---|---|---|---|
| ISG15 | 0.27257001 | 55 | 22.77 | Cluster 1 |
| OASL | 0.05886982 | 42 | 22.77 | Cluster 1 |
| IRF7 | 0.01632719 | 38 | 22.77 | Cluster 1 |
| DDX58 | 0.01029361 | 38 | 22.77 | Cluster 1 |
| IFIH1 | 0.00863645 | 37 | 22.77 | Cluster 1 |
| IFIT1 | 0.00773668 | 37 | 22.77 | Cluster 1 |
| IFIT3 | 0.01031726 | 37 | 22.77 | Cluster 1 |
| HERC5 | 0.01926602 | 36 | 22.77 | Cluster 1 |
| IRF9 | 0.0092487 | 36 | 22.77 | Cluster 1 |
| MX2 | 0.00610183 | 36 | 22.77 | Cluster 1 |
| IFIT2 | 0.00542212 | 35 | 22.77 | Cluster 1 |
| RSAD2 | 0.00566624 | 35 | 22.77 | Cluster 1 |
| UBE2L6 | 0.01919648 | 35 | 21.70666667 | Cluster 2|Cluster 1 |
| IFI35 | 0.00540913 | 34 | 22.77 | Cluster 1 |
| XAF1 | 0.0038621 | 34 | 22.77 | Cluster 1 |
| HERC6 | 0.01571431 | 33 | 22.77 | Cluster 1 |
| IFI44 | 0.00381869 | 33 | 22.77 | Cluster 1 |
| IFI44L | 0.00381869 | 33 | 22.77 | Cluster 1 |
| DDX60 | 0.00236545 | 30 | 22.77 | Cluster 1 |
| RTP4 | 0.00168655 | 30 | 20.69950739 | Cluster 2|Cluster 1 |
| ISG20 | 0.0054854 | 29 | 20.77173913 | Cluster 2|Cluster 1 |
| USP18 | 0.00142348 | 29 | 22.77 | Cluster 1 |
| DHX58 | 0.00224466 | 28 | 21.52615385 | Cluster 2|Cluster 1 |
| SP110 | 0.00130771 | 28 | 21.84057971 | Cluster 2|Cluster 1 |
| CXCL10 | 0.00634646 | 27 | 20.91699605 | Cluster 2|Cluster 1 |
| EIF2AK2 | 0.0022294 | 27 | 21.84057971 | Cluster 2|Cluster 1 |
| BST2 | 0.00201042 | 26 | 20 | Cluster 2|Cluster 1 |
| IFIT5 | 6.43E-04 | 26 | 21.84057971 | Cluster 2|Cluster 1 |
| IFITM1 | 0.00123016 | 26 | 19.42028986 | Cluster 2|Cluster 1 |
| TRIM22 | 0.00324214 | 26 | 18.75324675 | Cluster 2|Cluster 1 |
| PLSCR1 | 7.11E-04 | 21 | 18.90952381 | Cluster 2|Cluster 1 |
| SAMD9 | 5.86E-05 | 20 | 19 | Cluster 2|Cluster 1 |
Figure 6GO and KEGG analysis of the 32 characteristic genes. (A) A total of 35 GO biological pathways were identified (B) A total of 3 KEGG pathways with P<0.05 (with Bonferroni adjustment). GO, Gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Most significant GO pathway of the 32 nodes in the protein-protein interaction network.
| GO ID | Term | P-value | Associated genes |
|---|---|---|---|
| GO:0035455 | Response to interferon-α | 3.23899×10−12 | BST2, EIF2AK2, IFIT2, IFIT3, IFITM1, MX2 |
| GO:0035456 | Response to interferon-β | 8.3886×10−8 | BST2, IFITM1, PLSCR1, XAF1 |
| GO:0060337 | Type I interferon signaling pathway | 5.06394×10−27 | BST2, IFI35, IFIT1, IFIT2, IFIT3, IFITM1, IRF7, IRF9, ISG15, ISG20, MX2, OASL, RSAD2, USP18, XAF1 |
| GO:0051607 | Defense response to virus | 4.21784×10−39 | BST2, CXCL10, DDX58, DDX60, DHX58, EIF2AK2, HERC5, IFI44L, IFIH1, IFIT1, IFIT2, IFIT3, IFIT5, IFITM1, IRF7, IRF9, ISG15, ISG20, MX2, OASL, PLSCR1, RSAD2, RTP4, TRIM22 |
| GO:0045069 | Negative regulation of viral genome replication | 2.22495×10−16 | BST2, EIF2AK2, IFIT1, IFIT5, IFITM1, ISG15, ISG20, OASL, PLSCR1, RSAD2 |
| GO:0032480 | Negative regulation of type I interferon production | 3.69762×10−10 | DDX58, DHX58, HERC5, IFIH1, ISG15, UBE2L6 |
GO, gene ontology.
Most significant KEGG pathway of the 32 nodes in the protein-protein interaction network.
| KEGG ID | Term | P-value | Associated genes identified |
|---|---|---|---|
| KEGG:04622 | Retinoic acid-inducible gene-I-like receptor signaling pathway | 1.0325×10−9 | CXCL10, DDX58, DHX58, IFIH1, IRF7, ISG15 |
| KEGG:04623 | Cytosolic DNA-sensing pathway | 0.000173925 | CXCL10, DDX58, IRF7 |
| KEGG:05164 | Influenza A | 5.76×10−9 | CXCL10, DDX58, EIF2AK2, IFIH1, IRF7, IRF9, RSAD2 |
KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 7Co-localization of M. tb and lysosomes in macrophages (magnification, ×20). Macrophages were infected with FITC-labelled (green) M. tb H37Rv and stained with Lyso Tracker Red99 for identification of the acidic lysosomes. M. tb, Mycobacterium tuberculosis; FITC, fluorescein isothiocyanate.
Figure 8Expression levels of ISG15, OASL, IRF7 and DDX58. Validation is based on the expression levels in infected and uninfected cells, measured by reverse transcription quantitative polymerase chain reaction, and then compared with the microarray results. Experiments were performed in triplicate, *P<0.05 by ANOVA.
Figure 9Receiver operating characteristics curve of the expression of ISG15, OASL, IRF7 and DDX58. AUC, area under the curve.