| Literature DB >> 36203777 |
Yale Jiang1,2,3, Dingyuan Jiang1,4, Ulrich Costabel5, Huaping Dai1,4, Chen Wang1,4,6,7.
Abstract
Sarcoidosis is a granulomatous disease of unknown etiology, immunologically characterized by a Th1 immune response. Transcriptome-wide expression studies in various types of sarcoid tissues contributed to better understanding of disease mechanisms. We performed a systematic database search on Gene Expression Omnibus (GEO) and utilized transcriptomic data from blood and sarcoidosis-affected tissues in a meta-analysis to identify a cross-tissue, cross-platform signature. Datasets were further separated into training and testing sets for development of a diagnostic classifier for sarcoidosis. A total of 690 differentially expressed genes were identified in the analysis among various tissues. 29 of the genes were robustly associated with sarcoidosis in the meta-analysis both in blood and in lung-associated tissues. Top genes included LINC01278 (P = 3.11 × 10-13), GBP5 (P = 5.56 × 10-07), and PSMB9 (P = 1.11 × 10-06). Pathway enrichment analysis revealed activated IFN-γ, IL-1, and IL-18, autophagy, and viral infection response. IL-17 was observed to be enriched in peripheral blood specific signature genes. A 16-gene classifier achieved excellent performance in the independent validation data (AUC 0.711-0.964). This study provides a cross-tissue meta-analysis for expression profiles of sarcoidosis and identifies a diagnostic classifier that potentially can complement more invasive procedures.Entities:
Keywords: IL-17; interferon; machine learning; sarcoidosis; transcriptome
Year: 2022 PMID: 36203777 PMCID: PMC9530451 DOI: 10.3389/fmed.2022.960266
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1Overview of the meta-analysis approach for development and validation of sarcoidosis transcriptome panel.
Datasets used for cross-tissue meta-analysis and sarcoidosis classifier development.
| GEO dataset | Platform | Tissue | Control samples | Sarcoid samples | ID |
| GSE19314 | Affy U133 Plus 2.0 Array | Blood | 20 | 38 | Train1 |
| GSE18781 | Affy U133 Plus 2.0 Array | Blood | 25 | 12 | Train2 |
| GSE42834 | Illumina beadchip | Blood | 113 | 61 | Train3 |
| GSE83456 | Illumina beadchip | Blood | 61 | 49 | Train4 |
| GSE16538 | Affy U133 Plus 2.0 Array | Lung | 6 | 6 | Train5 |
| GSE73394 | Affy Gene 1.0 ST array | BAL | 20 | 26 | Train6 |
| GSE148036 | Illumina HiSeq 3000 | Lung | 5 | 5 | Train7 |
| GSE37912 | Affy Exon 1.0 ST array | Blood | 35 | 39 | Test1 |
| GSE34608 | Agilent microarray | Blood | 18 | 18 | Test2 |
| GSE75023 | Affy U133A 2.0 array | AM | 12 | 15 | Test3 |
| GSE105149 | Affy U133 plus 2.0 array | Lacrimal gland | 7 | 8 | Test4 |
| GSE32887 | Affy U133 plus 2.0 array | Skin | 5 | 26 | Test5 |
| GSE119136 | Affy gene 1.0 ST array | Nasal brushing | 12 | 14 | Test6 |
Affy, Affymetrix Human Genome; AM, alveolar macrophage; Illumina beadchip, Illumina HumanHT-12 V4.0 expression beadchip; Agilent Microarray, Agilent-014850 Whole Human Genome Microarray 4 × 44K G4112F.
FIGURE 2Signature gene identification in blood, lung-associated tissues, and all available tissue types. (A) Venn plot depicted differentially expressed genes in the three sets of meta-analyses, in peripheral blood samples, lung-associated tissues, and in all 13 cross-tissue samples combined, respectively. (B) Pathway enrichment analysis of the top signature genes in the meta-analysis across various tissues. The size of the nodes reflects the enrichment significance of the terms, and the colors denote groups of biological process.
Triple-significant differentially expressed genes (DEGs) in cross-tissue, blood, and lung-associated tissue meta-analyses.
| Gene | Cross-tissue | Blood | Lung | Significance label | |||
|
|
|
| |||||
| logFC | P-adjust | logFC | P-adjust | logFC | P-adjust | ||
|
| −0.38 | 3.11 × 10–13 | −0.35 | 8.72 × 10–06 | −0.32 | 2.65 × 10–03 | //++/?/??/++? |
|
| 1.55 | 5.56 × 10–07 | 1.47 | 4.58 × 10–03 | 1.57 | 3.21 × 10–07 | +++++++/+?+// |
|
| 0.55 | 1.11 × 10–06 | 0.45 | 3.80 × 10–03 | 0.56 | 3.18 × 10–15 | ++++++//++++/ |
|
| 0.60 | 1.35 × 10–06 | 0.44 | 1.20 × 10–03 | 0.74 | 8.43 × 10–06 | //++++//++++/ |
|
| 0.86 | 2.11 × 10–06 | 0.78 | 1.31 × 10–02 | 1.14 | 9.65 × 10–22 | ++++/±+++// |
|
| 0.20 | 2.39 × 10–06 | 0.17 | 1.35 × 10–05 | 0.28 | 1.75 × 10–03 | /////+/+//+// |
|
| 0.13 | 1.69 × 10–05 | 0.16 | 6.52 × 10–03 | 0.13 | 1.90 × 10–02 | //+//+/////// |
|
| 0.48 | 5.99 × 10–05 | 0.54 | 2.11 × 10–05 | 0.34 | 5.85 × 10–09 | ++++/+??++++/ |
|
| −0.22 | 8.74 × 10–05 | −0.22 | 1.76 × 10–02 | −0.24 | 8.38 × 10–05 | /+///+//++/// |
|
| −0.33 | 9.40 × 10–05 | −0.37 | 1.96 × 10–03 | −0.38 | 4.60 × 10–03 | /+++/?/??///? |
|
| 0.56 | 1.01 × 10–04 | 0.51 | 3.50 × 10–02 | 0.70 | 1.48 × 10–02 | //++/+//+++// |
|
| 0.78 | 1.13 × 10–04 | 0.54 | 1.50 × 10–02 | 0.96 | 1.77 × 10–05 | +++++?//++++? |
|
| 0.49 | 1.77 × 10–04 | 0.38 | 1.25 × 10–02 | 0.77 | 1.80 × 10–03 | //++/+/++++// |
|
| 0.42 | 2.67 × 10–04 | 0.46 | 7.30 × 10–03 | 0.31 | 8.73 × 10–09 | /+++++///+++/ |
|
| 0.13 | 3.09 × 10–04 | 0.16 | 3.60 × 10–03 | 0.14 | 5.61 × 10–04 | ///+/+///+/// |
|
| 1.45 | 5.61 × 10–04 | 1.72 | 2.36 × 10–03 | 1.44 | 1.63 × 10–04 | +++++++/+?+// |
|
| 1.24 | 5.61 × 10–04 | 0.99 | 1.79 × 10–02 | 1.34 | 1.17 × 10–06 | +++++++/+++// |
|
| 1.09 | 6.24 × 10–04 | 1.14 | 1.52 × 10–02 | 0.93 | 3.24 × 10–02 | /+++++//+++// |
|
| 0.64 | 9.30 × 10–04 | 0.68 | 7.96 × 10–03 | 0.79 | 7.32 × 10–03 | +++++///+++// |
|
| 0.41 | 1.06 × 10–03 | 0.26 | 4.99 × 10–02 | 0.59 | 2.01 × 10–03 | /+++/++/+++// |
|
| 0.55 | 1.35 × 10–03 | 0.79 | 8.78 × 10–03 | 0.25 | 1.44 × 10–06 | ++++/+//++/+/ |
|
| 0.35 | 1.58 × 10–03 | 0.20 | 1.61 × 10–02 | 0.47 | 2.89 × 10–07 | //+/++///++// |
|
| 0.20 | 1.85 × 10–03 | 0.25 | 3.72 × 10–02 | 0.17 | 2.79 × 10–02 | //+/////+//// |
|
| 0.25 | 2.58 × 10–03 | 0.26 | 1.67 × 10–02 | 0.15 | 3.22 × 10–02 | //++////+/++/ |
|
| 0.19 | 7.84 × 10–03 | 0.31 | 4.11 × 10–14 | 0.17 | 2.10 × 10–02 | //+//+//+//// |
|
| 0.22 | 1.49 × 10–02 | 0.14 | 2.82 × 10–02 | 0.34 | 2.73 × 10–02 | //++?/////+// |
|
| 0.57 | 1.61 × 10–02 | 0.26 | 2.82 × 10–02 | 0.97 | 8.30 × 10–04 | //++/+//+++// |
|
| −0.29 | 1.70 × 10–02 | −0.39 | 1.94 × 10–02 | −0.36 | 1.14 × 10–02 | +++++//-++/// |
|
| 0.35 | 2.69 × 10–02 | 0.36 | 1.83 × 10–08 | 0.46 | 1.44 × 10–02 | +/+++//??++// |
LogFC, log-transformed fold change; P-adjust, FDR-adjusted P-value; *: label of significance in each of the 13 GEO dataset, consistent with the order listed in Table 1.+: P-value < 0.05, fold change direction consistent with cross-tissue meta-analysis; −: P-value < 0.05, fold change direction contradictory with cross-tissue meta-analysis;/: P-value ≥ 0.05; ?: not available.
FIGURE 3Tissue-specific pathways enriched in blood and lung-associated tissues. Enriched pathways specific to peripheral blood and lung-associated tissues. The analysis was performed based on the top differentially expressed genes specific to peripheral blood (A) and lung-associated tissues (B) identified in the results of meta-analyses.
FIGURE 4Cross-tissue classification model for sarcoidosis. (A) Heatmap shows expression profile of the 16 predictors in training sets. Half of the predictors were up-regulated while the others were down-regulated. (B) Importance of the predictors in GBM model. ROC curve of the classifier was shown in training sets (C), and test sets (D).