| Literature DB >> 35957686 |
Danyang Li1, Lifang Li1, Fei Quan2, Tianfeng Wang1, Si Xu1, Shuang Li1, Kuo Tian1, Meng Feng1, Ni He1, Liting Tian1, Biying Chen1, Huixue Zhang1, Lihua Wang1, Jianjian Wang1.
Abstract
Ischemic stroke (IS) is a high-incidence disease that seriously threatens human life and health. Neuroinflammation and immune responses are key players in the pathophysiological processes of IS. However, the underlying immune mechanisms are not fully understood. In this study, we attempted to identify several immune biomarkers associated with IS. We first retrospectively collected validated human IS immune-related genes (IS-IRGs) as seed genes. Afterward, potential IS-IRGs were discovered by applying random walk with restart on the PPI network and the permutation test as a screening strategy. Doing so, the validated and potential sets of IS-IRGs were merged together as an IS-IRG catalog. Two microarray profiles were subsequently used to explore the expression patterns of the IS-IRG catalog, and only IS-IRGs that were differentially expressed between IS patients and controls in both profiles were retained for biomarker selection by the Random Forest rankings. CLEC4D and CD163 were finally identified as immune biomarkers of IS, and a classification model was constructed and verified based on the weights of two biomarkers obtained from the Neural Network algorithm. Furthermore, the CIBERSORT algorithm helped us determine the proportions of circulating immune cells. Correlation analyses between IS immune biomarkers and immune cell proportions demonstrated that CLEC4D was strongly correlated with the proportion of neutrophils (r = 0.72). These results may provide potential targets for further studies on immuno-neuroprotection therapies against reperfusion injury.Entities:
Keywords: biomarker; immune; ischemic stroke; machine learning; neuroinflammation
Year: 2022 PMID: 35957686 PMCID: PMC9358692 DOI: 10.3389/fgene.2022.921582
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Information of GEO datasets.
| ID | Sample type | Platform | IS sample | Control sample |
|---|---|---|---|---|
| Training set | ||||
| GSE16561 | Peripheral whole blood | GPL6883 | 39 | 24 |
| GSE58294 | Peripheral whole blood | GPL570 | 69 | 23 |
| Test set | ||||
| GSE102541 | Peripheral whole blood | GPL22755 | 6 | 3 |
FIGURE 1Items of GO and KEGG enrichment analysis. (A) GO analysis for validated IS-IRGs. (B) KEGG analysis for validated IS-IRGs. (C) GO analysis for the IS-IRG catalog. (D) KEGG analysis for the IS-IRG catalog. GO, Gene Ontology; BP, biological process; CC, cellular component; MF, molecular function; KEGG, Kyoto Encyclopedia of Genes and Genomes; IS-IRGs, ischemic stroke-immune related genes.
FIGURE 2Identification of DEGs and IS immune-related DEGs. (A,C) Heatmap showing the differences in DEGs between ischemic stroke patients and controls in the microarray dataset. (A) represents the GSE16561 data set and (C) represents the GSE58294 data set. (B,D) Volcano plot showing the differences in DEGs between ischemic stroke patients and controls in the microarray dataset. (B) represents GSE16561 data set and (D) represents GSE58294 data set. (E) Venn diagram representation of the intersection of the following three sets: DEGs in the GSE16561 dataset, DEGs in the GSE58294 dataset, and IS-IRGs catalog.
FIGURE 3Enrichment and correlation analyses of potential IS immune biomarkers. (A) GO analysis of potential IS immune biomarkers. (B) KEGG analysis of potential IS immune biomarkers. (C,D) Heatmap showing the correlation analysis of potential IS immune biomarkers in the GSE16561 and GSE58294 datasets. The correlation coefficient (Corr) was determined via Pearson’s correlation analysis. Blank spaces indicate that correlations were not statistically significant (p > 0.05).
FIGURE 4Screening of Key Features of IS immune biomarkers. (A,B) Random Forest model screening for key features in the GSE16561 dataset. (C,D) Random Forest model screening for key features in the GSE58294 dataset. (E) Venn diagram showing the intersection of key features between the GSE16561 and GSE58294 datasets.
FIGURE 5The Diagnostic Value of Three Key Immune-Related Biomarkers in IS. (A,B) Box plot analysis showing the expression values of IS immune-related biomarkers in IS patients vs controls. (The p-value was obtained from the t-test. ****p < 0.0001.) (C) Neural Network model constructed using GSE16561. (D) Neural Network model constructed using GSE58294. (E) ROC curves of the Neural Network diagnostic model constructed using GSE16561. (F) ROC curves of the Neural Network diagnostic model constructed using GSE58294. (G) Validation of the Neural Network model in the GSE102541 dataset.
FIGURE 6Peripheral Blood Immune Cell Infiltration in IS. (A) Violin plot of 22 kinds of immune cells’ differentially infiltrated fractions in peripheral blood between healthy controls and IS patients. (B) Heatmap showing the correlation between IS immune-related biomarkers and 22 circulating immune cells. Correlation coefficients were calculated using Spearman’s correlation analysis. The correlation was interpreted primarily according to the magnitude of the correlation coefficient: Corr >0.70 indicates a strong correlation; Corr of 0.50–0.70 indicates a moderate correlation; Corr of 0.30–0.50 indicates a weak-moderate correlation, and Corr <0.30 indicates a weak correlation.