| Literature DB >> 35280300 |
Keping Chai1, Xiaolin Zhang2, Huitao Tang1, Huaqian Gu1, Weiping Ye1, Gangqiang Wang1, Shufang Chen1, Feng Wan2, Jiawei Liang3, Daojiang Shen1.
Abstract
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system characterized by demyelination, which leads to the formation of white matter lesions (WMLs) and gray matter lesions (GMLs). Recently, a large amount of transcriptomics or proteomics research works explored MS, but few studies focused on the differences and similarities between GMLs and WMLs in transcriptomics. Furthermore, there are astonishing pathological differences between WMLs and GMLs, for example, there are differences in the type and abundance of infiltrating immune cells between WMLs and GMLs. Here, we used consensus weighted gene co-expression network analysis (WGCNA), single-sample gene set enrichment analysis (ssGSEA), and machine learning methods to identify the transcriptomic differences and similarities of the MS between GMLs and WMLs, and to find the co-expression modules with significant differences or similarities between them. Through weighted co-expression network analysis and ssGSEA analysis, CD56 bright natural killer cell was identified as the key immune infiltration factor in MS, whether in GM or WM. We also found that the co-expression networks between the two groups are quite similar (density = 0.79), and 28 differentially expressed genes (DEGs) are distributed in the midnightblue module, which is most related to CD56 bright natural killer cell in GM. Simultaneously, we also found that there are huge disparities between the modules, such as divergences between darkred module and lightyellow module, and these divergences may be relevant to the functions of the genes in the modules.Entities:
Keywords: RNA-seq; WGCNA; multiple sclerosis; random forest; ssGSEA
Year: 2022 PMID: 35280300 PMCID: PMC8907380 DOI: 10.3389/fneur.2022.807349
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Figure 1(A) Volcano plot of DEGs in WM. (B) Heatmap of DEGs in WM. (C) Using the Venn tools to find the overlap genes between downregulated genes in DEGs and genes in black module.
Figure 2(A,C) Heatmap shows the ssGSEA scores of different gene sets in WM (A) and GM (C). (B,D) The bar plot shows the relative importance of features (immune infiltration pathway) in the random forest classification model [(B) WM; (D) GM].
Figure 3WM–GM consensus module construction. (A) Pearson correlation coefficients between the ssGSEA scores and module eigengenes (MEs) in WM dataset; numbers in brackets indicate the corresponding p-values. (B) Pearson correlation coefficients between the ssGSEA scores and module eigengenes in GM dataset; numbers in brackets indicate the corresponding p-values. (C) Pearson correlation coefficients between the ssGSEA scores and consensus module eigengenes; numbers in brackets indicate the corresponding p-values.
Figure 4(A,B) Clustering dendrograms of consensus module eigengenes (MEs) for identifying meta-modules show the presence of similar major branching pattern in WM and GM eigengene networks. (C,F) The heatmap shows the eigengene adjacencies in WM and GM eigengene networks. Each row and column corresponds to an eigengene tagged by consensus module colors. Within each heatmap, red represents high adjacency (positive correlation) and blue represents low adjacency (negative correlation) as represented by the color legend. (D) Bar plot shows the preservation degree of each consensus eigengene as the height of the bar (y-axis), and each colored bar corresponds to the eigengene of the associated consensus module. The high-density value D (preserve WM and GM) = 0.79 indicates the high overall preservation between the WM and GM networks. (E) Adjacency heatmap of the preservation network between WM and GM consensus eigengene networks. The saturation of the red color indicates correlation preservation of WM and GM MEs.
Figure 5(A) Identifying the overlap genes between DEGs and genes in consensus modules. (B) Heatmap shows the expression of the overlapping genes in the midnightblue module. (C) The plot shows the changes of the f1 index with the changes of the max-feature in the training set and test set. (D) The bar plot shows the relative importance of features (genes) in the random forest classification model. (E) The PPI network of important genes via GeneMANIA.