| Literature DB >> 36213445 |
Hongyu Sun1, Jin Yang1, Xiaohui Li1, Yi Lyu1, Zhaomeng Xu1, Hui He1, Xiaomin Tong1, Tingyu Ji1, Shihan Ding1, Chaoli Zhou1, Pengyong Han2, Jinping Zheng1,2,3.
Abstract
While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search keywords, Homo sapiens for species selection, and established a sample size of > 20 for each data set, and each data set contains Including the normal group and AD group. The datasets GSE15222 and GSE138260 were combined as a training group to build a model, and GSE122063 was used as a test group to verify the model's accuracy. The genes with differential expression found in the combined datasets were used for analysis through Gene Ontology (GO) and The Kyoto Encyclopedia of Genes and Genome Pathways (KEGG). Then, AD-related module genes were identified using the combined dataset through a weighted gene co-expression network analysis (WGCNA). Both the differential and AD-related module genes were intersected to obtain AD key genes. These genes were first filtered through LASSO regression and then AD-related feature genes were obtained for subsequent immune-related analysis. A comprehensive analysis of three AD-related datasets in the GEO database revealed 111 common differential AD genes. In the GO analysis, the more prominent terms were cognition and learning or memory. The KEGG analysis showed that these differential genes were enriched not only in In the KEGG analysis, but also in three other pathways: neuroactive ligand-receptor interaction, cAMP signaling pathway, and Calcium signaling pathway. Three AD-related feature genes (SST, MLIP, HSPB3) were finally identified. The area under the ROC curve of these AD-related feature genes was greater than 0.7 in both the training and the test groups. Finally, an immune-related analysis of these genes was performed. The finding of AD-related feature genes (SST, MLIP, HSPB3) could help predict the onset and progression of the disease. Overall, our study may provide significant guidance for further exploration of potential biomarkers for the diagnosis and prediction of AD.Entities:
Keywords: Alzheimer's disease; GEO; WGCNA; bioinformatics; predict model
Year: 2022 PMID: 36213445 PMCID: PMC9536257 DOI: 10.3389/fncom.2022.1001546
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
Figure 1The workflow of data preparation, processing, analysis, and validation.
Figure 2Clustering dendrogram of 399 samples.
Figure 3(A) Merged dataset differential gene volcano plot. (B) Merged dataset differential gene heatmap.
Figure 4GO and KEGG analyses of merged dataset of differential genes. (A) GO analysis of merged dataset of differential genes. BP, biological process; CC, cellular components; MF, molecular function. (B) KEGG analysis of merged dataset of differential genes.
Figure 5(A) Analysis of the scale-free index for various soft-threshold powers (β). (B) Analysis of the mean connectivity for various soft-threshold powers. (C) Identification of co-expression gene modules. (D) A heatmap showing the correlation between each module eigengene and phenotype.
Figure 6(A) Venn diagram show the intersection of differential genes from the merged dataset and hub genes derived from WGCNA. (B,C) LASSO regression screened the best AD-related feature genes.
The characteristics of the AD-related feature genes.
|
|
|
| ||
|---|---|---|---|---|
|
|
|
| ||
| SST | Somatostatin | −2.164 | 2.34E-28 | 7.32E-25 |
| MLIP | Muscular LMNA-interacting protein | −1.746 | 9.61E-19 | 1.15E-16 |
| HSPB3 | Heat shock protein beta-3 | −1.499 | 1.04E-18 | 1.22E-16 |
Figure 7AD-related feature transcription factor analysis.
Figure 8(A) Expression of AD-related feature genes in Training group. (B) Expression of AD-related feature genes in Training group. ***P <0.001. (C,D) ROC curves of AD-related feature genes in Training group and Test group.
Figure 9(A) Immune-related function heatmap. (B) violin diagram of AD-related differential immune cell. (C) Heatmap of correlations between AD-related feature genes and immune cells.