| Literature DB >> 34315930 |
Abstract
Alzheimer's disease (AD) is a complex and heterogeneous disease that can be affected by various genetic factors. Although the cause of AD is not yet known and there is no treatment to cure this disease, its progression can be delayed. AD has recently been recognized as a brain-specific type of diabetes called type 3 diabetes. Several studies have shown that people with type 2 diabetes (T2D) have a higher risk of developing AD. Therefore, it is important to identify subgroups of patients with AD that may be more likely to be associated with T2D. We here describe a new approach to identify the correlation between AD and T2D at the genetic level. Subgroups of AD and T2D were each generated using a non-negative matrix factorization (NMF) approach, which generated clusters containing subsets of genes and samples. In the gene cluster that was generated by conventional gene clustering method from NMF, we selected genes with significant differences in the corresponding sample cluster by Kruskal-Wallis and Dunn-test. Subsequently, we extracted differentially expressed gene (DEG) subgroups, and candidate genes with the same regulation direction can be extracted at the intersection of two disease DEG subgroups. Finally, we identified 241 candidate genes that represent common features related to both AD and T2D, and based on pathway analysis we propose that these genes play a role in the common pathological features of AD and T2D. Moreover, in the prediction of AD using logistic regression analysis with an independent AD dataset, the candidate genes obtained better prediction performance than DEGs. In conclusion, our study revealed a subgroup of patients with AD that are associated with T2D and candidate genes associated between AD and T2D, which can help in providing personalized and suitable treatments.Entities:
Mesh:
Year: 2021 PMID: 34315930 PMCID: PMC8316581 DOI: 10.1038/s41598-021-94048-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Procedure for relation extraction between two diseases. (a) Gene expression data of AD and T2D patients are given. (b) The convex non-negative matrix factorization (NMF) is used to decompose the input expression matrices. Gene and sample clusters are obtained from the NMF decomposed matrices. (c) DEG genes are assigned to clusters. (d) Common candidate genes are extracted from a related cluster pair between two diseases. (b) is generated by the R software (R version 3.6.1, https://www.r-project.org/) using an example dataset.
Figure 2Cophenetic correlation coefficients for the consensus matrix in (a) Alzheimer’s disease (AD) and (b) type 2 diabetes (T2D) datasets.
Clustered samples and genes for (a) Alzheimer’s disease (AD) and (b) type 2 diabetes (T2D).
| Cluster | |||
|---|---|---|---|
| 1 | 93 | 3004 | 1989 |
| 2 | 90 | 1242 | 1742 |
| 3 | 102 | 1129 | 748 |
Figure 3Heatmap of gene clustering method in Alzheimer’s disease (AD) and type 2 diabetes (T2D). (a) and (b) show genes and sample clusters from AD patients. (c) and (d) show genes and sample clusters from T2D patients. Top panels show clusters obtained using the basic “Max” ((a) and (c)) and “Min” ((b) and (d)) methods. Bottom panels show clusters obtained using the Kruskal–Wallis and Denn test for relatively upregulated genes ((a) and (c)) and relatively downregulated genes ((b) and (d)). This figure is generated by the R software (R version 3.6.1, https://www.r-project.org/) and python (version 3.6.8, https://www.python.org/).
Number of DEGs in each cluster.
| Cluster | |||
|---|---|---|---|
| 1 | 93 | 1281 | 825 |
| 2 | 90 | 1443 | 2422 |
| 3 | 102 | 1482 | 1670 |
Numbers of type 2 diabetes (T2D)-related pathways for each Alzheimer’s disease (AD) differentially expressed gene module.
| AD | Enriched pathways | Common pathways with 1757 T2D-related pathways |
|---|---|---|
| 16 | 0 | |
| 10 | 5 | |
| 30 | 9 | |
| 101 | 4 | |
| 205 | 119 | |
| 164 | 8 |
Numbers of Alzheimer’s disease (AD)-related pathways in each type 2 diabetes (T2D) differentially expressed gene module.
| T2D | Enriched pathways | Common pathways with 1675 AD-related pathways |
|---|---|---|
| 4 | 0 | |
| 5 | 5 | |
| 0 | 0 | |
| 0 | 0 | |
| 135 | 31 | |
| 87 | 16 |
Numbers of common pathways between disease subgroup pairs and DigSee.
| AD | T2D | Pathways from common genes | Common pathways with DigSee |
|---|---|---|---|
| 0 | 0 | ||
| 2 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 23 | 3 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 0 | 0 | ||
| 1 | 0 | ||
| 21 | 12 | ||
| 18 | 7 |
Common pathways between candidate genes and DigSee genes.
| Pathways | Adjusted | |
|---|---|---|
| GO: Positive regulation of cell-cell adhesion | 3.79E−09 | 2.85E−05 |
| KEGG: Pancreatic cancer | 2.17E-07 | 4.03E−05 |
| GO: T cell selection | 5.83E-09 | 4.39E−05 |
| GO: Positive T cell selection | 7.33E−09 | 5.52E−05 |
| GO: T cell differentiation | 5.64E−08 | 0.000425056 |
| KEGG: Adherens junction | 3.57E−06 | 0.000663158 |
| KEGG: Epithelial cell signaling in Helicobacter pylori infection | 2.23E−05 | 0.004142846 |
| KEGG: Chemokine signaling pathway | 2.58E−05 | 0.004806622 |
| KEGG: JAK-STAT signaling pathway | 2.62E−05 | 0.004876146 |
| GO: Vacuolar lumen | 1.12E−05 | 0.011236912 |
| KEGG: T cell receptor signaling pathway | 6.41E−05 | 0.011931807 |
| KEGG: Colorectal cancer | 0.000123344 | 0.022942068 |
| KEGG: Neurotrophin signaling pathway | 0.000188801 | 0.035116995 |
| KEGG: Fc gamma R-mediated phagocytosis | 0.00020364 | 0.037877102 |
Performance of classification of Alzheimer’s disease (AD) and controls in the ADNI cohort using different sets of genes. () represents the common genes between the AD module and type 2 diabetes (T2D) module used for classification.
| (AD | AUC |
|---|---|
| ( | 0.5173 |
| ( | 0.6034 |
| ( | 0.5810 |
| ( | 0.5264 |
| ( | 0.5763 |
| ( | 0.6256 |
| ( | 0.5411 |
| ( | 0.6022 |
| ( | 0.6906 |
Figure 4Performance comparison of Alzheimer’s disease (AD) classification from controls based on the area under the curve (AUC) using different gene sets.