| Literature DB >> 32694537 |
Margherita Squillario1, Giulia Abate2, Federico Tomasi3, Veronica Tozzo3, Annalisa Barla3, Daniela Uberti2.
Abstract
Genome-wide association studies (GWAS) have revealed a plethora of putative susceptibility genes for Alzheimer's disease (AD), with the sole exception of APOE gene unequivocally validated in independent study. Considering that the etiology of complex diseases like AD could depend on functional multiple genes interaction network, here we proposed an alternative GWAS analysis strategy based on (i) multivariate methods and on a (ii) telescope approach, in order to guarantee the identification of correlated variables, and reveal their connections at three biological connected levels. Specifically as multivariate methods, we employed two machine learning algorithms and a genetic association test and we considered SNPs, Genes and Pathways features in the analysis of two public GWAS dataset (ADNI-1 and ADNI-2). For each dataset and for each feature we addressed two binary classifications tasks: cases vs. controls and the low vs. high risk of developing AD considering the allelic status of APOEe4. This complex strategy allowed the identification of SNPs, genes and pathways lists statistically robust and meaningful from the biological viewpoint. Among the results, we confirm the involvement of TOMM40 gene in AD and we propose GRM7 as a novel gene significantly associated with AD.Entities:
Mesh:
Year: 2020 PMID: 32694537 PMCID: PMC7374579 DOI: 10.1038/s41598-020-67699-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The alternative GWAS’s analysis strategy. ADNI-1 and ADNI-2 datasets were analyzed unimputed at the SNP, Gene and Pathway levels using three machine learning methods [i.e., l1l2FS within PALLADIO framework, SKAT and Group Lasso with overlap (w. o.)]. The global signature represents the summary of the single integrated signatures identified within the proposed GWAS strategy.
Figure 2SNP-based results of ADNI-1. (A) The classification performance of SNP based analyses performed in ADNI-1 considering two classification tasks: AD vs. healthy controls (cases@controls) or 1/2 APOEe4 vs. 0 APOEe4 carriers (APOEe4 task). B. ACC, Balanced Accuracy; MCC, Matthews Correlation Coefficient; #genes*, number of genes or intergenic regions. (B) Balanced accuracy distribution plots of the regular (light blue) and the permutation batches (red) related to chromosomes 6 and 20 in the cases@controls task. (C) Balanced Accuracy distribution plots of the regular (light blue) and the permutation (red) batches related to chromosomes 1, 3, 9, 19 and 20 in the APOEe4 task.
Figure 3SNP-based results of ADNI-2. (A) The classification performance of SNP based analyses performed in ADNI-2 considering two classification tasks: AD vs. healthy controls (cases@controls) or 1/2 APOEe4 vs. 0 APOEe4 carriers (APOEe4 task). B. ACC, Balanced Accuracy; MCC, Matthews Correlation Coefficient; #genes*, number of genes or intergenic regions. (B) Balanced accuracy distribution plots of regular (light blue) and permutation (red) batches related to chromosomes 9, 10, 14, 20, 21 in the cases@controls task. (C) Balanced accuracy distribution plots of regular (light blue) and permutation (red) batches related to chromosome 19 in the APOEe4 task.
Gene-based signatures identified in ADNI-1.
| Gene symbol | P-value | Chr | Significant SNPs/# tot. SNPs |
|---|---|---|---|
| 2.21e−7 | 19 | rs2075650/3 (intron) | |
| 9.97e−6 | 22 | rs738499 (intron)/(2 intron + 1 coding) | |
| 1.61e−5 | 22 | rs2073080/3 (intergenic) | |
| 1.82e−5 | 2 | rs2270280 (intron)/4 (2 UTR + 2 intron) | |
| 7.34e−5 | 11 | rs12279328/121 (intergenic) | |
| 7.88e−38 | 19 | rs2075650/3 (intron) | |
| 9.64e−16 | 19 | rs439401/2 (intergenic) | |
| 1.13e−7 | 19 | rs405509/1 (intergenic) | |
| 4.13e−5 | 13 | rs11841624/24 (intergenic) | |
Lists of genes identified by the SKAT software in the cases@controls and APOEe4 tasks. The genes with P value < 1.37 × 10–6 are considered significant.
Pathway-based signatures identified in ADNI-1 and ADNI-2.
| Dataset | Test score | Pathways signatures | |
|---|---|---|---|
ADNI-1 APOEε4 task | 0.68 | Amyloid fib. F., gamma carboxylation, unfolded protein resp., chaperonin, post-chap., | |
| 0.62 | PIP3 activates AKT signaling, | ||
| 0.62 | |||
ADNI-2 cases@controls task | 0.71 | ||
| Group 9c | 0.67 | Signaling by receptor tyrosine kinases | |
ADNI-2 APOEe4 task | Group 9c | 0.71 | Signaling by TGF-beta family memebers |
| 0.65 | Amyloid fib. F., reg. of insulin growth factor, gamma carboxylation, unfolded protein resp., post-chap., | ||
| 0.64 | |||
| 0.62 |
Lists of the groups of pathways found statistically significant in APOEe4 task for ADNI-1 and in both tasks (cases@controls and APOEe4) for ADNI-2. The groups 1c, 5a, 9a were in common with ADNI-1 and 2. The test score shows the classification performance of Group Lasso with overlap. See Tables S4 and S5 for the complete list of all the pathways analyzed inside each group in ADNI-1 and ADNI-2 dataset respectively.
In bold are highlighted those groups of pathways that are in common among the different analysis performed in ADNI-1 and ADNI-2 dataset.
Functional characterization in KEGG.
| Functional characterization | ||||
|---|---|---|---|---|
| KEGG database | ||||
| Pathway name | #Genes | Adj-P-value | Gene symbol | |
| ADNI-1 | 5 | 5.99e-06 | ||
| ADNI-2 | Calcium signaling pathway | 4 | 0.0040 | |
| 4 | 0.0064 | |||
| Axon guidance | 3 | 0.0064 | ||
| Chemokine signaling pathway | 5 | 0.0080 | ||
Pathways enrichment results of the SNPs signatures identified in ADNI-1 and ADNI-2 dataset, considering the APOEe4 and cases@controls tasks respectively. #genes*, number of genes Adj-P-value, adjusted P-value.
The pathways names highlighted in bold are commented in the main text.