| Literature DB >> 26366461 |
Fayroz F Sherif1, Nourhan Zayed2, Mahmoud Fakhr2.
Abstract
Single nucleotide polymorphisms (SNPs) contribute most of the genetic variation to the human genome. SNPs associate with many complex and common diseases like Alzheimer's disease (AD). Discovering SNP biomarkers at different loci can improve early diagnosis and treatment of these diseases. Bayesian network provides a comprehensible and modular framework for representing interactions between genes or single SNPs. Here, different Bayesian network structure learning algorithms have been applied in whole genome sequencing (WGS) data for detecting the causal AD SNPs and gene-SNP interactions. We focused on polymorphisms in the top ten genes associated with AD and identified by genome-wide association (GWA) studies. New SNP biomarkers were observed to be significantly associated with Alzheimer's disease. These SNPs are rs7530069, rs113464261, rs114506298, rs73504429, rs7929589, rs76306710, and rs668134. The obtained results demonstrated the effectiveness of using BN for identifying AD causal SNPs with acceptable accuracy. The results guarantee that the SNP set detected by Markov blanket based methods has a strong association with AD disease and achieves better performance than both naïve Bayes and tree augmented naïve Bayes. Minimal augmented Markov blanket reaches accuracy of 66.13% and sensitivity of 88.87% versus 61.58% and 59.43% in naïve Bayes, respectively.Entities:
Year: 2015 PMID: 26366461 PMCID: PMC4561111 DOI: 10.1155/2015/639367
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Figure 1Summary of the proposed system.
The top candidate genes and the number of SNPs among each gene.
| Gene | Chromosome | Number of SNPs | Potential pathways |
|---|---|---|---|
| APOE | 19 | 6 | Cholesterol/lipid metabolism |
| BIN1 | 2 | 101 | Endocytic pathways |
| CLU | 8 | 32 | Immune and cholesterol/lipid metabolism |
| ABCA7 | 19 | 36 | Cholesterol/lipid metabolism; immune and complement systems/inflammatory response |
| CR1 | 1 | 71 | Immune and complement systems/inflammatory response |
| PICALM | 11 | 138 | Endocytic pathways |
| MS4A6A | 11 | 12 | Immune and complement systems/inflammatory response |
| CD33 | 19 | 13 | Immune and complement systems/inflammatory response |
| CD2AP | 6 | 61 | Endocytic pathways; immune and complement systems/inflammatory response |
Figure 2(a) Naïve Bayes structure. (b) Tree augmented naïve Bayes structure.
Figure 3The network structure of (a) Markov blanket algorithm and (b) minimal augmented Markov blanket.
Figure 4Top related SNPs with Alzheimer' disease using minimal augmented Markov blanket (SNPs kgp11800793 and kgp5536625 overlapped as they have the same mutual information with AD).
Prediction accuracy results, sensitivity, and specificity for various used algorithms.
| Algorithm | Accuracy | Sensitivity | Specificity | Number of SNPs |
|---|---|---|---|---|
| Naïve Bayes | 61.58% | 59.43% | 65.6% | 435 |
| Tree augmented naïve Bayes | 64.29% | 67.55% | 58.16% | 435 |
| Markov blanket | 65.64% | 77.55% | 43.26% | 13 |
| Minimal augmented Markov blanket | 66.13% | 88.87% | 16.31% | 11 |
Figure 5Comparative ROC curve of the four resulting structures.