| Literature DB >> 19756043 |
Wei Liu1, Jinhui Ding, Jesse Raphael Gibbs, Sue Jane Wang, John Hardy, Andrew Singleton.
Abstract
Here we propose a simple statistical algorithm for rapidly scoring loci associated with disease or traits due to recessive mutations or deletions using genome-wide single nucleotide polymorphism genotyping case-control data in unrelated individuals. This algorithm identifies loci by defining homozygous segments of the genome present at significantly different frequencies between cases and controls. We found that false positive loci could be effectively removed from the output of this procedure by applying different physical size thresholds for the homozygous segments. This procedure is then conducted iteratively using random sub-datasets until the number of selected loci converges. We demonstrate this method in a publicly available data set for Alzheimer's disease and identify 26 candidate risk loci in the 22 autosomes. In this data set, these loci can explain 75% of the genetic risk variability of the disease.Entities:
Mesh:
Year: 2009 PMID: 19756043 PMCID: PMC2758715 DOI: 10.1038/msb.2009.53
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Scheme for computing the proportion of a locus on AHs. For a given chromosome of a subject, the symbols (•, ○) represent SNP loci. The shaded segments denote AHs with size greater than or equal to a pre-selected threshold C. The proportion of a locus on AHs is computed as p= (the number of AHs containing this locus)/(the total number of individuals), for example p1=4/6 for SNP-1.
Figure 2The plot of z versus nucleotide basepair of chromosome 19 in the AD data set: (A) before and (B) after the procedure of adjacent-C-selection, (C) the most significant region—the peak locus is rs4420638, (D) the most significant region with two loci on APOE (↓).
Figure 3Convergence of the loci number. (A) At a level of significance α=0.001, a total of 607 loci (□) were selected from the 4054 loci for which ∣z∣⩾ z1−α/2 (Δ) by applying the procedure of adjacent-C-selection in the AD data set. Random case–control subsets were generated using f=0.9 and used in screening iteration (○). (B) The enlarged plot showing the convergence of selected loci to the number 26.
List of candidate loci associated with AD from the 22 autosome of the AD SNP genotype data (Coon )
| CHR | SNP ID | Locationa | Function | Gene | Gene ID | Effect |
|---|---|---|---|---|---|---|
| aIn nucleotide basepair. | ||||||
| bLoci remained in the model on logistic regression selection with a | ||||||
| cLoci in homozygous regions containing candidate loci of recessive genetic lesion causing AD ( | ||||||
| dGenes are on known functional pathways and networks as revealed by the use of Ingenuity Pathway Analysis (Ingenuity Systems, | ||||||
| eA SNP in Affymetrix 500K GeneChip, but without NCBI ID. | ||||||
| 1 | rs17325887b,c | 69998761 | Intron | 57554 | Risk | |
| 1 | rs7520521c | 70020703 | Intron | 57554 | Risk | |
| 1 | rs1913269b,c | 70052194 | Intron | 57554 | Risk | |
| 1 | rs10754339b | 117491795 | mRNA–UTR | 79679 | Protect | |
| 1 | rs16842422b | 196366613 | –66918 | 647195 | Protect | |
| 2 | rs7582851 | 192032391 | –392328 | 647167 | Protect | |
| 3 | rs6784615b | 52481466 | Intron | 11188 | Protect | |
| 4 | rs9994615 | 40786592 | Intron | 323 | Risk | |
| 4 | rs10015784b | 40793978 | Intron | 323 | Risk | |
| 5 | rs1602843b,c | 86324342 | 0 | 255631 | Risk | |
| 5 | rs2913719b | 163947773 | 2403 | 440700 | Protect | |
| 6 | rs13213247b | 81572755 | –91974 | 729817 | Risk | |
| 6 | rs16892285 | 81592721 | –72008 | 729817 | Risk | |
| 6 | rs13193950 | 81593433 | –71296 | 729817 | Risk | |
| 6 | rs156232b | 104979509 | 481535 | 642337 | Risk | |
| 10 | rs10827687b | 36999313 | –39887 | 2899 | Risk | |
| 10 | rs10824310b | 53698470 | Intron | 5592 | Risk | |
| 10 | rs10740548 | 54877234 | –2797 | 374977 | Risk | |
| 11 | rs1038891b,c | 40895642 | 0 | 9783 | Risk | |
| 12 | rs1354470b | 59088188 | –32939 | 645757 | Risk | |
| 12 | rs7967572 | 73396068 | 51514 | 126811 | Risk | |
| 18 | rs1785928b | 31979929 | Coding non-synonymous | 55250 | Risk | |
| 19 | rs11879589 | 50065116 | Intron | 5819 | Protect | |
| 19 | rs4420638b | 50114786 | Locus region | 341 | Protect | |
| 19 | e | 50150075 | ||||
| 19 | rs204907 | 50153836 | Intron | 1209 | Protect | |