| Literature DB >> 27805002 |
Yen-Ching Chen1,2,3, Chi-Jung Hsiao1, Chien-Cheng Jung1, Hui-Han Hu4, Jen-Hau Chen1,5, Wen-Chung Lee1,2,3, Jeng-Min Chiou6, Ta-Fu Chen7, Yu Sun8, Li-Li Wen9, Ping-Keung Yip10,11, Yi-Min Chu12, Chien-Jen Chen4, Hwai-I Yang4,13.
Abstract
Previous genome-wide association studies using P-values to select single nucleotide polymorphisms (SNPs) have suffered from high false-positive and false-negative results. This case-control study recruited 713 late-onset Alzheimer's disease (LOAD) cases and controls aged ≥65 from three teaching hospitals in northern Taiwan from 2007 to 2010. Performance metrics were used to select SNPs in stage 1, which were then genotyped to another dataset (stage 2). Four SNPs (CPXM2 rs2362967, APOC1 rs4420638, ZNF521 rs7230380, and rs12965520) were identified for LOAD by both traditional P-values (without correcting for multiple tests) and performance metrics. After correction for multiple tests, no SNPs were identified by traditional P-values. Simultaneous testing of APOE e4 and APOC1 rs4420638 (the SNP with the best performance in the performance metrics) significantly improved the low sensitivity of APOE e4 from 0.50 to 0.78. A point-based genetic model including these 2 SNPs and important covariates was constructed. Compared with elders with low-risks score (0-6), elders belonging to moderate-risk (score = 7-11) and high-risk (score = 12-18) groups showed a significantly increased risk of LOAD (adjusted odds ratio = 7.80 and 46.93, respectively; Ptrend < 0.0001). Performance metrics allow for identification of markers with moderate effect and are useful for creating genetic tests with clinical and public health implications.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27805002 PMCID: PMC5090242 DOI: 10.1038/srep36155
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics of the study population.
| Variables | Stage 1: Training set (N = 94) | Stage 2: Validation set (N = 619) | ||||
|---|---|---|---|---|---|---|
| LOAD | Control | LOAD | Control | |||
| N = 55 | N = 39 | N = 196 | N = 423 | |||
| Age (years) | 78.3 ± 5.6 | 75.4 ± 4.8 | 79.9 ± 6.5 | 73.1 ± 5.9 | ||
| Years of education | 8.3 ± 5.5 | 11.4 ± 4.4 | 7.9 ± 5.0 | 12.7 ± 4.1 | ||
| BMI (kg/m2) | 23.3 ± 1.7 | 23.4 ± 1.9 | 0.75 | 22.8 ± 3.6 | 24.2 ± 3.1 | |
| Female | 38 (69) | 24 (62) | 0.45 | 121 (62) | 210 (50) | |
| 27 (50) | 4 (11) | 72 (37) | 59 (15) | |||
| Alcohol drinking | 4 (7) | 1 (3) | 0.32 | 27 (14) | 50 (12) | |
| Ever smoking | — | — | — | 56 (29) | 75 (18) | |
| Cardiovascular disease | 11 (20) | 7 (18) | 0.85 | 50 (26) | 137 (33) | 0.08 |
| Stroke | 2 (4) | 1 (3) | 0.77 | 7 (4) | 5 (1) | 0.05 |
| Hypertension | 24 (44) | 18 (46) | 0.81 | 74 (38) | 236 (56) | |
| DM | — | — | — | 48 (24) | 67 (16) | |
| Hypercholesterolemia | — | — | — | 46 (24) | 141 (33) | |
Abbreviations: LOAD, late-onset Alzheimer’s disease; SD, sample standard deviation; BMI, body mass index; , apolipoprotein E; DM, diabetes mellitus.
The P-value was obtained by comparing the distribution of LOAD with the controls.
Numbers in bold indicate significant differences between cases and controls.
Because of restriction and matching in the training set (stage 1), some variables showed empty cells in the table.
Comparison between two APOE SNPs and the top SNPs selected by performance metrics or traditional measures in stage 1 and stage 2.
| Gene | SNP | Performance metrics | Traditional measures | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Youden | PPV | NPV | DOR | Accuracy | Net sensitivitya | Net specificitya | AUC (95% CI) | AOR (95% CI) | |||
| Stage 1 (Training set, N = 94) | |||||||||||||
| rs429358 & rs7412 | 0.50 | 0.89 | 0.39 | 0.87 | 0.56 | 8.5 | 0.66 | — | — | 0.835 (0.749–0.922) | 3.0 × 10−4 | 11.68 (3.13–43.53) | |
| rs10501617 | 5.8 × 10−4 | 3.3 (1.6–6.5) | |||||||||||
| rs2362967 | |||||||||||||
| rs4420638 | |||||||||||||
| rs7230380d | |||||||||||||
| rs12965520d | ( | ||||||||||||
| rs2137946 | 0.38 | 0.18 | −0.44 | 0.39 | 0.17 | 0.13 | 0.29 | 0.43 | 0.15 | 0.860 (0.781–0.938) | |||
| rs2137947 | 0.38 | 0.18 | −0.44 | 0.39 | 0.17 | 0.13 | 0.29 | 0.43 | 0.15 | 0.862 (0.785–0.939) | |||
| Stage 2 (Validation set, N = 619) | |||||||||||||
| rs429358 & rs7412 | 0.37 | 0.85 | 0.22 | 0.55 | 0.73 | 3.26 | 0.69 | — | — | 0.879 (0.850–0.909) | <0.0001 | 3.99 (2.36–6.75) | |
| rs10501617 | 0.44 | 0.54 | −0.02 | 0.31 | 0.67 | 0.91 | 0.51 | 0.64 | 0.46 | 0.879 (0.849–0.909) | 6.3 × 10−1 | 0.92 (0.64–1.31) | |
| rs2362967 | 0.30 | 0.03 | 0.33 | 0.71 | 1.16 | 0.44 | 0.25 | 0.879 (0.850–0.909) | 6.4 × 10−1 | 0.93 (0.68–1.28) | |||
| rs4420638 | 0.42 | 0.63 | 1.85 (0.90–3.79) | ||||||||||
| rs7230380 | 0.71 | 0.25 | −0.04 | 0.30 | 0.65 | 0.82 | 0.40 | 0.82 | 0.21 | 0.879 (0.850–0.909) | 2.4 × 10−1 | ||
| rs12965520 | 0.71 | 0.25 | −0.03 | 0.31 | 0.66 | 0.85 | 0.40 | 0.82 | 0.22 | 0.879 (0.850–0.909) | 2.8 × 10−1 | 0.36 (0.06–2.32) | |
Abbreviations: SNP, single nucleotide polymorphism; PPV, positive predictive value; NPV, negative predictive value; DOR, diagnostic odds ratio; AOR, adjusted odds ratio; AUC, area under the receiver operating characteristic curve; CI, confidence interval.
aNet sensitivity and net specificity were obtained for each selected SNP and APOE e4 for simultaneous screening purposes.
bNone of the SNPs reached statistical significance (P < 10−7) after correction for multiple tests using the traditionally measured P-value.
cAPOE e4 status was determined by rs429358 & rs7412; the performance metrics estimation was based on APOE e4 carriers versus non-carriers.
dBecause of ties (P = 1.3 × 10−4 for ZNF521 rs7230380 and rs12965520), 6 SNPs were selected using the traditional P-value.
Except for APOE e4, the performance metrics or traditional measures for the remaining SNPs were estimated based on dominant genetic models.
For SNPs selected based on performance metrics or P-value, numbers in bold indicate the highest value of the corresponding indices or the lowest for P-value.
Italicised numbers indicate the top 5 SNPs selected by performance metrics and the top 6 SNPs (because of ties) selected by traditional P-values (regardless of statistical significance after correction for multiple tests), respectively.
BHLHB2 rs2137946 and rs2137947 were selected using traditional P-value regardless of statistical significance. Because these two SNPs were not identified by performance metrics, they were not genotyped in stage 2.
Numbers in bold indicate the largest value of each performance matrix or traditional measure.
Characteristics of 2 APOE SNPs and the top 5 SNPs selected by performance metrics in stage 2 (N = 619).
| Gene | SNP | Nucleotide change | Chromosome/Location | HapMap or dbSNP MAF | Controls (n = 423) | Cases (n = 196) | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| CEU | CHB | JPT | MAF | HWE | MAF | HWE | ||||
| rs429358 ( | T → C | 19/exon | 0.15* | 0.00 | 0.01 | 0.08 | 0.68 | 0.23 | 0.27 | |
| rs7412 ( | C → T | 19/exon | 0.08* | 0.11 | 0.05 | 0.08 | 0.22 | 0.06 | 0.97 | |
| rs4420638 | G → A | 19/3′UTR | 0.18 | 0.12 | 0.07 | 0.11 | 0.93 | 0.23 | 0.75 | |
| rs10501617 | T → C | 11/intronic | 0.40 | 0.20 | 0.16 | 0.27 | 0.37 | 0.26 | 0.64 | |
| rs7230380 | A → C | 18/intronic | 0.47 | 0.49 | 0.50 | 0.50 | 0.90 | 0.48 | 0.36 | |
| rs12965520 | T → C | 18/intronic | 0.50 | 0.45 | 0.50 | 0.50 | 0.58 | 0.48 | 0.33 | |
| rs2362967 | C → T | 10/intronic | 0.26 | 0.50 | 0.58 | 0.48 | 0.03 | 0.48 | 0.78 | |
Abbreviations: SNP, single nucleotide polymorphism; UTR, untranslated region; CEU, Utah residents with Northern and Western European ancestry from the CEPH collection; JPT, Japanese in Tokyo, Japan; CHB, Han Chinese in Beijing; MAF, minor allele frequency; HWE, Hardy-Weinberg equilibrium test.
*MAF data were obtained from dbSNP because the data were not available from the HapMap Project.
Final genetic model for predicting late-onset Alzheimer’s disease.
| Variables | Coefficient | Point |
|---|---|---|
| Age groups | ||
| 0 | 0 | |
| 2.42 | 7 | |
| Sex | ||
| Men | 0 | 0 |
| Women | 0.37 | 1 |
| Education | ||
| >6 years | 0 | 0 |
| ≤6 years | 2.11 | 6 |
| Non-carriers | 0 | 0 |
| Carriers | 0.91 | 2 |
| Non-carriers | 0 | 0 |
| Carriers | 0.64 | 2 |
| Total range | 0–18 | |
| Mean score ± SD | 6.2 ± 5.1 | |
| AUC (95% CI) | 0.87 (0.83–0.90) | |
| Corrected for optimism | 0.866 | |
Abbreviations: SD, sample standard deviation; AUC, area under the receiver operating characteristic curve; CI, confidence interval.
The correction for optimism was performed by leave-one-out cross-validation (LOOCV) in stage 2 (validation set).
Figure 1Distribution of late-onset Alzheimer’s disease cases and controls with low, moderate, and high risk.