| Literature DB >> 26255974 |
Joseph M Simonett1, Mahsa A Sohrab1, Jennifer Pacheco2, Loren L Armstrong3, Margarita Rzhetskaya3, Maureen Smith2, M Geoffrey Hayes4, Amani A Fawzi1.
Abstract
Age-related macular degeneration (AMD), a multifactorial, neurodegenerative disease, is a leading cause of vision loss. With the rapid advancement of DNA sequencing technologies, many AMD-associated genetic polymorphisms have been identified. Currently, the most time consuming steps of these studies are patient recruitment and phenotyping. In this study, we describe the development of an automated algorithm to identify neovascular (wet) AMD, non-neovascular (dry) AMD and control subjects using electronic medical record (EMR)-based criteria. Positive predictive value (91.7%) and negative predictive value (97.5%) were calculated using expert chart review as the gold standard to assess algorithm performance. We applied the algorithm to an EMR-linked DNA bio-repository to study previously identified AMD-associated single nucleotide polymorphisms (SNPs), using case/control status determined by the algorithm. Risk alleles of three SNPs, rs1061170 (CFH), rs1410996 (CFH), and rs10490924 (ARMS2) were found to be significantly associated with the AMD case/control status as defined by the algorithm. With the rapid growth of EMR-linked DNA biorepositories, patient selection algorithms can greatly increase the efficiency of genetic association study. We have found that stepwise validation of such an algorithm can result in reliable cohort selection and, when coupled within an EMR-linked DNA biorepository, replicates previously published AMD-associated SNPs.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26255974 PMCID: PMC4530462 DOI: 10.1038/srep12875
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1High-throughput clinical phenotyping algorithm outline.
Final HTCP algorithm applied to EMR-linked DNA biorepository. Red criteria were added after first round of case selection/expert chart review. ICD-9: International Classification of Disease-9, CPT: current procedural terminology.
Final Algorithm Performance Metrics.
| Classification | PPV | NPV | FNR |
|---|---|---|---|
| Overall AMD | 91.7% | 97.5% | 1.8% |
| Dry AMD | 73.3% | 95.7% | 12.0% |
| Wet AMD | 86.7% | 92.9% | 16.1% |
Final algorithm performance metrics for classifying cases as AMD, regardless of dry vs wet status, and for determining dry vs wet status of identified AMD cases. PPV: Positive predictive value, NPV: negative predictive value, FNR: false negative rate.
Demographic Characteristics of HTCP Defined AMD cases and controls.
| Phenotype | AMD identified cases | Control identified cases | |
|---|---|---|---|
| Age at last eye exam | 78.3 | 69.3 | |
| Female sex | 70.5% | 78.4% | 0.15 |
| History of smoking | 52.5% | 52.1% | 0.95 |
| BMI | 26.3 | 27.9 | 0.07 |
| Type 2 DM | 29.5% | 34.1% | 0.62 |
| Glaucoma | 42.6% | 35.5% | 0.36 |
| Cataracts | 86.9% | 68.9% |
BMI: Body mass index, DM: Diabetes mellitus. Bolded P values are statistically significant (P < 0.05).
Pooled Imputed and Directly Genotyped Association Results Between Previously Identified AMD Risk Alleles and HTCP Defined AMD case/control status.
| SNP | Near by gene | Previously reported risk allele | RAF in AMD Cases | RAF in controls | OR (CI) | Previously reported OR (CI) | Previously reported RAF | Published source | |
|---|---|---|---|---|---|---|---|---|---|
| rs1061170 | C | 0.580 | 0.363 | 2.43 (1.55–3.79) | 1.86 (1.77–1.97) | 0.49 | Sofat | ||
| rs10490924 | T | 0.321 | 0.201 | 1.88 (1.15–3.07) | 2.76 (2.72–2.80) | 0.30 | Fritsche | ||
| rs1410996 | C | 0.732 | 0.584 | 1.94 (1.20–3.14) | 1.98 (1.44–2.72) | 0.60 | Mori | ||
| rs11200638 | A | 0.304 | 0.205 | 1.69 (1.03–2.78) | 9.6E-03 | 1.80 (1.34–2.39) | 0.31 | Hadley | |
| rs2230199 | C | 0.607 | 0.448 | 1.90 (1.22–2.97) | 0.033 | 1.42 (1.37–1.47) | 0.20 | Fritsche | |
| rs8017304 | A | 0.670 | 0.581 | 1.46 (0.92–2.31) | 0.13 | 1.11 (1.08–1.14) | 0.61 | Fritsche | |
| rs833069 | G | 0.286 | 0.377 | 0.66 (0.41–1.06) | 0.04 | 1.69 (1.26–2.26) | 0.26 | Galan |
SNP: Single nucleotide polymorphism, RAF: Risk allele frequency, OR (CI): Odds ratio (95% Confidence Interval). Bolded P values are statistically significant after Bonferroni correction (P < 4.5 × 10−3).