| Literature DB >> 24949630 |
Jonathan D Mosley1, Sara L Van Driest2, Peter E Weeke1, Jessica T Delaney1, Quinn S Wells1, Lisa Bastarache3, Dan M Roden1, Josh C Denny4.
Abstract
The coupling of electronic medical records (EMR) with genetic data has created the potential for implementing reverse genetic approaches in humans, whereby the function of a gene is inferred from the shared pattern of morbidity among homozygotes of a genetic variant. We explored the feasibility of this approach to identify phenotypes associated with low frequency variants using Vanderbilt's EMR-based BioVU resource. We analyzed 1,658 low frequency non-synonymous SNPs (nsSNPs) with a minor allele frequency (MAF)<10% collected on 8,546 subjects. For each nsSNP, we identified diagnoses shared by at least 2 minor allele homozygotes and with an association p<0.05. The diagnoses were reviewed by a clinician to ascertain whether they may share a common mechanistic basis. While a number of biologically compelling clinical patterns of association were observed, the frequency of these associations was identical to that observed using genotype-permuted data sets, indicating that the associations were likely due to chance. To refine our analysis associations, we then restricted the analysis to 711 nsSNPs in genes with phenotypes in the On-line Mendelian Inheritance in Man (OMIM) or knock-out mouse phenotype databases. An initial comparison of the EMR diagnoses to the known in vivo functions of the gene identified 25 candidate nsSNPs, 19 of which had significant genotype-phenotype associations when tested using matched controls. Twleve of the 19 nsSNPs associations were confirmed by a detailed record review. Four of 12 nsSNP-phenotype associations were successfully replicated in an independent data set: thrombosis (F5,rs6031), seizures/convulsions (GPR98,rs13157270), macular degeneration (CNGB3,rs3735972), and GI bleeding (HGFAC,rs16844401). These analyses demonstrate the feasibility and challenges of using reverse genetics approaches to identify novel gene-phenotype associations in human subjects using low frequency variants. As increasing amounts of rare variant data are generated from modern genotyping and sequence platforms, model organism data may be an important tool to enable discovery.Entities:
Mesh:
Year: 2014 PMID: 24949630 PMCID: PMC4065041 DOI: 10.1371/journal.pone.0100322
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Overview of the nsSNP selection process.
There was no difference in number of diagnoses significantly associated with the 1,658 nsSNPs when compared to genotype-permuted data. Hence, a nsSNP selection strategy that compared to diagnoses to those reported in either OMIM or the KO Mouse data was used. A multi-step selection and review process identified 12 candidate nsSNPs.
Population characteristics.
| Total Subjects (n) | 8645 |
| No. males (%) | 4079 (47.2) |
| No. females (%) | 4566 (52.8) |
| No. European Ancestry (%) | 6002 (69.4) |
| No. African American (%) | 1734 (20.1) |
| No. other races (%) | 909 (10.5) |
| Mean (std) age of last available diagnosis (years) | 52 (18) |
| Mean (std) duration of EMR follow-up (years) | 7 (5) |
Characteristics of the selected nsSNPs.
| SNP | Gene | Chr | Position | OMIM/KO Mouse phenotype(s) | MAF white/black | SNP function | PolyPhen prediction |
| rs17255978 |
| 7 | 87754915 | Peripheral neuropathy | 0.04/0.10 | missense | benign |
| rs33986943 |
| 17 | 41004637 | Abnormal leukocyte adhesion; decreased lymphocytes in Peyer patches; decreased reduced serum IgA | 0.10/0.02 | missense | benign |
| rs16027 |
| 19 | 13397560 | Migraine, familial hemiplegic | 0.09/0.03 | missense | unknown |
| rs3735972 |
| 8 | 87588198 | Macular degeneration/Achromatopsia | 0.09/0.08 | missense | unknown |
| rs1800067 |
| 16 | 14029033 | Xeroderma Pigmentosum, XFE progeroid syndrome | 0.08/0.01 | missense | probably damaging |
| rs6031 |
| 1 | 169511903 | Factor V deficiency | 0.00/0.08 | missense | unknown |
| rs2291628 |
| 5 | 127609633 | Syn/polydactyly, osteoporosis, abnormal bone remodeling | 0.08/0.08 | missense | benign |
| rs13157270 |
| 5 | 90012379 | Febrile seizures, familial | 0.09/0.02 | missense | benign |
| rs16844401 |
| 4 | 3449652 | Impaired intestinal mucosal healing | 0.07/0.03 | missense | benign |
| rs17537869 |
| 16 | 81922813 | Familial cold auto-inflammatory syndrome | 0.07/0.01 | missense | probably damaging |
| rs5939 |
| 1 | 28476520 | Decreased infection susceptibility, including streptococcus | 0.00/0.08 | missense | unknown |
| rs8192619 |
| 6 | 132966348 | Increased NE/dopamine, abnormal prepulse inhibition | 0.05/0.06 | stop-gained | unknown |
OMIM/KO mouse phenotypes are associated at the gene level, not the specific nsSNP. Minor allele frequencies (MAF) are based on the frequencies observed in this study population. Chromosome and position are from Human Annotation Release 104.
Association statistics for the 12 candidate nsSNPs.
| SNP/Gene | Phenotypes | Total minor allele HZ | Affected minor allele HZ | Total common allele HZ | Affected common allele HZ | OR | 95% CI | p-value |
| rs17255978/ | Peripheral neuropathy | 28 | 9 | 1402 | 143 | 4.2 | (1.9–9.4) | 0.0006 |
| Demyelination disease | 28 | 0 | 1402 | 7 | n/a | |||
| Seizures | 28 | 1 | 1402 | 55 | 0.9 | (0.1–6.8) | 0.92 | |
| rs33986943/ | Gram negative sepsis | 35 | 7 | 1023 | 56 | 4.3 | (1.8–10.3) | 0.001 |
| All sepsis | 35 | 16 | 1023 | 307 | 2.0 | (1.0–3.9) | 0.051 | |
| Gram positive sepsis | 35 | 11 | 1023 | 190 | 2.0 | (1.0–4.2) | 0.06 | |
| Decrease serum IgA | 35 | 0 | 1023 | 4 | n/a | |||
| rs16027/ | Migraine | 54 | 10 | 1512 | 114 | 2.8 | (1.4–5.7) | 0.004 |
| Convulsions | 54 | 10 | 1512 | 128 | 2.5 | (1.2–5.0) | 0.01 | |
| Seizures | 54 | 4 | 1512 | 69 | 1.7 | (0.6–4.8) | 0.33 | |
| rs3735972/ | Cataract | 74 | 18 | 1554 | 162 | 2.6 | (1.5–4.6) | 0.0007 |
| Cataract (age>50) | 43 | 15 | 860 | 124 | 3.2 | (1.7–6.1) | 0.0005 | |
| Macular degeneration | 74 | 4 | 1554 | 19 | 4.4 | (1.5–13.3) | 0.008 | |
| Colorblindness | 74 | 0 | 1554 | 0 | n/a | |||
| Retinopathy (not hypertension or diabetes) | 74 | 7 | 1554 | 76 | 1.9 | (0.9–4.3) | 0.11 | |
| rs1800067/ | Seborrheic keratosis | 42 | 8 | 1512 | 124 | 2.6 | (1.2–5.8) | 0.016 |
| rs6031/ | Pregnancy loss | 13 | 3 | 715 | 23 | 9.0 | (2.3–35.0) | 0.001 |
| On anti-coagulant | 15 | 4 | 825 | 74 | 3.7 | (1.1–11.9) | 0.028 | |
| Stroke | 15 | 4 | 825 | 81 | 3.3 | (1.0–10.7) | 0.04 | |
| Venous thrombosis | 15 | 3 | 825 | 52 | 3.7 | (1.0–13.6) | 0.047 | |
| Budd-Chiari syndrome | 15 | 0 | 825 | 1 | n/a | |||
| rs2291628/ | Avascular necrosis | 62 | 7 | 1488 | 29 | 6.4 | (2.7–15.3) | <.0001 |
| Osteomyelitis | 62 | 8 | 1488 | 76 | 2.8 | (1.3–6.0) | 0.01 | |
| Bone fracture | 62 | 20 | 1488 | 306 | 1.8 | (1.1–3.2) | 0.029 | |
| Pathologic fracture | 62 | 5 | 1488 | 43 | 2.9 | (1.1–7.7) | 0.027 | |
| Osteoporosis | 62 | 7 | 1488 | 206 | 0.8 | (0.4–1.8) | 0.56 | |
| Joint disease | 62 | 27 | 1488 | 663 | 1.0 | (0.6–1.6) | 0.87 | |
| Polydactyly | 62 | 0 | 1488 | 5 | n/a | |||
| rs13157270/ | Epilepsy | 48 | 7 | 1488 | 77 | 3.1 | (1.4–7.2) | 0.007 |
| Febrile seizure | 48 | 0 | 1488 | 4 | ||||
| Convulsions | 48 | 7 | 1488 | 140 | 1.6 | (0.7–3.7) | 0.23 | |
| rs16844401/ | GI bleed | 32 | 6 | 1503 | 68 | 4.9 | (1.9–12.2) | 0.0007 |
| GI infections (bacterial) | 32 | 1 | 1503 | 109 | 0.4 | (0.1–3.1) | 0.38 | |
| rs17537869/ | Extrinsic asthma | 11 | 2 | 1279 | 18 | 15.6 | (3.1–77.2) | 0.0008 |
| Humoral immunity/Decreased IgA,IgM | 11 | 1 | 1279 | 0 | 9.7 | (1.2–81.7) | 0.03 | |
| Allergic reactions | 11 | 5 | 1279 | 251 | 3.4 | (1.0–11.3) | 0.04 | |
| Allergic rhinitis | 11 | 2 | 1279 | 182 | 1.3 | (0.3–6.3) | 0.70 | |
| Cold induced urticaria | 11 | 0 | 1279 | 1 | n/a | |||
| rs5939/ | Bacterial meningitis | 17 | 3 | 964 | 7 | 29.3 | (6.9–125.1) | <.0001 |
| Acute upper respiratory infection | 17 | 11 | 964 | 199 | 7.0 | (2.6–19.3) | 0.0001 | |
| Chronic sinusitis | 17 | 6 | 964 | 103 | 4.6 | (1.7–12.6) | 0.003 | |
| Sinusitis | 17 | 8 | 964 | 188 | 3.7 | (1.4–9.6) | 0.008 | |
| Acute sinusitis | 17 | 6 | 964 | 137 | 3.3 | (1.2–9.1) | 0.02 | |
| All sepsis | 17 | 3 | 964 | 84 | 2.2 | (0.6–8.0) | 0.21 | |
| Gram positive sepsis | 17 | 2 | 964 | 53 | 2.3 | (0.5–10.3) | 0.27 | |
| Strep infections | 17 | 1 | 964 | 41 | 1.4 | (0.2–10.9) | 0.74 | |
| Gram negative sepsis | 17 | 0 | 964 | 12 | ||||
| rs8192619/ | Anxiety | 17 | 7 | 725 | 132 | 4.6 | (1.7–12.5) | 0.002 |
| Depression | 17 | 9 | 725 | 173 | 3.6 | (1.4–9.4) | 0.009 | |
| Schizophrenia | 17 | 0 | 725 | 12 | n/a |
For each nsSNP, clinical phenotypes were constructed using diagnosis codes that closely approximated the phenotype descriptions in the OMIM and KO mouse databases. Shown are the subject counts and results of exact logistic regression analyses comparing minor allele homozygotes to matched common allele homozygotes. The common allele homozygotes were matched for age, race, gender and data set.
Replication analyses.
| European Americans | African Americans | |||||||||
| SNP/Gene | Phenotypes | Cases/Controls | OR | 95% CI | p-value | Cases/Controls | OR | 95% CI | p-value | |
| rs17255978/ | Peripheral neuropathy | 2284/17315 | 1.0 | (0.9–1.2) | 0.83 | 264/1729 | 1.2 | (0.9–1.6) | 0.33 | |
|
| ||||||||||
| rs33986943/ | Gram negative sepsis | — | — | — | — | — | — | — | — | |
|
| All sepsis | 12038/7561 | 1.0 | (0.9–1.1) | 0.62 | 1456/537 | 1.0 | (0.6–1.6) | 0.89 | |
| rs3735972/ | Cataract | 2879/16720 | 1.1 | (1.0–1.2) |
| 418/1575 | 0.9 | (0.6–1.2) | 0.34 | |
|
| Macular degeneration | 809/18790 | 1.2 | (1.0–1.4) |
| 56/1937 | 1.1 | (0.5–2.2) | 0.84 | |
| rs1800067/ | Seborrheic keratosis | 2952/16647 | 1.0 | (0.9–1.2) | 0.5 | — | — | — | — | |
|
| ||||||||||
| rs6031/ | Pregnancy loss | — | — | — | — | — | — | — | — | |
|
| On anti-coagulant | — | — | — | — | — | — | — | — | |
| Stroke | — | — | — | — | 255/1738 | 1.4 | (1.0–1.9) |
| ||
| Venous thrombosis | — | — | — | — | 181/1812 | 0.8 | (0.5–1.2) | 0.32 | ||
| rs2291628/ | Avascular necrosis | 147/19452 | 1.0 | (0.6–1.5) | 0.82 | — | — | — | — | |
|
| Osteomyelitis | 309/19290 | 1.0 | (0.8–1.4) | 0.75 | 60/1933 | 1.2 | (0.6–2.3) | 0.61 | |
| Bone fracture | 2706/16893 | 1.0 | (0.9–1.1) | 0.63 | 294/1699 | 0.8 | (0.6–1.2) | 0.29 | ||
| Pathologic fracture | 525/19074 | 1.1 | (0.8–1.3) | 0.64 | — | — | — | — | ||
| rs13157270/ | Epilepsy | 659/18940 | 1.0 | (0.8–1.2) | 0.91 | 83/1910 | 2.0 | (1.0–4.2) | 0.06 | |
|
| Convulsions | 1215/18384 | 1.0 | (0.8–1.1) | 0.53 | 164/1829 | 1.9 | (1.1–3.3) |
| |
| rs16844401/ | GI bleed | 1630/17969 | 1.2 | (1.0–1.4) |
| 251/1742 | 1.0 | (0.5–1.9) | 0.97 | |
|
| ||||||||||
| rs17537869/ | Extrinsic asthma | 360/19239 | 1.2 | (0.9–1.6) | 0.12 | 73/1920 | 0.0 | — | 0.99 | |
|
| Humoral immunity/Decreased IgA,IgM | 75/19524 | 1.1 | (0.6–2.0) | 0.81 | — | — | — | — | |
| Allergic reactions | 3720/15879 | 0.9 | (0.8–1.0) | 0.11 | 355/1638 | 1.2 | (0.5–2.8) | 0.68 | ||
| rs5939/ | Bacterial meningitis | — | — | — | — | — | — | — | — | |
|
| Acute upper respiratory infection | — | — | — | — | 466/1527 | 1.2 | (0.9–1.6) | 0.15 | |
| Chronic sinusitis | — | — | — | — | — | — | — | — | ||
| Sinusitis | — | — | — | — | — | — | — | — | ||
| Acute sinusitis | — | — | — | — | — | — | — | — | ||
Replication analyses for nsSNP-phenotype associations using an additive logistic regression model adjusting for age, gender and principal components. A (—) indicates that less than 50 cases (i.e., individuals with the given phenotype) were available for analyses.