| Literature DB >> 27109359 |
Jonathan D Mosley1, John S Witte2, Emma K Larkin1, Lisa Bastarache3, Christian M Shaffer1, Jason H Karnes1, C Michael Stein1, Elizabeth Phillips1, Scott J Hebbring4, Murray H Brilliant4, John Mayer5, Zhan Ye5, Dan M Roden1, Joshua C Denny1,3.
Abstract
We hypothesized that generalized linear mixed models (GLMMs), which estimate the additive genetic variance underlying phenotype variability, would facilitate rapid characterization of clinical phenotypes from an electronic health record. We evaluated 1,288 phenotypes in 29,349 subjects of European ancestry with single-nucleotide polymorphism (SNP) genotyping on the Illumina Exome Beadchip. We show that genetic liability estimates are primarily driven by SNPs identified by prior genome-wide association studies and SNPs within the human leukocyte antigen (HLA) region. We identify 44 (false discovery rate q<0.05) phenotypes associated with HLA SNP variation and show that hypothyroidism is genetically correlated with Type I diabetes (rG=0.31, s.e. 0.12, P=0.003). We also report novel SNP associations for hypothyroidism near HLA-DQA1/HLA-DQB1 at rs6906021 (combined odds ratio (OR)=1.2 (95% confidence interval (CI): 1.1-1.2), P=9.8 × 10(-11)) and for polymyalgia rheumatica near C6orf10 at rs6910071 (OR=1.5 (95% CI: 1.3-1.6), P=1.3 × 10(-10)). Phenome-wide application of GLMMs identifies phenotypes with important genetic drivers, and focusing on these phenotypes can identify novel genetic associations.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27109359 PMCID: PMC4848547 DOI: 10.1038/ncomms11433
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Genetic liability estimates and P-values for PheWAS phenotypes.
Each point represents the results from a mixed-models analysis using all SNPs with MAF>0.01 on the exome chip, adjusted for age, sex and 20 principal components. PheWAS phenotypes with P-values<10−16 (3 phenotypes) were set to 10−16 for display purposes. Phenotypes with a genetic liability estimate<0 are not shown.
PheWAS phenotypes associated with SNP variation in the HLA region*.
| Ankylosing spondylitis | 0.074 (0.013) | <1.0 × 10−20 | 0.0006 | 1 | 0.012 |
| Type I diabetes | 0.099 (0.014) | <1.0 × 10−20 | 1 | 0.00002 | 0.0033 |
| Rheumatoid arthritis | 0.035 (0.008) | <1.0 × 10−20 | 1 | 0.005 | 0.041 |
| Multiple sclerosis | 0.023 (0.006) | <1.0 × 10−20 | 0.68 | 0.01 | 0.23 |
| Hypothyroidism | 0.009 (0.003) | 5.6 × 10−9 | 0.7 | 0.067 | 0.13 |
| Psoriasis | 0.017 (0.005) | 6.5 × 10−9 | 0.044 | 0.97 | 0.17 |
| Juvenile rheumatoid arthritis | 0.025 (0.008) | 1.1 × 10−8 | 0.21 | 0.026 | 0.42 |
| Primary biliary cirrhosis | 0.017 (0.006) | 6.7 × 10−8 | 0.41 | 0.057 | 0.19 |
| Coeliac disease | 0.008 (0.004) | 3.8 × 10−7 | 1 | 0.062 | 0.31 |
| Macular degeneration | 0.013 (0.005) | 3.2 × 10−5 | 1 | 1 | 0.014 |
| Ulcerative colitis | 0.007 (0.003) | 3.9 × 10−5 | n/a | n/a | n/a |
| Systemic lupus erythematosus | 0.005 (0.003) | 2.7 × 10−4 | 0.89 | 0.59 | 0.22 |
| Premature menopause | 0.006 (0.003) | 1.0 × 10−3 | 0.72 | 0.72 | 0.25 |
| Sicca syndrome | 0.009 (0.004) | 1.8 × 10−6 | 1 | 0.14 | 0.28 |
| Dermatomyositis and polymyositis | 0.008 (0.004) | 1.1 × 10−5 | 0.11 | 0.88 | 0.61 |
| Polymyalgia rheumatica | 0.010 (0.005) | 3.5 × 10−4 | 1 | 0.8 | 0.042 |
| Cholangitis | 0.006 (0.004) | 5.8 × 10−4 | 0.38 | 1 | 0.17 |
| Dermatophytosis of the body | 0.004 (0.003) | 6.8 × 10−4 | 1 | 1 | 0.16 |
*Only phenotypes with an FDR q<0.05 and that mapped to the GWAS Catalog or have unreported associations are shown.
†From a multivariable GLMM analysis that incorporated a HLA and non-HLA GRM and adjusted for age and sex each of the PheWAS phenotypes. P-values correspond to the HLA variance component.
‡Shown are P-values from a multivariable GLMM analysis incorporating GRMs comprising SNPs within the specificied HLA region (see Methods).
||An estimate could not be obtained because the statistical model did not converge.
§A SNP association with a P-value<5 × 10−8 in the GWAS Catalog reported in the specified region.
Figure 2SNP association analysis for hypothyroidism and polymyalgia rheumatica.
All analyses used an additive genetic model adjusted for three principal components, age and sex. (a) Manhattan plot for the hypothyroidism phenotype (3,242 cases, 6,484 controls) and (b) LocusZoom plot highlighting SNP rs6906021. (c) Manhattan plot for polymyalgia rheumatica (413 cases, 5,782 controls) and (d) LocusZoom plot highlighting SNP rs6910071.
Significant SNP associations and replication of HLA-associated SNPs for hypothyroidism.
| rs6679677 | chr1:113761186 | A | 0.12/0.09 | 1.39 | 1.25–1.53 | 2.5 × 10−11 |
| rs2476601 | chr1:113834946 | A | 0.12/0.09 | 1.39 | 1.26–1.53 | 2.1 × 10−11 |
| rs6906021 | chr6:32658534 | C | 0.49/0.44 | 1.19 | 1.12–1.27 | 3.8 × 10−8 |
| rs965513 | chr9:97793827 | A | 0.30/0.35 | 0.78 | 0.73–0.84 | 1.6 × 10−13 |
| rs6906021 | chr6:32658534 | C | 0.49/0.46 | 1.16 | 1.07–1.26 | 2.0 × 10−4 |
| rs6906021 | chr6:32658534 | C | 1.18 | 1.12–1.24 | 9.8 × 10−11 | |
*Coordinates are for genome assembly GRCh38.p2.
†From an additive logistic regression model, adjusting for age, gender and three principal components.
‡From an additive logistic regression model, adjusting for age and gender.
Significant SNP associations and replication of HLA-associated SNPs for polymyalgia rheumatica.
| rs3096702 | chr6:32224554 | T | 0.48/0.39 | 1.52 | 1.31–1.76 | 2.0 × 10−8 |
| rs6910071 | chr6:32315077 | G | 0.30/0.22 | 1.58 | 1.34–1.85 | 2.7 × 10−8 |
| rs6910071 | chr6:32315077 | G | n/a | 1.24 | 1.01–1.52 | 0.04 |
| rs3096702 | chr6:32224554 | T | 0.40/0.36 | 1.20 | 0.84–1.52 | 0.14 |
| rs6910071 | chr6:32315077 | G | 0.26/0.18 | 1.53 | 1.17–2.0 | 0.002 |
| rs6910071 | 1.46 | 1.30–1.63 | 1.3 × 10−10 | |||
*Coordinates are for genome assembly GRCh38.p2.
†From an additive logistic regression model, adjusting for age, gender and three principal components (PCs).
‡Results extracted from the PheWAS Catalog for an additive logistic regression model, adjusting for age, gender and three PCs.
§From an additive logistic regression model, adjusting for age and gender.