| Literature DB >> 28774272 |
Joe M Butler1, Neil Hall2, Niro Narendran3, Yit C Yang3, Luminita Paraoan4.
Abstract
BACKGROUND: Human polymorphisms with derived alleles that are protective against disease may provide powerful translational opportunities. Here we report a method to identify such candidate polymorphisms and apply it to common non-synonymous SNPs (nsSNPs) associated with common diseases. Our study also sought to establish which of the identified protective nsSNPs show evidence of positive selection, taking this as indirect evidence that the protective variant has a beneficial effect on phenotype. Further, we performed an analysis to quantify the predicted effect of each protective variant on protein function/structure.Entities:
Keywords: Age-related macular degeneration; Ancestral allele; Candidate gene; Common variants; Gain of function; Gene selection; Inflammatory bowel disease; Missense mutation; Multiple sclerosis and rheumatoid arthritis; Non-synonymous SNP; Protective variants; Type 2 diabetes
Mesh:
Substances:
Year: 2017 PMID: 28774272 PMCID: PMC5543444 DOI: 10.1186/s12864-017-3964-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Non-synonymous SNPs associated with AMD validated by meta-analysis
| Position | Locus | nsSNP | Derived allele | Meta-analysis validation | |
|---|---|---|---|---|---|
| chr1: 196,659,237 |
| rs1061170:T > C | Y402H | risk | [ |
| chr1: 196,642,233 | rs800292:A > G | I62V | risk | [ | |
| chr6: 31,903,804 |
| rs9332739:G > C | E318D | protective | [ |
| chr6: 31,914,180 |
| rs641153:C > T | R32Q | protective | [ |
| chr10: 124,214,448 |
| rs10490924:G > T | A69S | risk | [ |
| chr19: 6,718,387 |
| rs2230199:C > G | R102G | risk | [ |
| chr19: 45,411,941 |
| rs429358:T > C | C112R | risk | [ |
| chr19: 45,412,079 | rs7412:C > T | R158C | risk | [ | |
Fig. 1Integrated haplotype score analysis of the SNP sets corresponding to two AMD candidate protective nsSNPs in C2 and CFB genes and one nsSNP protective against hypertension in LPL. High LD SNP sets, represented as genetic trees, for (a) CFB R32Q, (b) C2 E318D and (d) LPL S447X, where SNPs are numbered on the branches. The numbers at the bottom of each branch represent the number of haplotypes it is comprised of. SNPs on genetic trees are aligned against UCSC genome browser (c) and (e), displaying RefSeq genes in the respective regions (nsSNPs are designated with an asterisk *). The integrated haplotype score (iHS) for each of these SNPs is calculated, represented by the distributions shown in (f) for each nsSNP. Mean iHS is represented by a black circle (●), median by a thick black line and the box represents the central 50% of the data. An outlier is represented by white circle (∘)
Candidate protective nsSNPs
| Disease | Gene | rsID | Chr | Position | Anc | Der | ORb | SNPs in high LD set | Mean iHS | GWAS | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Multiple sclerosis | IL7R | rs6897932 | 5 | 35,874,575 | C | T | 1.12 | 6 | −1.462 | [ |
| 2 | Glaucoma (primary open-angle) | COL11A1 | rs3753841 | 1 | 103,379,918 | G | A | 1.20 | 22 | −1.270 | [ |
| 3 | Type 2 diabetes | SLC30A8 | rs13266634 | 8 | 118,184,783 | C | T | 1.16 | 5 | −1.218 | [ |
| 4 | Psoriasis | TYK2 | rs12720356 | 19 | 10,469,975 | A | C | 1.40 | 1 | −1.197 | [ |
| 5 | Acute lymphoblastic leukemia (childhood) | KCNE4 | rs12621643 | 2 | 223,917,983 | T | G | 1.48 | 3 | −1.167 | [ |
| 6 | Ulcerative colitis | IL17REL | rs5771069 | 22 | 50,435,480 | G | A | 1.11 | 10 | −1.121 | [ |
| 7 | Esophageal cancer and gastric cancer | PLCE1 | rs3765524 | 10 | 96,058,298 | T | C | 1.35 | 23 | −0.897 | [ |
| 8 | IgA nephropathy | TNFSF13 | rs3803800 | 17 | 7,462,969 | A | G | 1.21 | 23 | −0.819 | [ |
| 9 | Systemic lupus erythematosus | WDFY4 | rs7097397 | 10 | 50,025,396 | G | A | 1.30 | 18 | −0.767 | [ |
| 10 | Esophageal cancer | PLCE1 | rs2274223 | 10 | 96,066,341 | G | A | 1.34 | 38 | −0.740 | [ |
| 11 | AMD | CFB | rs641153 | 6 | 31,914,180 | G | A | 2.44 | 13 | −0.723 | - |
| 12 | Inflammatory bowel disease | CD6 | rs11230563 | 11 | 60,776,209 | C | T | 1.09 | 21 | −0.705 | [ |
| 13 | Behcet’s disease | KLRC4 | rs2617170 | 12 | 10,560,957 | T | C | 1.28 | 46 | −0.657 | [ |
| 14 | Coronary heart disease | ZC3HC1 | rs11556924 | 7 | 129,663,496 | C | T | 1.09 | 6 | −0.649 | [ |
| 15 | Type 1 diabetes | TYK2 | rs2304256 | 19 | 10,475,652 | C | A | 1.16 | 7 | −0.627 | [ |
| 16 | Obesity | GIPR | rs1800437 | 19 | 46,181,392 | G | C | 1.1 | 10 | −0.606 | [ |
| 17 | AMD | C2 | rs9332739 | 6 | 31,903,804 | G | C | 1.82 | 9 | −0.551 | - |
| 18 | Type 2 diabetes | PPARG | rs1801282 | 3 | 12,393,125 | C | G | 1.14 | 11 | −0.537 | [ |
| 19 | Inflammatory bowel disease | TUBD1 | rs1292053 | 17 | 57,963,537 | G | A | 1.08 | 41 | −0.498 | [ |
| 20 | Migraine | MMP17 | rs6598163 | 12 | 132,325,239 | G | A | 1.15 | 22 | −0.493 | [ |
| 21 | Glaucoma (exfoliation) | LOXL1 | rs3825942 | 15 | 74,219,582 | G | A | 20.1 | 26 | −0.453 | [ |
| 22 | Inflammatory bowel disease | IL23R | rs11209026 | 1 | 67,705,958 | G | A | 2.01 | 2 | −0.410 | [ |
| 23 | Multiple sclerosis | MPV17L2 | rs874628 | 19 | 18,304,700 | A | G | 1.11 | 8 | −0.292 | [ |
| 24 | Rheumatoid arthritis | RTKN2 | rs3125734 | 10 | 63,958,112 | T | C | 1.20 | 12 | −0.263 | [ |
| 25 | Interstitial lung disease | LRRC34 | rs6793295 | 3 | 169,518,455 | C | T | 1.30 | 19 | −0.228 | [ |
| 26 | Obesity | SH2B1 | rs7498665 | 16 | 28,883,241 | G | A | 1.07 | 27 | −0.210 | [ |
| 27 | Breast cancer | ANKLE1 | rs8100241 | 19 | 17,392,894 | G | A | 1.14 | 11 | −0.185 | [ |
| 28 | Systemic lupus erythematosus | BANK1 | rs10516487 | 4 | 102,751,076 | G | A | 1.38 | 8 | −0.184 | [ |
| 29 | Ovarian cancer | ANKLE1 | rs2363956 | 19 | 17,394,124 | T | G | 1.10 | 13 | −0.153 | [ |
| 30 | Rheumatoid arthritis | NFKBIE | rs2233434 | 6 | 44,232,920 | G | A | 1.20 | 32 | −0.114 | [ |
|
| Hypertension | LPL | rs328 | 8 | 19,819,724 | C | G | - | 28 | −0.031 |
|
| 31 | Type 2 diabetes | WFS1 | rs1801214 | 4 | 6,303,022 | T | C | 1.13 | 8 | 0.314 | [ |
| 32 | Type 2 diabetes | THADA | rs7578597 | 2 | 43,732,823 | T | C | 1.15 | 49 | 0.379 | [ |
| 33 | Psoriasis | IL13 | rs20541 | 5 | 131,995,964 | G | A | 1.27 | 8 | 0.872 | [ |
| 34 | Prostate cancer | MLPH | rs2292884 | 2 | 238,443,226 | G | A | 1.14 | 40 | 0.883 | [ |
adenotes the nsSNP has been detected by more than one GWAS in the catalog, we only report the first chronologically
bOR for ancestral allele
Fig. 2Integrated haplotype score distributions of the SNP sets corresponding to the 34 candidate protective nsSNPs. High LD SNP sets are computed for each candidate protective nsSNP, from which a distribution of the integrated haplotype scores is calculated and presented as boxplots. The distribution for LPL S447X is shown in green and those for the two AMD nsSNPs in magenta and cyan. The number identifying the nsSNP on the y-axis corresponds to the full description in Table 2
Fig. 3a Probabilities of the candidate protective variants altering their respective protein function. These probabilities are the Naïve Bayes posterior probability that the missense mutation is damaging based on the PolyPhen-2 HumDiv model. b Positive correlation between PolyPhen-2 and CADD scores