| Literature DB >> 28930629 |
Yan Zhang1, Qiuming He1, Ruizhong Zhang1, Hong Zhang1, Wei Zhong1, Huimin Xia1.
Abstract
Hischsprung disease (HSCR) is an intestinal disorder with strong genetic components. RET was considered as the strongest contributor. Multiple single nucleotide polymorphisms (SNP) were demonstrated as associated with HSCR in different populations. However, whether the associations of reported SNPs derived from one causal variants or congregations of multiple variants were still not clear. In this study, we successfully genotyped 16 SNPs in RET with a largest case-control study to date, totaling 1470 HSCR and 1473 control subjects in South Chinese population. Multiple independent contributors were identified through pairwise and stepwise logistic regression. The intragenic synergistic effect among these SNPs were further explored and cross validated by logistic regression and multifactor dimensionality reduction (MDR). Noteworthy, in further subclinical manifestation analysis, the six potential independent contributors in RET were more essential for the patients with short-segment aganglionosis (S-HSCR). Although functional evaluations are required, our comprehensive analysis for RET gene integrating detailed disease subphenotypes might facilitate improved understanding for the genetic understanding of HSCR etiology.Entities:
Keywords: Hirschsprung disease; association; epistasis; subclinical stratification
Mesh:
Substances:
Year: 2017 PMID: 28930629 PMCID: PMC5636671 DOI: 10.18632/aging.101294
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Replication results on sixteen selected SNPs in RET in South Chinese population using 1470 cases and 1473 controls
| CHR | SNP | BP | Func.refgene | Gene.refgene | A1/A2 | F_A | F_U | P | P_adj | OR |
|---|---|---|---|---|---|---|---|---|---|---|
| 10 | rs2506030 | 42952399 | intergenic | G/A | 0.80 | 0.73 | 3.2E-09 | 1.9E-07 | 1.41(1.24~1.61) | |
| 10 | rs10900297 | 43077063 | upstream | C/A | 0.82 | 0.76 | 2.0E-03 | 2.8E-03 | 1.44(1.13~1.82) | |
| 10 | rs2506011 | 43079488 | intronic | T/C | 0.81 | 0.66 | 1.1E-39 | 5.3E-32 | 2.19(1.92~2.49) | |
| 10 | rs2435357 | 43086608 | intronic | T/C | 0.71 | 0.44 | 9.5E-94 | 1.6E-65 | 2.80(2.49~3.15) | |
| 10 | rs2435356 | 43087702 | intronic | A/G | 0.70 | 0.44 | 2.6E-88 | 3.7E-62 | 2.71(2.41~3.05) | |
| 10 | rs2505532 | 43099097 | intronic | C/T | 0.75 | 0.56 | 1.2E-50 | 2.9E-38 | 2.21(1.96~2.49) | |
| 10 | rs2565206 | 43100333 | intronic | G/T | 0.93 | 0.88 | 1.1E-13 | 1.9E-11 | 1.98(1.63~2.42) | |
| 10 | rs1800858 | 43100520 | exonic | A/G | 0.69 | 0.43 | 1.2E-85 | 5.8E-60 | 2.62(2.34~2.95) | |
| 10 | rs1800860 | 43111239 | exonic | G/A | 0.84 | 0.78 | 6.3E-09 | 2.3E-08 | 1.51(1.30~1.74) | |
| 10 | rs2742234 | 43117161 | intronic | C/T | 0.69 | 0.47 | 4.7E-61 | 6.9E-46 | 2.35(2.09~2.64) | |
| 10 | rs1800861 | 43118395 | exonic | G/T | 0.69 | 0.47 | 5.2E-64 | 7.2E-48 | 2.36(2.10~2.65) | |
| 10 | rs17158558 | 43124887 | exonic | C/T | 0.98 | 0.98 | 0.09 | 0.35 | 1.22(0.81~1.86) | |
| 10 | rs2742236 | 43125103 | intronic | G/A | 0.81 | 0.68 | 5.8E-30 | 3.0E-25 | 2.00(1.76~2.28) | |
| 10 | rs7893332 | 43157312 | intronic | T/G | 0.92 | 0.88 | 5.5E-07 | 1.9E-06 | 1.57(1.30~1.89) | |
| 10 | rs1254958 | 43206695 | intronic | T/C | 0.69 | 0.47 | 1.2E-65 | 2.6E-48 | 2.36(2.10~2.65) | |
| 10 | rs2505526 | 43274443 | intergenic | G/A | 0.66 | 0.51 | 1.1E-33 | 6.8E-28 | 1.90(1.69~2.13) |
CHR: Chromosome; SNP: Single Nucleotide Polymorphism; BP: Base pair of where the SNP is located. Func.refgene: The function role of SNP in the gene. Gene.refgene: The gene where the SNP located to; A1/A2 indicates the risk allele and protective allele to disease; F_A/F_U indicates risk allele frequency of the SNP in cases or controls. The P value indicates the significance based on allelic association tests. P_adj indicates the significance based on the logistic regression tests adjusting the potential difference of age and gender for each sample. The calculation of odds ratio (OR) is also based on the risk allele of each SNP.
Independence test by adjusting for the effects of other SNPs in the RET region
| SNP | SNP whose effect was adjusted for * | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| rs2506030 | rs2506011 | rs2435357 | rs2505532 | rs2565206 | rs1800860 | rs2742234 | rs2742236 | rs7893332 | rs2505526 | |
| 2.1E-13 | 4.4E-02 | 2.1E-05 | 4.1E-06 | 1.6E-07 | 7.0E-04 | 1.3E-08 | 4.3E-09 | 5.9E-06 | ||
| rs2506011 | 5.0E-39 | 3.2E-02 | 1.9E-40 | 7.7E-30 | 2.3E-04 | 6.0E-17 | 1.1E-30 | 2.2E-14 | ||
| 3.4E-69 | 5.0E-41 | 4.7E-31 | 3.0E-64 | 3.2E-67 | 2.5E-21 | 4.7E-51 | 2.0E-68 | 1.2E-46 | ||
| rs2505532 | 1.3E-41 | 1.3E-12 | 1.6E-53 | 1.3E-40 | 2.9E-05 | 1.6E-27 | 1.2E-39 | 4.7E-20 | ||
| rs2565206 | 4.4E-10 | 7.7E-18 | 4.5E-23 | 3.6E-06 | 3.9E-03 | 4.0E-04 | 3.0E-13 | 1.0E-08 | ||
| rs1800860 | 3.2E-07 | 1.3E-03 | 4.7E-03 | 2.9E-04 | 1.6E-02 | 1.6E-05 | 4.8E-05 | 3.1E-02 | ||
| rs2742234 | 1.3E-47 | 3.8E-22 | 6.8E-14 | 8.3E-45 | 7.6E-45 | 2.3E-28 | 1.8E-48 | 4.5E-24 | ||
| rs2742236 | 1.7E-27 | 1.2E-07 | 7.0E-09 | 1.7E-20 | 2.3E-25 | 1.4E-33 | 1.4E-09 | |||
| rs7893332 | 4.1E-07 | 2.3E-07 | 3.7E-02 | 7.7E-12 | 4.7E-01 | |||||
| rs2505526 | 5.0E-28 | 3.4E-11 | 3.2E-05 | 2.3E-28 | 3.6E-25 | 1.3E-13 | 1.1E-26 | |||
* The data in each column represent the remaining effect of association (P-values) after adjusting for the effect of SNP(s) on the top row of each column
Significant results from stepwise logistic regression of the six SNPs in the RET region
| SNPs* | OR (CI 0.95) | |
|---|---|---|
| rs2435357_T | 2.5E-10 | 3.31(2.29~4.81) |
| rs2505532_C | 0.011 | 1.49(1.09~2.03) |
| rs1800860_G | 0.024 | 1.33(1.04~1.71) |
| rs2742234_C | 0.008 | 1.55(1.12~2.14) |
| rs2742236_G | 0.055 | 1.39(0.99~1.95) |
SNPs significantly improved the disease association model was shown
Association of the haplotypes derived from six independent SNPs in RET identified in current study
| rs2506030 | rs2435357 | rs2505532 | rs1800860 | rs2742234 | rs2742236 | F_A | F_U | OR | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Risk haplotype* | 2.12E-75 | |||||||||
| G | T | C | G | C | G | 0.56 | 0.35 | 5.18E-54 | 2.37(2.05~2.76) | |
| A | T | C | G | C | G | 0.1 | 0.08 | 7.77E-03 | 1.29(1.00~1.67) | |
| Protective haplotype* | ||||||||||
| G | C | T | G | T | A | 0.1 | 0.19 | 9.35E-22 | 0.46(0.37~0.57) | |
| A | C | T | G | T | G | 0.04 | 0.07 | 7.39E-10 | 0.46(0.33~0.65) | |
| G | C | T | A | T | G | 0.05 | 0.07 | 5.40E-05 | 0.62(0.45~0.85) | |
| G | C | C | A | T | A | 0.02 | 0.05 | 5.65E-06 | 0.50(0.33~0.75) | |
* Haplotypes with minor haplotype frequency in controls larger than 0.05 were shown.
F_A/F_U indicates risk haplotype frequency of the SNP in cases or controls. The P value indicates the significance based on haplotypic association tests.
P_OMNIBUS indicates the omnibus association significance across all the risk haplotypes and protective haplotypes.
Pair-wise epistatic interacting results among six independent variants in RET done by logistic regression and Multifactor dimensionality reduction (MDR)
| rs2506030 | rs2435357 | rs2505532 | rs1800860 | rs2742234 | rs2742236 | ||
|---|---|---|---|---|---|---|---|
| Logistic Regression | |||||||
| rs2506030 | Multiple Dimension Reduction | NA | OR=1.36(1.14 ~1.63) | OR=1.41(1.17~1.69) | P=0.32 | OR=1.23(1.03~1.47) | OR=1.32(1.07~1.62) |
| rs2435357 | CVC=10/10 BA=0.68 | NA | OR=1.71(1.44~2.02) | P=0.2 | OR=1.73(1.47~2.03) | OR=1.50(1.24~1.80) | |
| rs2505532 | CVC=10/10 BA=0.64 | CVC=10/10 BA=0.69 | NA | OR=1.64(1.35~2.01) | OR=1.97(1.66~2.34) | OR=1.87(1.56~2.23) | |
| rs1800860 | CVC=10/10 BA=0.57 | CVC=10/10 BA=0.69 | CVC=10/10 BA=0.65 | NA | P=0.16 | OR=1.28(1.03~1.59) | |
| rs2742234 | CVC=10/10 BA=0.66 | CVC=10/10 BA=0.69 | CVC=10/10 BA=0.67 | CVC=10/10 BA=0.66 | NA | OR=1.83(1.49~2.25) | |
| rs2742236 | CVC=10/10 BA=0.61 | CVC=10/10 BA=0.68 | CVC=10/10 BA=0.65 | CVC=10/10 BA=0.61 | CVC=10/10 BA=0.67 | NA | |
OR means odds ratio for interaction, and a value of 1.0 indicates no effect. Cross-validation consistency (CVC) reflects the number of times MDR analysis identified the same model as the data were divided into different segments. Balanced accuracy is defined as (sensitivity + specificity)/2.
The association results of six independent SNPs in RET to different subclinical features classified by aganglionosis length including short-length(S-HSCR), long-length (L-HSCR) and TCA
| SNP | S-HSCR | L-HSCR | TCA | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1033 cases | 294 cases | 82 cases | |||||||||
| A1/A2 | F_A | OR | F_A | OR | F_A | OR | |||||
| rs2506030 | G/A | 6.40E-03 | 0.81 | 1.70E-08 | 1.52 | 0.8 | 1.40E-02 | 1.33 | 0.71 | 0.42 | 0.86 |
| rs2435357 | T/C | 4.20E-05 | 0.74 | 5.10E-64 | 3.16 | 0.68 | 7.00E-19 | 2.55 | 0.59 | 4.10E-04 | 1.87 |
| rs2505532 | C/T | 5.30E-03 | 0.77 | 8.10E-37 | 2.38 | 0.72 | 1.20E-10 | 2.01 | 0.68 | 5.70E-03 | 1.67 |
| rs1800860 | G/A | 6.90E-02 | 0.85 | 9.70E-09 | 1.59 | 0.83 | 5.10E-03 | 1.43 | 0.8 | 0.7 | 1.09 |
| rs2742234 | C/T | 2.40E-03 | 0.71 | 2.60E-45 | 2.58 | 0.66 | 5.10E-12 | 2.03 | 0.62 | 1.40E-03 | 1.76 |
| rs2742236 | G/A | 1.40E-03 | 0.83 | 6.70E-27 | 2.27 | 0.77 | 8.50E-05 | 1.56 | 0.77 | 1.80E-02 | 1.62 |
P_Quantitative: The patient-only linear regression test among three subclinical groups including S-HSCR, L-HSCR and TCA; F_A indicates risk allele frequency of the SNP in each subclinical group.