| Literature DB >> 28512340 |
Hsin-Chou Yang1, I-Chen Chen1,2, Yuh-Chyuan Tsay1, Zheng-Rong Li1, Chun-Houh Chen1, Hai-Gwo Hwu3,4, Chen-Hsin Chen5,6.
Abstract
Case-control genetic association studies typically ignore possible later disease onset in currently healthy subjects and assume that subjects with diseases equally contribute to the likelihood for inference, regardless of their onset age. Therefore, we used an event-history with risk-free model to simultaneously characterize alcoholism susceptibility and onset age in 65 independent non-Hispanic Caucasian males in the Collaborative Study on the Genetics of Alcoholism. Following data quality control, we analysed 22 single nucleotide polymorphisms (SNPs) on 12 candidate genes. The single-SNP analysis showed that the dominant minor allele of rs2134655 on DRD3 increases alcoholism susceptibility; the dominant minor allele of rs1439047 on NTRK2 delays the alcoholism onset age, but the additive minor allele of rs172677 on GRIN2B and the dominant minor allele of rs63319 on ALDH1A1 advance the alcoholism onset age; and the dominant minor allele of rs1079597 on DRD2 shortens the onset age range. Similarly, multiple-SNPs analysis revealed joint effects of rs2134655, rs172677 and rs1079597, with an adjustment for habitual smoking. This study provides a more comprehensive understanding of the genetics of alcoholism than previous case-control studies.Entities:
Mesh:
Year: 2017 PMID: 28512340 PMCID: PMC5434012 DOI: 10.1038/s41598-017-01791-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Kaplan–Meier (step function) and mixture regression (solid smooth curve) estimators of overall and conditional event curves for alcoholism onset age stratified by genotypes of a single SNP. (a) SNPs with an equal probability of susceptibility only: rs172677 and rs1439047, (b) SNPs with unequal probabilities of susceptibility only: rs2134655, (c) SNPs with unequal susceptibilities and onset age: rs63319 and rs1079597. Note that all the Kaplan–Meier curves are stratified by genotypes AA, Aa and aa while estimated mixture regression curves are stratified by genotypes AA and Aa + aa except the SNP rs172677.
Figure 2LD plot of 28 SNPs on 13 candidate genes. In the LD heat map, 13 genes are arranged according to their chromosomes. SNPs (RS numbers) within the same gene are arranged by physical positions and framed by a green rectangle. Pairwise LD of SNPs within the same gene was measured using D′ and r2 [50]. The colored rhombus reflects the D′ magnitude (higher LD, red) and the numbered rhombus indicates the r2 value. SNPs with a strong LD are framed in a black inverse diamond block, which is defined according to the confidence interval method[51]. Six SNPs with a star sign concatenated with the RS number are removed from the subsequent analysis because of a complete LD and/or HWE violation.
(A) Single-SNP analysis by using the logistic-AFT mixture regression model. (B) Corresponding bootstrap validation results based on 400 bootstrap samples of size 65.
| SNP [Gene] (# Subjects)a | Covariates (Genotypes) | Logistic Regression Submodel | AFT Submodel (Log-logistic Event Time Distribution) | LRTb for Mixture Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Location Regression Part | Scale Regression Part | ||||||||||||
| OR | 95% CI | p-value | Estimate | 95% CI | p-value | Estimate | 95% CI | p-value |
| d.f. | p-value | ||
|
| |||||||||||||
| rs172677 [ | Intercept | 1 | Referent ( | 3.33 | 3.15, 3.51 | Referent | −1.69 | −2.07, −1.31 | Referent | 6.22 | 2 | 0.045 | |
|
| 1 | −0.31 | −0.69, 0.07 | 0.107 | 0 | ||||||||
|
| 1 | 0.28 | −0.04, 0.61 | 0.087 | 0 | ||||||||
| rs172677 [ | Intercept | 1 | Referent ( | 3.62 | 3.38, 3.86 | Referent | −1.69 | −2.07, −1.32 | Referent | 6.22 | 1 | 0.013 | |
|
| 1 | −0.29 | −0.51, −0.08 | 0.006 | 0 | ||||||||
| rs1439047 [ | Intercept | 1 | Referent ( | 2.97 | 2.82, 3.12 | Referent | −2.34 | −3.05, −1.63 | Referent | 10.73 | 2 | 0.005 | |
|
| 1 | 0.52 | 0.27, 0.76 | <0.001 | 0.74 | −0.13, 1.61 | 0.093 | ||||||
| rs2134655 [ | Intercept | 1 | Referent ( | 3.37 | 3.20, 3.54 | Referent | −1.52 | −1.91, −1.13 | Referent | 4.87 | 1 | 0.027 | |
|
| 3.44 | 1.12, 10.59 | 0.031 | 0 | 0 | ||||||||
| rs63319 [ | Intercept | 1 | Referent ( | 3.68 | 3.43, 3.93 | Referent | −1.67 | −2.11, −1.24 | Referent | 6.81 | 2 | 0.033 | |
|
| 0.38 | 0.10, 1.49 | 0.166 | −0.41 | −0.73, −0.10 | 0.010 | 0 | ||||||
| rs1079597 [ | Intercept | 1 | Referent ( | 3.40 | 3.28, 3.52 | Referent | −1.40 | −1.82, −0.98 | Referent | 7.14 | 2 | 0.028 | |
|
| 0.28 | 0.07, 1.12 | 0.073 | 0 | −1.32 | −2.34, −0.30 | 0.011 | ||||||
|
| |||||||||||||
| rs172677 [ | Intercept | 1 | 3.61 | 3.30, 3.85 | −1.77 | −2.15, −1.44 | |||||||
|
| 1 | −0.29 | −0.50, −0.02 | 0 | |||||||||
| rs1439047 [ | Intercept | 1 | 3.02 | 2.82, 3.50 | −2.53 | −3.93, −0.95 | |||||||
|
| 1 | 0.47 | 0.05, 0.78 | 0.86 | −0.56, 2.29 | ||||||||
| rs2134655 [ | Intercept | 1 | 3.38 | 3.20, 3.57 | −1.56 | −1.91, −1.24 | |||||||
|
| 4.49 | 1.24, 12.56c | 0 | 0 | |||||||||
| rs63319 [ | Intercept | 1 | 3.65 | 3.39, 3.85 | −1.77 | −2.20, −1.39 | |||||||
|
| 0.51 | 0.07, 1.63 | −0.37 | −0.65, −0.06 | 0 | ||||||||
| rs1079597 [ | Intercept | 1 | 3.40 | 3.25, 3.53 | −1.44 | −1.86, −1.15 | |||||||
|
| 0.36 | 0.09, 0.92 | 0 | −1.39 | −2.23, −0.03 | ||||||||
aThe sample size for each single-gene analysis. bChi-squared statistic, degrees of freedom and p-value for the likelihood ratio test. cbootstrap percentile confidence intervals. Abbreviations: OR, odds ratio; CI, confidence interval; LRT, likelihood ratio test; d.f., degrees of freedom.
Multiple-SNPs analysis by using the logistic-AFT mixture regression model with chi-squared statistic 15.48 (p = 0.004) of the likelihood ratio test in the mixture model.
| SNP [Gene] | Covariates (Genotypes) | Logistic Regression Submodel | AFT Submodel (Log-logistic Event Time Distribution) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Location Regression Part | Scale Regression Part | |||||||||
| OR | 95% CI | p-value | Estimate | 95% CI | p-value | Estimate | 95% CI | p-value | ||
| rs2134655 [ | Intercept | 1 | Referent | |||||||
|
| 3.14 | 0.96, 10.32 | 0.059 | |||||||
| rs172677 [ | Intercept | 3.66 | 3.40, 3.92 | Referent | ||||||
|
| −0.29 | −0.53, −0.04 | 0.020 | |||||||
| rs1079597 [ | Intercept | −1.55 | −1.99, −1.11 | Referent | ||||||
|
| 0.36 | 0.08, 1.52 | 0.164 | −1.13 | −2.24, −0.03 | 0.044 | ||||
Abbreviations: OR, odds ratio; CI, confidence interval.
Analysis of multiple SNPs and habitual smoking by using the logistic-AFT mixture regression model with chi-squared statistic 21.30 (p < 0.001) of the likelihood ratio test in the mixture model.
| Factors SNP [Gene] | Covariates | Logistic Regression Submodel | AFT Submodel (Log-logistic Event Time Distribution) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Location Regression Part | Scale Regression Part | |||||||||
| OR | 95% CI | p-value | Estimate | 95% CI | p-value | Estimate | 95% CI | p-value | ||
| rs2134655 [ | Intercept | 1 | Referent ( | |||||||
|
| 3.33 | 1.08, 10.29 | 0.037 | |||||||
| rs172677 [ | Intercept | 3.85 | 3.67, 4.03 | Referent | ||||||
|
| −0.29 | −0.47, −0.11 | 0.002 | |||||||
| HS | −0.23 | −0.33, −0.13 | <0.001 | |||||||
| rs1079597 [ | Intercept | −1.64 | −2.05, −1.22 | Referent | ||||||
|
| −1.97 | −3.03, −0.92 | <0.001 | |||||||
Abbreviations: HS, Habitual smoking (Yes = 1, No = 0); OR, odds ratio; CI, confidence interval.
Figure 3Kaplan–Meier (step function) and multiple mixture regression (smooth curve) estimators of overall and conditional event curves in all habitual smokers for the alcoholism onset age by genotypes of three SNPs and habitual smoking.
Figure 4GAP clustering of three multiple SNPs (genes) and habitual smoking of 65 subjects.
Figure 5GAP clustering of the pathways of five identified significant genes from the KEGG for alcoholism.