| Literature DB >> 23049750 |
Natsuhiko Kumasaka1, Masayuki Aoki, Yukinori Okada, Atsushi Takahashi, Kouichi Ozaki, Taisei Mushiroda, Tomomitsu Hirota, Mayumi Tamari, Toshihiro Tanaka, Yusuke Nakamura, Naoyuki Kamatani, Michiaki Kubo.
Abstract
Smoking is a major public health problem, but the genetic factors associated with smoking behaviors are not fully elucidated. Here, we have conducted an integrated genome-wide association study to identify common copy number polymorphisms (CNPs) and single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) in Japanese smokers (N = 17,158). Our analysis identified a common CNP with a strong effect on CPD (rs8102683; P=3.8 x 10(-42)) in the 19q13 region, encompassing the CYP2A6 locus. After adjustment for the associated CNP, we found an additional associated SNP (rs11878604; P=9.7 x 10(-30)) located 30 kb downstream of the CYP2A6 gene. Imputation of the CYP2A6 locus revealed that haplotypes underlying the CNP and the SNP corresponded to classical, functional alleles of CYP2A6 gene that regulate nicotine metabolism and explained 2% of the phenotypic variance of CPD (ANOVA F-test P=9.5 x 10(-52)). These haplotypes were also associated with smoking-related diseases, including lung cancer, chronic obstructive pulmonary disease and arteriosclerosis obliterans.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23049750 PMCID: PMC3458030 DOI: 10.1371/journal.pone.0044507
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Manhattan plot showing the significance of association for all CNPs and SNPs in the genome-wide CPD analysis.
The SNPs consist of both Illumina 610K chip and imputed HapMap SNPs. All CNPs and SNPs are plotted on the axis according to their positions on each chromosome against association with CPD on the axis (-value). SNPs and CNPs with -values are highlighted in green. A dark gray line shows the genome-wide significance level after Bonferroni's correction with a total of 2,312,503 SNPs and 4,256 CNPs.
Figure 2Signal plots before conditioning (a), after conditioning on rs8102683 (b) and after conditioning on rs8102683 and rs11878604 (c).
SNPs and CNPs are plotted on the axis according to their positions on chromosome 19 (NCBI Build 37; hg19) against association with CPD on the left axis (-value). SNPs from the 1000 genomes imputation (phase I; 2011-11-23) are indicated with circles, SNPs genotyped on the Illumina 610K chip are indicated with triangles and CNPs genotyped with PlatinumCNV using the raw signal intensity data from the chip are indicated with diamonds. SNPs in the commonly deleted region (shown by the gray shaded area) are imputed as tri-allelic SNPs with deletions (see Materials and Methods for details). All SNP associations are assessed using allele dosages (the number difference between the A and B alleles; see Materials and Methods), and the CNP associations are assessed using the posterior mean copy number dosage [8]. Recombination rates (cM/Mb) across the region are shown by the purple line plotted against the right axis. The most significant variant of the SNP or CNP for each panel is pink, and the surrounding SNPs and CNPs are color-coded to reflect the strength of LD with the top variant according to the Pearson's values.
Frequency and relative effect size of haplotype between rs11878604 (SNP) and rs8102683 (CNP).
| Haplotype | GWAS set | Replication set | |||||
| rs11878604 (SNP) | rs8102683 (CNP) | Haplotype Freq. | Relative effect size |
| Haplotype Freq. | Relative effect size |
|
| T | 1 copy | 41.41 |
| 41.8 |
| ||
| T | 0 copy | 19.22 | −4.00 (0.290) | 3.8×10−43 | 18.6 | −4.30(0.384) | 8.3×10−29 |
| C | 1 copy | 38.14 | −2.69 (0.237) | 9.4×10−30 | 38.4 | −2.63(0.309) | 2.3×10−17 |
| C | 0 copy | 0.76 | −5.05 (1.548) | 1.1×10−3 | 0.63 | −3.84(2.200) | 0.081 |
| C | 2 copies | 0.23 | −3.07 (2.900) | 0.30 | 0.24 | −2.62(3.612) | 0.47 |
| T | 2 copies | 0.28 | −0.29 (2.413) | 0.90 | 0.31 | −0.414(2.879) | 0.89 |
| Explained variance | 1.83% (ANOVA F-test | 2.23% (ANOVA F-test | |||||
Haplotypes consisting of alleles at rs11878604 [T/C] and haplotypic copy numbers at rs8102683 [0/1/2].
Haplotype frequencies and relative effect sizes are jointly evaluated using a haplotype-specific linear regression model as described previously [27]. All model parameters were estimated using a standard EM (expectation-maximization) algorithm [28].
Variance explained by these six haplotypes and its significance was assessed in the standard ANOVA (analysis of variance) framework.
Association of the haplotype between rs11878604 (SNP) and rs8102683 (CNP) for three dieseases.
| Haplotype | COPD (982 cases, 4,480 controls) | Lung Cancer (997 cases, 6,491 controls) | ASO (499 cases, 10,975 controls) | |||||||||||
| rs11878604 (SNP) | rs8102683 (CNP) | case (%) | ctrl (%) | OR* (95%CI) |
| case (%) | ctrl (%) | OR* (95%CI) |
| case (%) | ctrl (%) | OR* (95%CI) |
| |
| T | 1 copy | 51.65 | 39.61 |
| 50.30 | 41.15 |
| 45.69 | 41.11 |
| ||||
| T | 0 copy | 13.49 | 19.76 | 0.20(0.14–0.27) | 3.9×10−21 | 14.24 | 19.47 | 0.33(0.25–0.45) | 5.6×10−13 | 16.29 | 19.41 | 0.53(0.36–0.77) | 9.3×10−4 | |
| C | 1 copy | 33.95 | 39.45 | 0.38(0.30–0.49) | 1.2×10−13 | 34.53 | 38.12 | 0.55(0.44–0.69) | 1.7×10−7 | 36.71 | 38.19 | 0.72(0.54–0.97) | 0.028 | |
| C | 0 copy | 0.36 | 0.64 | 0.40(0.04–3.85) | 0.43 | 0.44 | 0.76 | 0.29(0.04–2.03) | 0.21 | 0.73 | 0.77 | 0.99(0.15–6.77) | 0.99 | |
| C | 2 copies | 0.21 | 0.23 | 3.28(0.11–95.47) | 0.49 | 0.23 | 0.25 | 1.25(0.06–27.88) | 0.89 | 0.24 | 0.23 | 0.77(0.02–31.39) | 0.89 | |
| T | 2 copies | 0.34 | 0.31 | 0.53(0.05–5.88) | 0.60 | 0.26 | 0.27 | 0.24(0.02–3.33) | 0.29 | 0.34 | 0.29 | 1.24(0.10–16.04) | 0.87 | |
| Liability variance† | 4.0% (Likelihood Ratio–test | 0.71% (Likelihood Ratio–test | 0.23% (Likelihood Ratio–test | |||||||||||