| Literature DB >> 32753590 |
Boram Park1, Jaehoon An1, Wonji Kim2, Hae Yeon Kang3, Sang Baek Koh4, Bermseok Oh5, Keum Ji Jung6, Sun Ha Jee6, Woo Jin Kim7, Michael H Cho8,9, Edwin K Silverman8,9, Taesung Park10,11, Sungho Won12,13,14.
Abstract
Smoking is a major risk factor for chronic obstructive pulmonary disease (COPD); however, more than 25% of COPD patients are non-smokers, and gene-by-smoking interactions are expected to affect COPD onset. We aimed to identify the common genetic variants interacting with pack-years of smoking on FEV1/FVC ratios in individuals with normal lung function. A genome-wide interaction study (GWIS) on FEV1/FVC was performed for individuals with FEV1/FVC ratio ≥ 70 in the Korea Associated Resource cohort data, and significant SNPs were validated using data from two other Korean cohorts. The GWIS revealed that rs10947231 and rs8192575 met genome-wide significant levels; For [Formula: see text] the likelihood ratio (LR) test was conducted, and its P values, PLR, for rs10947231 and rs8192575 were 2.23 × 10-12 and 1.18 × 10-8, respectively. Interaction between rs8192575 and smoking is significantly replicated with two additional data (PINT = 0.0454, 0.0131). Expression quantitative trait loci, topologically associated domains, and PrediXcan analyses revealed that rs8192575 is significantly associated with AGER expression. SNPs on the 6p21 region are associated with FEV1/FVC, and the effect of smoking on FEV1/FVC differs among the associated genotypes.Entities:
Mesh:
Year: 2020 PMID: 32753590 PMCID: PMC7403370 DOI: 10.1038/s41598-020-70092-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flow diagram for KARE cohort Fig. 1 explains how the individuals and SNPs were included and excluded. After quality controls and imputations, finally 7473 healthy individuals and 3,351,033 SNPs were used for analyses.
Descriptive statistics of healthy individuals Means of variables and their standard errors are calculated for continuous variables.
| KARE | GENIE | ARIRANG | |
|---|---|---|---|
| Participants | 7473 | 4413 | 513 |
| Age (years) | 52.0 ± 8.9 | 49.7 ± 6.8 | 59.0 ± 7.4 |
| Male | 3310 (44.3%) | 2507 (56.8%) | 208 (40.5%) |
| Female | 4163 (55.7%) | 1906 (43.2%) | 305 (59.5%) |
| Body mass index (kg/m2) | 24.7 ± 3.1 | 23.3 ± 2.8 | 25.3 ± 3.2 |
| Height (cm) | 159.8 ± 8.7 | 165.6 ± 7.9 | 157.5 ± 8.5 |
| Non-smokers | 4768 (63.8%) | 2520 (57.1%) | 369 (71.9%) |
| Former smokers | 1140 (15.3%) | 1380 (31.3%) | 65 (12.7%) |
| Current smokers | 1565 (20.9%) | 513 (11.6%) | 79 (15.4%) |
| Pack-years of smoking | 22.8 ± 17.2 | 16.2 ± 24.1 | 27.5 ± 15.3 |
| FEV1 (liters) | 2.9 ± 0.7 | 3.0 ± 0.6 | 2.4 ± 0.6 |
| FVC (liters) | 3.6 ± 0.9 | 3.7 ± 0.8 | 3.1 ± 0.8 |
| FEV1/FVC ratio | 81.4 ± 5.4 | 81.7 ± 5.5 | 79.2 ± 5.1 |
Figure 2Quantile–quantile plot and Manhattan plot from GWIS with healthy individuals in KARE data (A) is obtained from the comparison between observed P values quantiles relative and expected quantiles under the uniform distribution (null hypothesis). The variance inflation factor (VIF) was 1.002, suggesting our results are free of systematic P value inflation. (B) was plotted from the logarithms of the P values of 3,351,033 SNPs against its physical chromosomal position. The red line represents genome-wide significance level (5 × 10–8), and several SNPs located at 6p21 meet this significance level. The plot was generated by software R version 3.6.1 (R Foundation for Statistical Computing; Vienna, Austria).
Figure 3Genomic region on chromosome 6 near rs10947231 and rs8192575 Fig. 3 suggests that the rs10947231 and rs8192575 located at gene dense region. The plot was generated by software R version 3.6.1 (R Foundation for Statistical Computing; Vienna, Austria).
Results for rs10947231 and rs8192575 from discovery GWIS For FEV1/FVC, GWIS was performed on healthy individuals with 3,351,033 SNPs, and the genome-wide significant result are summarized. and are the coefficients for the main SNP and interaction effects between SNP and pack-years of smoking, respectively. Overall effects indicate P values (PLR) for testing the null hypotheses by F test.
| Data | Minor/Major alleles | MAF | Missing rate | INFO† | Overall effects (PLR) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Discovery | KARE | A/C | 0.165 | 0.767 | 0.015 | 0.983 | 0.835 (0.13) | 1.42 × 10–10 | 0.001 (0.007) | 0.8417 | 2.23 × 10–12 |
| Replication | GENIE | A/C | 0.18 | 0.902 | 0 | 0.999 | 0.178 (0.198) | 0.3678 | 0.026 (0.014) | 0.0684 | 0.0698 |
| ARIRANG | A/C | 0.17 | 0.638 | 0.016 | 0.985 | 0.967 (0,473) | 0.04151 | − 0.056 (0.03) | 0.0583 | 0.0714 | |
| Discovery | KARE | G/C | 0.15 | 0.549 | 0.027 | 0.976 | 0.821 (0.136) | 1.77 × 10–9 | − 0.02 (0.008) | 0.0165 | 1.18 × 10–8 |
| Replication | GENIE | G/C | 0.163 | 0.425 | 0 | 1 | 0.533 (0.21) | 0.0113 | − 0.031 (0.016) | 0.0454 | 0.0173 |
| ARIRANG | G/C | 0.176 | 0.357 | 0.006 | 0.996 | 0.702 (0.484) | 0.1471 | − 0.067 (0.027) | 0.0131 | 0.0457 | |
BP physical position (Based on hg19), MAF minor allele frequencies, HWE Hardy–Weinberg equilibrium, SE standard error.
†INFO is the imputation quality metric obtained from IMPUTE2.
Figure 4Declines of FEV1/FVC along age Changes in FEV1/FVC along age according to smoking status and rs8192575 were plotted by generalized additive models (GAM). Figure 4 suggests that smoking has a significant effect on FEV1/FVC ratios for healthy individuals. The plot was generated by software R version 3.6.1 (R Foundation for Statistical Computing; Vienna, Austria).
Figure 5The boxplot of the FEV1/FVC grouped by rs8192575, smoking status, and age from KARE data We visualized the effects of rs819275 on FEV1/FVC according to smoking status and age. The plot represents the effects of rs8192575 for healthy individuals, and the allele G of rs8192575 has positive effects if healthy individuals have not smoked. The red diamond symbols represent mean value, which is displayed as a number. The plot was generated by software R version 3.6.1 (R Foundation for Statistical Computing; Vienna, Austria).
Figure 6Estimated FEV1/FVC according to pack-years of smoking and rs8192575 The estimated FEV1/FVC ratio is generated by gene-by-smoking analysis using healthy individuals of KARE. Figure 6 implies that minor alleles G has negative impact in proportion to each increment of pack-years of smoking. The plot was generated by software R version 3.6.1 (R Foundation for Statistical Computing; Vienna, Austria).
Result of PrediXcan for FEV1/FVC ratio on KARE using lung tissue prediction model The results for PrediXcan for the top 10 genes on chromosome 6 are summarized.
| Gene | PrediXcan Beta | PrediXcan |
|---|---|---|
| AGER | − 2.797 | 8.59 × 10–6 |
| HLA-S | − 0.833 | 4.60 × 10–5 |
| C4B | − 0.736 | 0.0003 |
| MDGA1-201 | 1.312 | 0.0012 |
| HLA-J | 1.065 | 0.0019 |
| NKAPL | − 0.941 | 0.0042 |
| HLA-DPA1 | − 0.791 | 0.0052 |
| HCG4B | 0.716 | 0.0088 |
| TPT1P4 | 1.522 | 0.0094 |
| HLA-C | -0.549 | 0.0099 |
Independence test between rs8192575 and smoking variables We tested gene-smoking dependency. The association between rs8192575 and pack-years of smoking are analyzed by linear regression, and the association between rs8192575 and smoking status are analyzed by logistic regression. For KARE and ARIRANG data the smoking status indicates non- and ever-smokers. For GENIE data smoking status indicates non- and current smokers. P value greater than 0.05 suggests that there was no dependency between rs8192575 and smoking variables.
| DATA | |
|---|---|
| KARE | 0.659 |
| GENIE | 0.3294 |
| ARIRANG | 0.8955 |
| KARE | 0.7734 |
| GENIE | 0.2881 |
| ARIRANG | 0.7462 |