| Literature DB >> 32583859 |
Pashupati P Mishra1,2,3, Ismo Hänninen1,2,3, Emma Raitoharju1,2,3, Saara Marttila1,2,3,4, Binisha H Mishra1,2,3, Nina Mononen1,2,3, Mika Kähönen2,5, Mikko Hurme4,6, Olli Raitakari7,8,9, Petri Törönen10, Liisa Holm10,11, Terho Lehtimäki1,2,3.
Abstract
Smoking as a major risk factor for morbidity affects numerous regulatory systems of the human body including DNA methylation. Most of the previous studies with genome-wide methylation data are based on conventional association analysis and earliest threshold-based gene set analysis that lacks sensitivity to be able to reveal all the relevant effects of smoking. The aim of the present study was to investigate the impact of active smoking on DNA methylation at three biological levels: 5'-C-phosphate-G-3' (CpG) sites, genes and functionally related genes (gene sets). Gene set analysis was done with mGSZ, a modern threshold-free method previously developed by us that utilizes all the genes in the experiment and their differential methylation scores. Application of such method in DNA methylation study is novel. Epigenome-wide methylation levels were profiled from Young Finns Study (YFS) participants' whole blood from 2011 follow-up using Illumina Infinium HumanMethylation450 BeadChips. We identified three novel smoking related CpG sites and replicated 57 of the previously identified ones. We found that smoking is associated with hypomethylation in shore (genomic regions 0-2 kilobases from CpG island). We identified smoking related methylation changes in 13 gene sets with false discovery rate (FDR) ≤ 0.05, among which is olfactory receptor activity, the flagship novel finding of the present study. Overall, we extended the current knowledge by identifying: (i) three novel smoking related CpG sites, (ii) similar effects as aging on average methylation in shore, and (iii) a novel finding that olfactory receptor activity pathway responds to tobacco smoke and toxin exposure through epigenetic mechanisms.Entities:
Keywords: Metylation; Tobacco smoke; epigenomics
Mesh:
Substances:
Year: 2020 PMID: 32583859 PMCID: PMC7340865 DOI: 10.1042/BSR20200596
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Population characteristics of Young Finns Study participants; data are mean ± SD or proportions
| Characteristics | Active smokers | Non-smokers |
|---|---|---|
| Number of subjects | 21 | 104 |
| Sex (%men) | 57% | 34% |
| Age, years | 45 (±3.95) | 44.1 (±3.1) |
| Body mass index, kg/m2 | 26.59 (±4.38) | 25.7 (±4.36) |
List of differentially methylated CpG sites with respect to smoking habit with false discovery rate (FDR) < 0.05 identified with model 1, their corresponding effect size, standard error, P-values, FDR, and genomic regions in relation to both gene region (TSS200, 0–200 bases upstream of the transcriptional start site; TSS1500, 200–1500 bases upstream of the TSS; 5′UTR, within the 5′ untranslated region, between the TSS and the ATG start site; Body, between the ATG and stop codon irrespective of the presence of introns, exons, TSS, or promoters; 3′UTR, between the stop codon and poly A signal) and CpG islands (Shores, 0–2 kb from CpG island; Shelves, 2–4 kb from CpG island)
| CpG Sites | Effect size | Standard error | FDR | Gene/Island name | Gene region/Relation to island | Chromosome | ||
|---|---|---|---|---|---|---|---|---|
| 1 | cg20295214 | 0.03 | 0.005 | 9.53 × 10-9 | 1.84 × 10-4 | AVPR1B | Body | 1 |
| 2 | cg08709672 | 0.02 | 0.004 | 3.21 × 10-6 | 2.82 × 10-2 | AVPR1B | 5′ UTR 1stExon | 1 |
| 3 | cg09935388 | 0.06 | 0.009 | 9.87 × 10-10 | 2.65 × 10-5 | GFI1 | Body | 1 |
| 4 | cg18316974 | 0.03 | 0.006 | 7.64 × 10-7 | 8.87 × 10-3 | GFI1 | Body | 1 |
| 5 | cg18146737 | 0.05 | 0.01 | 5.55 × 10-6 | 4.18 × 10-2 | GFI1 | Body | 1 |
| 6 | cg25189904 | 0.03 | 0.006 | 3.85 × 10-6 | 3.18 × 10-2 | GNG12 | TSS1500 | 1 |
| 7 | 0.02 | 0.005 | 2.18 × 10-6 | 2.03 × 10-2 | RAB3GAP2 | Body | 1 | |
| 8 | cg08869700 | 0.01 | 0.003 | 2.75 × 10-6 | 2.46 × 10-2 | RFX5 | Body | 1 |
| 9 | cg13898430 | 0.008 | 0.002 | 5.07 × 10-6 | 3.96 × 10-2 | RUNX3 | TSS1500 | 1 |
| 10 | cg04885881 | 0.02 | 0.004 | 1.52 × 10-7 | 2.43 × 10-3 | chr1:11119030 | Shelf | 1 |
| 11 | cg27537125 | 0.02 | 0.003 | 9.85 × 10-9 | 1.84 × 10-4 | – | OpenSea | 1 |
| 12 | cg12547807 | 0.01 | 0.002 | 6.36 × 10-7 | 7.84 × 10-3 | – | OpenSea | 1 |
| 13 | cg11555067 | 0.02 | 0.003 | 3.42 × 10-6 | 2.94 × 10-2 | INPP4A | 5′ UTR | 2 |
| 14 | 0.03 | 0.002 | 1.02 × 10-23 | 2.19 × 10-18 | chr2:233283397 | Island | 2 | |
| 15 | 0.05 | 0.004 | 2.46 × 10-21 | 3.53 × 10-16 | chr2:233283397 | Island | 2 | |
| 16 | 0.05 | 0.004 | 8.74 × 10-19 | 7.51 × 10-14 | chr2:233283397 | Island | 2 | |
| 17 | cg03329539 | 0.02 | 0.003 | 2.71 × 10-9 | 6.86 × 10-5 | chr2:233283397 | Shore | 2 |
| 18 | cg06644428 | 0.06 | 0.01 | 9.64 × 10-7 | 1.09 × 10-2 | chr2:233283397 | Island | 2 |
| 19 | cg21949194 | 0.02 | 0.003 | 6.06 × 10-6 | 4.49 × 10-2 | chr2:39351355 | Shelf | 2 |
| 20 | cg23079012 | 0.02 | 0.003 | 5.43 × 10-10 | 1.67 × 10-5 | – | OpenSea | 2 |
| 21 | cg09355027 ( | 0.02 | 0.004 | 1.43 × 10-6 | 1.46 × 10-2 | – | OpenSea | 2 |
| 22 | 0.03 | 0.004 | 9.59 × 10-12 | 3.75 × 10-7 | GPR15 | 1st Exon | 3 | |
| 23 | cg24719910 | 0.02 | 0.003 | 5.41 × 10-6 | 4.15 × 10-2 | TGFBR2 | Body | 3 |
| 24 | 0.10 | 0.006 | 8.53 × 10-28 | 3.66 × 10-22 | AHRR | Body | 5 | |
| 25 | 0.04 | 0.004 | 1.09 × 10-19 | 1.17 × 10-14 | AHRR | Body | 5 | |
| 26 | 0.04 | 0.004 | 5.82 × 10-13 | 3.57 × 10-8 | AHRR | Body | 5 | |
| 27 | 0.05 | 0.005 | 2.01 × 10-12 | 9.60 × 10-8 | AHRR | Body | 5 | |
| 28 | 0.03 | 0.003 | 4.53 × 10-12 | 1.95 × 10-7 | AHRR | Body | 5 | |
| 29 | 0.03 | 0.005 | 1.38 × 10-10 | 4.56 × 10-6 | AHRR | Body | 5 | |
| 30 | cg12806681 | 0.04 | 0.006 | 1.44 × 10-8 | 2.59 × 10-4 | AHRR | Body | 5 |
| 31 | cg03991871 | 0.05 | 0.009 | 1.12 × 10-7 | 1.85 × 10-3 | AHRR | Body | 5 |
| 32 | cg04551776 | 0.02 | 0.004 | 1.68 × 10-7 | 2.58 × 10-3 | AHRR | Body | 5 |
| 33 | cg23916896 | 0.05 | 0.009 | 3.73 × 10-7 | 5.17 × 10-3 | AHRR | Body | 5 |
| 34 | cg11554391 | 0.02 | 0.004 | 1.21 × 10-6 | 1.27 × 10-2 | AHRR | Body | 5 |
| 35 | cg01899089 | 0.02 | 0.003 | 4.16 × 10-6 | 3.31 × 10-2 | AHRR | Body | 5 |
| 36 | cg11902777 | 0.05 | 0.009 | 6.94 × 10-6 | 4.97 × 10-2 | AHRR | Body | 5 |
| 37 | 0.06 | 0.007 | 1.04 × 10-12 | 5.61 × 10-8 | – | OpenSea | 6 | |
| 38 | 0.03 | 0.004 | 6.12 × 10-11 | 2.19 × 10-6 | – | OpenSea | 6 | |
| 39 | cg14753356 | 0.02 | 0.003 | 3.15 × 10-9 | 7.52 × 10-5 | – | OpenSea | 6 |
| 40 | 0.02 | 0.003 | 6.49 × 10-9 | 1.39 × 10-4 | – | OpenSea | 6 | |
| 41 | cg12803068 | -0.09 | 0.01 | 2.41 × 10-7 | 3.45 × 10-3 | MYO1G | Body | 7 |
| 42 | cg04180046 | -0.03 | 0.005 | 6.39 × 10-7 | 7.84 × 10-3 | MYO1G | Body | 7 |
| 43 | cg22132788 | -0.05 | 0.007 | 1.58 × 10-6 | 1.58 × 10-2 | MYO1G | Body | 7 |
| 44 | cg07826859 | 0.01 | 0.003 | 1.94 × 10-6 | 1.89 × 10-2 | MYO1G | TSS1500 | 7 |
| 45 | cg21322436 | 0.01 | 0.003 | 4.06 × 10-6 | 3.29 × 10-2 | CNTNAP2 | TSS1500 | 7 |
| 46 | cg03450842 | 0.01 | 0.003 | 4.42 × 10-7 | 5.93 × 10-3 | ZMIZ1 | 5′ UTR | 10 |
| 47 | cg01901332 | 0.02 | 0.004 | 1.19 × 10-6 | 1.27 × 10-2 | ARRB1 | Body | 11 |
| 48 | cg21611682 | 0.02 | 0.003 | 7.56 × 10-10 | 2.17 × 10-5 | LRP5 | Body | 11 |
| 49 | cg14624207 | 0.01 | 0.003 | 6.91 × 10-6 | 4.97 × 10-2 | LRP5 | Body | 11 |
| 50 | cg07986378 | 0.02 | 0.005 | 2.49 × 10-6 | 2.28 × 10-2 | ETV6 | Body | 12 |
| 51 | 0.02 | 0.003 | 3.90 × 10-8 | 6.71 × 10-4 | C14orf43 | 5′ UTR | 14 | |
| 52 | 0.02 | 0.003 | 4.53 × 10-9 | 1.03 × 10-4 | ITPK1 | Body | 14 | |
| 53 | cg07069636 | 0.01 | 0.002 | 4.82 × 10-7 | 6.28 × 10-3 | chr16:30669107 | Shore | 16 |
| 54 | cg26038589 ( | 0.03 | 0.006 | 1.17 × 10-6 | 1.27 × 10-2 | CCDC55 | Body | 17 |
| 55 | cg10385208 ( | 0.01 | 0.003 | 3.81 × 10-6 | 3.18 × 10-2 | CWC25 | TSS1500 | 17 |
| 56 | cg19572487 | 0.02 | 0.004 | 8.07 × 10-9 | 1.65 × 10-4 | RARA | 5′ UTR | 17 |
| 57 | 0.05 | 0.004 | 1.00 × 10-17 | 7.17 × 10-13 | F2RL3 | Body | 19 | |
| 58 | cg03707168 | 0.02 | 0.004 | 7.10 × 10-7 | 8.47 × 10-3 | PPP1R15A | Body | 19 |
| 59 | cg17566560 | 0.02 | 0.004 | 2.13 × 10-6 | 2.03 × 10-2 | DLGAP4 | Body, 5′UTR | 20 |
| 60 | cg23110422 | 0.03 | 0.006 | 1.85 × 10-7 | 2.74 × 10-3 | ETS2 | Body | 21 |
Intergenic chromatin regions are called OpenSea. CpG sites retained in model 2 are highlighted with bold font. Statistical models: Model 1 involved adjustment for age, sex, body mass index, cell type proportions, and technical covariates (chip and array). Model 2 involved adjustment with two additional covariates, alcohol usage and socioeconomic status, in addition to those in model 1.
Figure 1Manhattan plot and quantile-quantile (Q-Q) plot of epigenome-wide association analysis of smoking habit
(A) Manhattan plot showing the P-values of genome-wide CpG sites. X-axis represents position of the CpG sites on each chromosome. Y-axis represents negative log10 of the P-values for the association. The dotted line indicates false discovery rate (FDR)-corrected significance threshold and the solid horizontal line represents Bonferroni-corrected significance threshold (experiment-wide significance). The annotations for the three novel CpG sites and corresponding genes (if annotation is known) are shown. (B) Quantile-quantile plot showing genomic inflation factor (lambda = 1.11) of the epigenome-wide association study. The genomic inflation factor (ratio of the median of the empirically observed distribution of the test statistic to the expected median) represents the extent of inflation and false positive rate in the results.
Molecular Signature Database based gene sets identified with false discovery rate (FDR) ≤ 0.05 in the present study
| Significant gene sets | Size (number of genes) of gene sets | GSZ-score | FDR | ||
|---|---|---|---|---|---|
| 1. | Regulation of smoothened signaling pathway involved in dorsal ventral neural tube (biological process) | 8 | 4.89 | 5.12 × 10-6 | 0.04 |
| 2. | Thrombin-activated receptor activity (molecular function) | 5 | 4.27 | 2.3 × 10-5 | 0.03 |
| 3. | Protein domain specific binding (molecular function) | 657 | 4.75 | 5.7 × 10-5 | 0.03 |
| 4. | Olfactory receptor activity (molecular function) | 321 | 9.96 | 6.09 × 10-5 | 0.03 |
| 5. | Semaphorin receptor activity (molecular function) | 9 | 4 | 6.85 × 10-5 | 0.03 |
| 6. | AP-2 adaptor complex binding (molecular function) | 7 | 4.22 | 1.15 × 10-4 | 0.03 |
| 7. | Nuclear receptor activity (molecular function) | 45 | 4.81 | 1.18 × 10-4 | 0.03 |
| 8. | G-protein coupled receptor activity (molecular function) | 725 | 7.21 | 2.28 × 10-4 | 0.05 |
| 9. | Aryl hydrocarbon receptor signaling (canonical, reactome) | 7 | 5.76 | 6.27 × 10-7 | 0.003 |
| 10. | SHI SPARC targets up (chemical and genetic perturbations) | 22 | 5.86 | 1.22 × 10-6 | 0.003 |
| 11. | RORA activates gene expression (canonical, reactome) | 17 | 4.99 | 1.68 × 10-5 | 0.03 |
| 12. | RARRXR pathway (canonical, biocarta) | 7 | 4.38 | 3.41 × 10-5 | 0.05 |
| 13. | UV response down | 138 | 4.50 | 0.001 | 0.05 |
Figure 2Box plots showing differences in average methylation of all CpG sites belonging to genes of the analyzed gene sets in smokers and non-smokers
X-axis represents smoking habit (active smokers and non-smokers). Y-axis represents mean methylation β-values of CpG sites belonging to genes of the analyzed gene sets.
Figure 3Effect of smoking on average methylation level in different genomic regions in our genome
Box plots showing smoking-related differences in average methylation levels in different genomic regions. X-axis represents smoking habit (active smokers and non-smokers). Y-axis represents average methylation β values of CpG sites belonging to different genomic regions. Abbreviations and definitions for gene regions: TSS200, 0–200 bases upstream of the transcriptional start site; TSS1500, 200–1500 bases upstream of the TSS; 5′UTR, within the 5′ untranslated region, between the TSS and the ATG start site; Body, between the ATG and stop codon irrespective of the presence of introns, exons, TSS, or promoters; 3′UTR, between the stop codon and poly A signal. Definitions for CpG islands: Shores, 0–2 kb from CpG island; Shelves, 2–4 kb from CpG island.