Epigenetic signatures of starting and stopping smoking.

Daniel L McCartney1, Anna J Stevenson1, Robert F Hillary1, Rosie M Walker2, Mairead L Bermingham1, Stewart W Morris1, Toni-Kim Clarke3, Archie Campbell1, Alison D Murray4, Heather C Whalley3, David J Porteous2, Peter M Visscher5, Andrew M McIntosh6, Kathryn L Evans2, Ian J Deary7, Riccardo E Marioni8.   

Abstract

BACKGROUND: Multiple studies have reported robust associations between differential DNA methylation and exposure to cigarette smoke. However, whether a DNA methylation phenotype is established immediately upon exposure or only after prolonged exposure is less well established. Here, we assess DNA methylation patterns from peripheral blood samples in current smokers in response to dose and duration of exposure, along with the effects of smoking cessation on DNA methylation in former smokers.
METHODS: Dimensionality reduction was applied to DNA methylation data at 90 previously identified smoking-associated CpG sites for over 4900 individuals in the Generation Scotland cohort. K-means clustering was performed to identify clusters associated with current and never smoker status based on these methylation patterns. Cluster assignments were assessed with respect to duration of exposure in current smokers (years as a smoker), time since smoking cessation in former smokers (years), and dose (cigarettes per day).
FINDINGS: Two clusters were specified, corresponding to never smokers (97·5% of whom were assigned to Cluster 1) and current smokers (81·1% of whom were assigned to Cluster 2). The exposure time point from which >50% of current smokers were assigned to the smoker-enriched cluster varied between 5 and 9 years in heavier smokers and between 15 and 19 years in lighter smokers. Low-dose former smokers were more likely to be assigned to the never smoker-enriched cluster in the first year following cessation. In contrast, a period of at least two years was required before the majority of former high-dose smokers were assigned to the never smoker-enriched cluster.
INTERPRETATION: Our findings suggest that smoking-associated DNA methylation changes are a result of prolonged exposure to cigarette smoke, and can be reversed following cessation. The length of time in which these signatures are established and recovered is dose dependent. Should DNA methylation-based signatures of smoking status be predictive of smoking-related health outcomes, our findings may provide an additional criterion on which to stratify risk.
Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

Keywords:  DNA methylation; Epidemiology; Epigenetics; Smoking

Year:  2018        PMID: 30389506      PMCID: PMC6286188          DOI: 10.1016/j.ebiom.2018.10.051

Source DB:  PubMed          Journal:  EBioMedicine        ISSN: 2352-3964            Impact factor:   8.143


Evidence before this study

The effects of cigarette smoking on DNA methylation have been well established. However, fewer studies have investigated: (1) how long these effects persist upon cessation; (2) the extent to which they can be reversed; and (3) how long it takes for such smoking-based methylation patterns to appear.

Added value of this study

We show the extent to which smoking-associated DNA methylation profiles are time- and dose-dependent in current smokers. In addition, we demonstrate the reversibility of these changes in former smokers is dependent on time since cessation and dose prior to quitting. To our knowledge, this is currently the largest study of DNA methylation in former smokers. Furthermore, the broad age range of our cohort has permitted us to investigate DNA methylation in both recent and long-term smokers (from <1 to >50 years as a smoker), and recent and long-term quitters (from <1 to >35 years since quitting).

Implications of all the available evidence

The establishment of smoking-associated DNA alterations provides an important public health message as a deterrent from smoking initiation. Furthermore, our reports on the dose-dependency and reversibility of these changes may encourage a reduction in the cigarette intake of current smokers (if not cessation), and an incentive against relapse in former smokers.

Background

Cigarette smoking is among the leading causes of illness and premature death worldwide [1]. In addition to multiple cancers [2], it is a major risk factor for cardiovascular and respiratory disorders [3,4]. Recent studies suggest that altered DNA methylation may play an important role in the biological pathways linking smoking to adverse health outcomes [5–9]. DNA methylation is an epigenetic modification, typically characterised by the addition of a methyl group to a cytosine–guanine dinucleotide (CpG). Both genetic and environmental factors can modulate DNA methylation levels, which in turn can regulate gene expression [10]. To date, the most informative environmental correlate of DNA methylation has been cigarette smoking. Multiple epigenome–wide association studies (EWAS) have been performed on smoking, using either status (e.g. current smoker, former smoker, never smoker) or intake (e.g. pack years) as the trait of interest [5,7,9], identifying thousands of smoking–associated loci. Moreover, cohort studies have reported altered DNA methylation in the offspring of women who smoked during pregnancy [11–13]. These analyses have identified a large number of loci where methylation is altered by exposure to cigarette smoke, with the cg05575921 locus in the aryl hydrocarbon receptor (AHR) repressor (AHRR) gene being among the most robustly implicated [6–8,13]. The relationship between exposure to cigarette smoke and DNA methylation changes has been widely reported. However, when these effects are established in smokers and whether they can be reversed by cessation are not well understood. Studying the mechanics of smoking–associated DNA methylation changes may provide a novel means of identifying risk of smoking–related morbidities. We investigated the extent to which smoking–associated DNA methylation changes were associated with duration of exposure in current smokers and time since cessation in former smokers.
We examined the relationship between DNA methylation from peripheral blood samples and smoking in a cohort of over 4900 individuals, incorporating self–reported years as a smoker and cigarettes per day as metrics for duration of exposure and dose, respectively.

Methods

The Generation Scotland cohort

Details of the Generation Scotland: Scottish Family Health Study (GS:SFHS) have been described previously [14,15]. DNA samples were collected for genotype– and DNA methylation–profiling along with detailed clinical, lifestyle, and sociodemographic data. The current study comprised 4905 individuals from the cohort for whom both DNA methylation and smoking data were available. A summary of variables assessed in this analysis is presented in Table 1.
Table 1

Summary of the Generation Scotland cohort and variables assessed. Sample numbers are presented for each variable (N) along with mean and standard deviation (SD) values, where applicable.

Variable                                            N     Mean   SD
Sex
  Males                                             1872
  Females                                           3033
Age (years)                                         4905  48·5   13·9

Smoking status
  Current smoker                                    917
  Former smoker                                     1466
  Never smoker                                      2522

Smoking variables (a)
  Cigarettes per day (current and former smokers)   2177  11·1   9·8
  Cigarettes per day (current smokers)              859   15·2   9·8
  Cigarettes per day (former smokers)               1318  8·5    8·8
  Pack years (current and former smokers)           2037  15·9   16·8
  Pack years (current smokers)                      854   23·3   19·7
  Pack years (former smokers)                       1183  10·6   11·8
  Years as a smoker (current and former smokers)    2221  27·7   13·4
  Years as a smoker (current smokers)               917   28·7   13·2
  Years as a smoker (former smokers)                1304  27·0   13·5
  Years since cessation (former smokers only)       1324  9·0    10·0

(a) Information relating to dose and time since cessation/duration of exposure was not available for all current and former smokers.

GS:SFHS smoking data were collected using two different questionnaires. The first version of the questionnaire was answered by 2158/4905 (44·0%) of the participants and collected data on absolute values with respect to number of cigarettes smoked, and age started/stopped smoking. The second version of the questionnaire, which was answered by the remaining 2747 individuals in the analysis sample (56·0%), collected data using binned intervals. In order to harmonise the two sets of measurements, mid–point interval estimates were calculated for the ordinal data from the second version of the questionnaire (e.g., 17 years of exposure was assigned to individuals who had reported smoking between 15 and 19 years). Second–hand smoking status was assigned based on whether participants reported exposure to cigarette smoke at home, work or elsewhere, or whether they reported cohabiting with a smoker. Both questionnaires can be accessed from the GS:SFHS website (www.generationscotland.co.uk). In the current study, exposure data were placed into ten five–year bins from 0–4 years to 45–49 years (N ≥ 32 per bin), with the longest exposure defined as ≥50 years (N = 23). Data on time since cessation were placed into five–year bins from 10–14 years to 30–34 years (N ≥ 48 per bin). The longest time since cessation was defined as ≥35 years (N = 53), whereas the most recent cessation time points (0–9 years) were presented as yearly intervals (N ≥ 26). Sample counts at each exposure and cessation time point are presented in Supplementary Tables 1–2.
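The mid–point harmonisation described above can be sketched as follows. This is a minimal illustration in Python rather than the authors' code (the study's analyses were run in R); the function name, the inclusive-bin convention and the example bins are assumptions made for the example.

```python
def bin_midpoint(lower: int, upper: int) -> float:
    """Return the mid-point of an inclusive interval, e.g. 15-19 years -> 17."""
    if upper < lower:
        raise ValueError("upper bound must not be below lower bound")
    return (lower + upper) / 2


# Hypothetical binned questionnaire responses (years as a smoker).
binned_responses = [(0, 4), (15, 19), (45, 49)]
midpoints = [bin_midpoint(lo, hi) for lo, hi in binned_responses]
print(midpoints)
```

The 15–19 year bin maps to 17 years, matching the example given in the text.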

Ethics

All components of GS:SFHS received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89). GS:SFHS has also been granted Research Tissue Bank status by the Tayside Committee on Medical Research Ethics (REC Reference Number: 10/S1402/20), providing generic ethical approval for a wide range of uses within medical research.

GS:SFHS DNA methylation

Genome–wide DNA methylation was profiled from peripheral blood samples in 5200 individuals using the Illumina HumanMethylationEPIC BeadChip. Quality control was conducted in R [16]. ShinyMethyl was used to plot the log median intensity of methylated versus unmethylated signal per array, with outliers excluded upon visual inspection [17]. WateRmelon was used to remove (1) samples where ≥1% of CpGs had a detection p–value in excess of 0·05, (2) probes with a beadcount of less than three in more than five samples, and (3) probes where ≥0·5% of samples had a detection p–value in excess of 0·05 [18]. Methylation β-values were calculated using the dasen() normalisation method. Briefly, the dasen method performs background adjustment and quantile normalises Type I and Type II probes separately. From these, M-values were calculated using the Beta2M() function in wateRmelon [18]. ShinyMethyl was used to exclude samples where predicted sex did not match recorded sex. Ten saliva–derived samples and three samples from individuals who had answered “yes” for all self–reported conditions (e.g. stroke, Alzheimer's disease, depression) were also excluded. Further details on these conditions are available in the GS:SFHS questionnaire, accessible from the GS:SFHS website (http://www.generationscotland.co.uk). This left a sample of 5088 participants with blood–derived samples available for analysis, of whom 4905 had smoking data available.

Statistical analysis

All analyses were performed in R [16]. Data–driven cluster analysis was performed on the top 100 p–value–ranked methylation sites from a recent, large meta–analysis EWAS of current versus never smoking (Joehanes et al. Supplementary Table 1, Sheet 02) [5,19]. Ninety of the top 100 probes were present in the GS:SFHS DNA methylation dataset following quality control (Supplementary Table 3). Clusters were visualised by plotting the first two principal coordinates, identified via data reduction analysis (multidimensional scaling), using the cmdscale() function in the stats package [16,19]. K–means clustering was performed to partition the data, using the kmeans() function in the stats package [16]. As the probe set under consideration was associated with current/never smoker status, two clusters were specified. Logistic regression was performed to assess the relationship between a genetic variant in the CHRNA5–A3–B4 gene cluster that is associated with heaviness of smoking (rs1051730) and cluster assignment in current smokers, adjusting for sex [20]. The relationships between cluster assignment and batch, sex, and passive smoking were assessed using Chi–Squared Tests. The relationship between cluster assignment and alcohol consumption (current, former, and never drinker) was assessed using a Fisher's Exact Test. The relationships between cluster assignment and time since cessation (former smokers) and duration of exposure (current smokers) were assessed using logistic regression, adjusting for sex, age and dose. Data were visualised using “broken stick” regression lines using the default parameters for the segmented() function in the segmented package in R [21]. Comprehensive smoking index (CSI) values were calculated for former smokers using the method described by Dietrich and Hoffman, using a half-life estimate of 1.5 [22].
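To make the two-cluster partitioning step concrete, the sketch below implements Lloyd's algorithm for k = 2 in plain Python. It is an illustration only: the study used the kmeans() function in R, whose default algorithm and random initialisation differ from this sketch, and the deterministic start, the function name and the toy two-dimensional points (standing in for the 90-probe methylation profiles) are all assumptions.

```python
import math


def kmeans_two_clusters(points, max_iter=100):
    """Lloyd's algorithm with k = 2 on a list of equal-length numeric tuples."""
    # Deterministic start (an assumption): the first point and the point
    # farthest from it serve as the initial centroids.
    first = points[0]
    farthest = max(points, key=lambda p: math.dist(p, first))
    centroids = [first, farthest]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min((0, 1), key=lambda k: math.dist(p, centroids[k])) for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Toy data: two well-separated groups of hypothetical profiles.
toy = [(0.1, 0.2), (0.0, 0.1), (0.2, 0.0), (2.0, 2.1), (2.2, 1.9), (1.9, 2.0)]
labels, centroids = kmeans_two_clusters(toy)
print(labels)
```

Cluster labels are arbitrary; in the study, clusters were interpreted afterwards according to which smoking-status group they were enriched for.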

Results

Descriptive data for the 917 current–, 1466 ex–, and 2522 never–smokers are summarised in Table 1. On average, current smokers had a greater duration of exposure compared to former smokers (28·7 years vs 27·0 years), and a greater cumulative dose (23·3 pack years vs 10·6 pack years).

Clustering of current smokers depends on dose and duration smoked

Data reduction was performed on DNA methylation data for 90 smoking-associated sites (multidimensional scaling; Fig. 1). Of the 2522 never smokers, 2459 (97·5%) were assigned to a never smoker–enriched cluster whereas, of the 917 current smokers, 744 (81·1%) were assigned to a smoker–enriched cluster (K-means clustering with two clusters specified). There was no association between misclassification of current smokers to the never smoker–enriched cluster and sex, alcohol consumption, batch, or genotype at the well–established nicotine addiction genetic variant rs1051730 (P ≥ 0·103; Supplementary Table 4) [20]. Similarly, there was no association between misclassification of never smokers (N = 63) to the smoker–enriched cluster and exposure to second–hand smoke, sex, alcohol consumption, or plate processing batch (P ≥ 0·179; Supplementary Table 4). The proportion of current smokers assigned to the smoker–enriched cluster increased with years as a smoker (Fig. 2 and Supplementary Table 1; OR for assignment to the smoker-enriched cluster = 1.07 per year of smoking; 95% CI = 1.03–1.12; P = 2.4 × 10^−4). A significant association was also present between cluster assignment and cigarettes per day (OR for assignment to the smoker-enriched cluster = 1.12 per cigarette smoked per day; 95% CI = 1.09–1.15; P = 3.9 × 10^−14). Of the 32 individuals who reported smoking for 0–4 years prior to DNA methylation sampling, seven (21·9%) were assigned to the smoker–enriched cluster; for the 76 individuals who reported smoking for 5–9 years prior to sampling, 34 (44·7%) were assigned to the smoker–enriched cluster. The proportion of assignments to the smoker–enriched cluster increased to 87·3% for current smokers at 20–24 years of exposure, remaining stable thereafter. Of the 670 current smokers reporting at least 20 years of exposure, 605 (90·3%) were assigned to the smoker–enriched cluster.
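As an aid to reading the per-unit odds ratios above: under a logistic model, a per-unit odds ratio compounds multiplicatively, so the implied odds ratio over several units is the per-unit value raised to that power. The short Python sketch below illustrates this arithmetic; it assumes the log-linear relationship of the fitted model holds across the range considered, and the function name is hypothetical.

```python
def odds_ratio_over(or_per_unit: float, units: float) -> float:
    """Odds ratio implied over `units` units, given a per-unit odds ratio."""
    return or_per_unit ** units


# Reported estimate: OR = 1.07 per year of smoking (current smokers).
ten_year_or = odds_ratio_over(1.07, 10)
print(round(ten_year_or, 2))
```

On this reading, ten additional years of smoking roughly doubles the odds of assignment to the smoker–enriched cluster, holding the model's other covariates fixed.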
Fig. 1

Principal coordinate vectors 1 and 2 from a multidimensional scaling analysis of 90 smoking-associated probes. Points and ellipses are coloured by smoking status (blue circles = current smokers, orange triangles = former smokers, purple crosses = never smokers). Ellipses represent normal confidence ellipses.

Fig. 2

Proportion of current smokers assigned to Cluster 2 (smoker-enriched cluster) by duration of exposure. “Broken stick” regression lines are presented for all current smokers (red solid line, square points), high-dose current smokers (orange dashed line, circular points) and low-dose current smokers (purple dotted line, diamond points).

There was a significant association between dose (cigarettes per day) and duration of exposure (years as a smoker) in current smokers. Individuals who had smoked for a longer duration were more likely to be heavier smokers (age– and sex–adjusted linear regression Beta = 0·38 cigarettes per day for each year as a smoker; P < 0.0001). To minimise confounding between dose and duration of exposure, data for current smokers were split based on the median dose to generate time point–specific subsets of heavy and light smokers. The proportion of smoker–enriched cluster assignments increased with duration of exposure in both dose groups, stabilising at 15–19 years of exposure in heavy smokers, and 25–29 years in lighter smokers (Fig. 2 and Supplementary Table 1). The proportion of individuals assigned to the smoker–enriched cluster over time in heavy smokers was significantly greater than that in light smokers (Wilcoxon signed rank test P = 0·002).
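The time point–specific median split can be sketched as follows. This Python example is illustrative only: the record layout, the function name and the toy doses are hypothetical, and the rule for individuals whose dose equals the median (placed in the light group here) is an assumption, as the text does not state how ties were handled.

```python
from collections import defaultdict
from statistics import median


def split_by_median_dose(records):
    """Split (time_bin, cigarettes_per_day) records at the median dose per bin."""
    by_bin = defaultdict(list)
    for time_bin, dose in records:
        by_bin[time_bin].append(dose)
    groups = {}
    for time_bin, doses in by_bin.items():
        cutoff = median(doses)
        groups[time_bin] = {
            "cutoff": cutoff,
            # Assumption: doses equal to the median count as light.
            "light": [d for d in doses if d <= cutoff],
            "heavy": [d for d in doses if d > cutoff],
        }
    return groups


# Hypothetical current smokers: (exposure bin, cigarettes per day).
toy = [("0-4", 5), ("0-4", 10), ("0-4", 20), ("0-4", 25),
       ("5-9", 10), ("5-9", 15), ("5-9", 30)]
groups = split_by_median_dose(toy)
print(groups["0-4"]["cutoff"], groups["5-9"]["cutoff"])
```

Because the cut-off is recomputed within each exposure bin, "heavy" and "light" are relative to peers with a similar duration of exposure, which is what limits the confounding between dose and duration.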

Clustering of former smokers depends on dose and time since cessation

Of the 1466 former smokers assessed, 359 (24·5%) were assigned to the smoker–enriched cluster. The proportion of smoker–enriched cluster assignments decreased as time since smoking cessation increased (Fig. 3 and Supplementary Table 2; OR for assignment to the smoker-enriched cluster = 0.86 per year since cessation; 95% CI = 0.83–0.89; P < 2.0 × 10^−16). A significant association was also present between cluster assignment and cigarettes per day in former smokers (OR for assignment to the smoker-enriched cluster = 1.08 per cigarette smoked per day; 95% CI = 1.06–1.10; P = 1.67 × 10^−14). The highest proportion of smoker–enriched cluster assignments (64·4%) was observed in individuals who had quit smoking within a year prior to sampling. The proportion of smoker–enriched cluster assignments fell below 50% by 1 year following cessation. Contrary to the findings in current smokers, there was a significant negative relationship between dose and duration of exposure in former smokers (age– and sex–adjusted linear regression Beta = −0·18 cigarettes smoked per day for each year as a smoker; P < 0.0001). Samples were next split on the median dose at each cessation time point to obtain a high–dose and low–dose group. The proportion of smoker–enriched cluster assignments was significantly lower in the low–dose group relative to the high–dose group (Wilcoxon signed rank test P = 1·5 × 10^−4). The proportion of smoker–enriched cluster assignments in the low–dose group was consistently below 50% (Fig. 3 and Supplementary Table 2). The proportion of smoker–enriched cluster assignments for former smokers exposed to a high dose fell below 50% two years following cessation. From five years following smoking cessation, the proportion of smoker–enriched cluster assignments stabilised in high– and low–dose groups. Of the 760 individuals who had quit at least 5 years prior to sampling, 84 (11·1%) were assigned to the smoker–enriched cluster.
As duration of exposure was not considered here, the analysis was repeated substituting years since cessation with pack years (years as a smoker  ×  packs smoked per day), revealing a similar trend (Supplementary Table 5, Supplementary Fig. 1). Using pack years as a metric, the proportion of smoker–enriched cluster assignments in the low–dose group stabilised from two years following smoking cessation, compared to five years following cessation in the high–dose group.
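The pack-year definition given above (years as a smoker × packs smoked per day) can be written out as a short Python sketch. The conversion from cigarettes per day to packs per day assumes a 20-cigarette pack, which is a common convention but is not stated in the text; the function and constant names are hypothetical.

```python
CIGARETTES_PER_PACK = 20  # assumption: conventional pack size, not from the text


def pack_years(years_as_smoker: float, cigarettes_per_day: float) -> float:
    """Pack years = years as a smoker x packs smoked per day."""
    return years_as_smoker * (cigarettes_per_day / CIGARETTES_PER_PACK)


# Two hypothetical smokers with the same duration but different doses.
print(pack_years(20, 20))  # one pack a day for 20 years
print(pack_years(20, 10))  # half a pack a day for 20 years
```

Pack years therefore combine dose and duration in a single number, which is why they were used here to address the omission of duration of exposure.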
Fig. 3

Proportion of former smokers assigned to Cluster 2 (smoker-enriched cluster) by years since smoking cessation. “Broken stick” regression lines are presented for all former smokers (red solid line, square points), high-dose former smokers (orange dashed line, circular points) and low-dose former smokers (purple dotted line, diamond points).

The comprehensive smoking index (CSI) was calculated as an additional metric to incorporate duration, intensity and recency of exposure in former smokers, and its relationship with cluster assignment was assessed [22]. In a five-year period from cessation, individuals with lower CSI scores were less likely to be assigned to the smoker-enriched cluster relative to those with a higher CSI score. Cluster assignments between high- and low-CSI individuals stabilised beyond 5 years following cessation (Supplementary Fig. 2). Finally, we investigated the trajectories of DNA methylation at probes where smoking-associated modifications were reported to persist up to 30 years following cessation [5]. Of 36 probes reported by Joehanes et al., 30 were present in the GS:SFHS DNA methylation data [5]. Absolute t-statistics for DNA methylation in former smokers versus never smokers decreased with increasing years since cessation (Supplementary Fig. 3; Supplementary Tables 6–7). Four probes (cg05575921, cg21566642, cg01940273 and cg00706683) remained significantly differentially methylated in former smokers relative to never smokers up to 30 years following cessation.
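For readers unfamiliar with the CSI, the sketch below shows one commonly used form of the index, in which a duration term, a recency term decaying with a half-life, and a log-transformed intensity term are multiplied together. This is an assumption about the general shape of the index, not a reproduction of the authors' implementation: the exact formula they applied is not given in the text, and the units of the half-life (taken here to match the units of duration and time since cessation) are also assumed.

```python
import math


def comprehensive_smoking_index(duration, time_since_cessation, intensity,
                                half_life=1.5):
    """One common form of the CSI (assumed, not taken from the text).

    duration and time_since_cessation share the half-life's time units;
    intensity is cigarettes per day.
    """
    duration_term = 1 - 0.5 ** (duration / half_life)
    recency_term = 0.5 ** (time_since_cessation / half_life)
    intensity_term = math.log(intensity + 1)
    return duration_term * recency_term * intensity_term


# Hypothetical former smoker: 20 time units of smoking at 20 cigarettes per day.
at_cessation = comprehensive_smoking_index(20, 0, 20)
one_half_life_later = comprehensive_smoking_index(20, 1.5, 20)
print(at_cessation > one_half_life_later)
```

In this form the index halves with every half-life elapsed since cessation, so recent quitters score higher than long-term quitters with the same smoking history.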

Sensitivity analysis

To check the robustness of the predictions, three sensitivity analyses were considered. In the first analysis, a parsimonious predictor was developed by selecting CpG sites that discriminated smokers from non–smokers with an AUC > 0.9 (five out of the 18,760 genome–wide EWAS sites identified by Joehanes et al.: cg05575921, cg21566642, cg01940273, cg03636183 and cg21161138) [5]. There was a slight improvement in the prediction of current versus never smokers using this score (Supplementary Table 8; Supplementary Fig. 4). However, the proportion of current smoker assignments in high–dose former smokers was consistently higher with the five–CpG predictor than with the cluster–based predictor. Moreover, low–dose former smokers displayed a consistent proportion of current smoker assignments over time in comparison to the cluster–based assignments (compare Supplementary Fig. 5 with Fig. 3). In the second analysis, two predictors were developed based on polygenic scores for a subset of the most significant smoking–associated CpG sites (N = 90), and all smoking–associated sites (N = 17,529) [5]. The 90–probe polygenic predictor yielded similar results to the cluster– and AUC–based predictors (Supplementary Table 9, Supplementary Figs. 6–7). In contrast, the polygenic score derived from the larger probe set displayed poorer predictions (Supplementary Table 10, Supplementary Figs. 8–9). In the final analysis, DNA methylation-based smoking scores were generated for former smokers, based on a signature developed from current and never smokers in the GS:SFHS dataset [23]. Average DNA methylation scores in former smokers decreased within the first 2–3 years of quitting, remaining stable thereafter (Supplementary Fig. 10).
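The AUC criterion used to select probes in the first sensitivity analysis can be illustrated with a small Python sketch. The AUC is computed here as the probability that a randomly chosen smoker scores higher than a randomly chosen never smoker, with ties counted as one half. The methylation values are invented for illustration, and negating them (so that lower methylation gives a higher score) is an assumption about the direction of effect at the hypothetical probe.

```python
def auc(case_scores, control_scores):
    """Probability that a random case outscores a random control (ties = 0.5)."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical methylation beta-values at a probe hypomethylated in smokers.
smoker_betas = [0.55, 0.60, 0.72]
never_betas = [0.70, 0.85, 0.88, 0.90]

# Negate so that a higher score indicates a more smoker-like profile.
result = auc([-b for b in smoker_betas], [-b for b in never_betas])
print(round(result, 3))
```

In this toy example 11 of the 12 smoker/never-smoker pairs are ordered correctly, giving an AUC above the 0.9 threshold mentioned above.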

Discussion

In this study, we showed that smoking–based DNA methylation patterns are time– and dose–dependent. We identified two clusters from DNA methylation data in over 4900 individuals – one enriched for current smokers and another enriched for never–smokers. It took 15–19 years for the majority of low–dose smokers to display a methylation profile that assigned them to the smoker–enriched cluster. It took <1 year for the majority of low–dose ex–smokers to be assigned to the never smoker–enriched cluster. By contrast, it took 5–9 years for the majority of heavy–dose smokers to display DNA methylation profiles corresponding to the smoker–enriched cluster, and up to 2 years since quitting before the majority of heavy–dose ex–smokers had methylation patterns that more strongly resembled those of never smokers. Furthermore, there is little impact of smoking dose on methylation–based clustering of smoking for those who had smoked for >25 years or for those who had stopped smoking for at least 6 years. These findings suggest that a prolonged period of exposure to cigarette smoke is required before a smoking–related signature can be reliably identified using DNA methylation data. This is supported by evidence from multiple studies, which have reported an association between duration of exposure to cigarette smoke and an increased risk of oesophageal, lung, and bladder cancers [24–26]. Moreover, a longer duration of exposure has been linked to an increased risk of chronic obstructive pulmonary disease (COPD) and respiratory symptoms [27]. It is therefore worth considering our findings in the context of molecular pathological epidemiology (MPE), an approach that examines the influence of exogenous factors such as lifestyle and the environment on both disease pathogenesis and omics measures such as DNA methylation and gene expression [28,29]. In the current study, we examined DNA methylation from blood and not from tumour or from tissues more directly targeted by disease, such as lung.
Nonetheless, there may still be precision medicine applications of blood-based DNA methylation smoking signatures. Should the DNA methylation profile of smokers be associated with an increased risk of smoking–related pathologies, the current findings suggest there is a dose–dependent period of exposure within which this risk is comparable to that of never smokers. Others have reported reversion of smoking–associated DNA methylation changes in former smokers persisting beyond 30 years from cessation, with the most rapid reversion rates occurring within the first 14 years [30]. Moreover, increased methylation levels at AHRR have been reported in smokers undergoing cessation therapy [31]. Examination of cluster assignments in former smokers revealed to some degree the reversible nature of smoking–associated DNA methylation changes. Former light smokers were more likely to be assigned to the never smoker–enriched cluster, regardless of time since cessation. In contrast, a period of two years was required before the rate of never–smoker cluster assignments for former heavy smokers reached >50%. A small proportion of never smokers were assigned to the smoker–enriched cluster. Such misclassifications may be a result of passive smoking, or other lifestyle–related correlates of smoking status. Although we did not observe an association between alcohol consumption and assignment of never smokers to the smoker–enriched cluster, it is possible that additional smoking–associated factors contribute to their misclassification. The effects of passive (i.e. second–hand) smoking on DNA methylation have been well established, with differential DNA methylation reported to persist up to decades following exposure to cigarette smoke in utero [11–13]. It was not possible to determine whether the individuals profiled in the current study were exposed to cigarette smoke in utero as maternal smoking data were unavailable.
Second–hand smoke exposure has also been linked to differential DNA methylation in adults. Similar DNA methylation patterns have been observed in lung tumours of smokers and second–hand smokers [32]. Hypomethylation of AHRR at cg05575921 has been linked to recent exposure to second–hand smoke [33], while others have reported significant associations between second–hand smoke exposure and differential DNA methylation in bladder cancer [34]. There was no association between misclassification of never smokers and co–habitation with, or other exposure to, smokers. However, information on the duration of co–habitation with smokers was not available, and there was no information regarding co–habitants and exposures prior to sampling. We showed the use of AUC–based prediction of current/never smoking status using five probes is more accurate than the cluster–based prediction. However, smoking–associated DNA methylation changes at four of these five probes have been reported to persist decades following cessation [5]. Moreover, as the prediction thresholds for the five CpGs were selected to discriminate smoking status in the current sample, this generates a biased predictor when applied to the same data. While the predictive performance of the cluster–based predictions is less accurate for current/never smokers, its application to former smokers may be more suitable due to the inclusion of sites with reversible smoking–associated DNA methylation changes. This is reflected by the consistently higher proportion of current smoker assignments over time from the AUC–based predictor relative to the cluster–based predictor. In a further sensitivity analysis, Z–score based polygenic methylation scores were built from all 18,760 genome–wide significant CpGs (N = 17,529 present in the GS:SFHS dataset) and also from the top 100 CpGs (N = 90 present in the GS:SFHS dataset). 
In the primary analysis, clusters were defined in relation to the methylation values in the Generation Scotland cohort, which may have introduced ascertainment bias. The polygenic analysis for the 90 CpGs yielded very similar results to the primary models. In contrast, the polygenic analysis derived from 17,529 probes did not perform as well. This was possibly due to the introduction of noise from many features of small effects. Conversely, predictive performance was improved by the inclusion of fewer features of larger effects. In addition to the lack of information regarding maternal and past exposures to second–hand smoke, a further limitation to the current analysis is the presence of confounding between cigarette dose and duration of exposure to cigarette smoke. In order to minimise this association and to focus primarily on duration of exposure, the sample was stratified on the median dose at each time point assessed. A strength of this study is the use of a large and homogeneous analysis cohort. The Generation Scotland cohort comprises participants across a broad age range (18–99 years), which has permitted the analysis of smoking exposure in a large number of both recently–started and long–term smokers, as well as recently–quit and long–term former smokers. Moreover, future analyses of smoking phenotypes and related health outcomes are possible, as a result of data linkage capabilities and sample collection for longitudinal DNA methylation profiling. In conclusion, our findings suggest there is a dose–dependent interval within which smoking–associated DNA methylation changes are established. Furthermore, we have demonstrated a degree of reversibility of these changes in former smokers, whereby the interval of reversion is dependent on dose prior to smoking cessation.
Consideration of duration of exposure in current smokers, and years since cessation in former smokers, coupled with dose, all measured via DNA methylation patterns, may assist in determining and stratifying risk of smoking–associated morbidities. Highlighting the establishment of smoking-associated DNA alterations provides an important public health message as a deterrent from smoking initiation. Furthermore, our reports on the dose-dependency and reversibility of these changes may encourage a reduction in the cigarette intake of current smokers (if not cessation), and an incentive against relapse in former smokers.

Author contributions

Conception and design: DLM, REM, IJD, DJP, PMV. Data analysis: DLM. Drafting the article: DLM, REM. Data preparation: DLM, REM, RMW, MLB, SWM, AC. Data collection: DJP, AMM, KLE, ADM. Revision of the article: all authors.

Declaration of interests

Dr. McIntosh reports grants from The Sackler Trust, grants from Eli Lilly, and grants from Janssen outside the submitted work. The remaining authors have nothing to disclose.