| Literature DB >> 27924164 |
Yan Zhang1, Magdeldin Elgizouli2, Alexandra Nieters2, Hermann Brenner1,3,4, Ben Schöttker1, Bernd Holleczek5.
Abstract
BACKGROUND: Newly established blood DNA methylation markers that are strongly associated with smoking might open new avenues for lung cancer (LC) screening. We aimed to assess the performance of the top hits from previous epigenome-wide association studies in prediction of LC incidence. In a prospective nested case-control study, DNA methylation at AHRR (cg05575921), 6p21.33 (cg06126421), and F2RL3 (cg03636183) were measured by pyrosequencing in baseline whole blood samples of 143 incident LC cases identified during 11 years of follow-up and 457 age- and sex-matched controls without diagnosis of LC until the end of follow-up. The individual and joint associations of the 3 markers with LC risk were estimated by logistic regression, adjusted for potential confounders including smoking status and cigarette pack-years. The predictive performance was evaluated for both the individual markers and their combinations derived from multiple algorithms.Entities:
Keywords: AHRR; DNA methylation; F2RL3; Lung cancer; Risk prediction; Smoking
Mesh:
Substances:
Year: 2016 PMID: 27924164 PMCID: PMC5123284 DOI: 10.1186/s13148-016-0292-4
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Methylation distribution at baseline by smoking status and lung cancer status. a, b Present methylation levels of AHRR_cg05575921, 6p21.33_cg06126421, and F2RL3_cg03636183 among current, former, and never smokers at baseline, respectively, in the training and validation set. c, d Present methylation levels of AHRR_cg05575921, 6p21.33_cg06126421, and F2RL3_cg03636183 among lung cancer cases and controls, respectively, in the training set and validation set. e, f Illustrate distribution of lung cancer cases inside and outside the first quartile of methylation among controls at AHRR_cg05575921, 6p21.33_cg06126421, and F2RL3_cg03636183, respectively, in the training and validation set
Characteristics of the study population
| Characteristics | Training set | Validation set | ||||
|---|---|---|---|---|---|---|
| Cases ( | Controls ( |
| Cases ( | Controls ( |
| |
| No. (%)a | No. (%)a | No. (%)a | No. (%)a | |||
| Age (years) | 64 (5.7) | 64 (6.1) | 64 (5.9) | 64 (6.3) | ||
| Sex | ||||||
| Male | 58 (74.4) | 167 (75.2) | 48 (73.9) | 169 (71.9) | ||
| Female | 20 (25.6) | 55 (24.8) | 0.88 | 17 (26.1) | 66 (28.1) | 0.76 |
| Smoking statusc | ||||||
| Never smoker | 5 (6.5) | 86 (39.8) | 9 (13.9) | 100 (44.8) | ||
| Former smoker | 29 (37.7) | 90 (41.7) | 26 (40.0) | 88 (39.5) | ||
| Current smoker | 43 (55.8) | 40 (18.5) | <0.0001 | 30 (46.2) | 35 (15.7) | <0.0001 |
| Body mass index (kg/m2)d | ||||||
| Under weight (<18.5) | 1 (1.3) | 0 | 1 (1.6) | 1 (0.43) | ||
| Normal weight (18.5–<25.0) | 25 (32.5) | 55 (24.8) | 19 (29.2) | 62 (26.4) | ||
| Overweight (25.0–<30.0) | 29 (37.7) | 115 (51.8) | 32 (49.2) | 119 (50.6) | ||
| Obesity (≥30.0) | 22 (28.5) | 52 (23.4) | 0.07 | 13 (20.0) | 53 (22.6) | 0.74 |
| Educational levele | ||||||
| Low | 59 (78.7) | 143 (65.3) | 57 (87.7) | 164 (71.6) | ||
| Intermediate | 11 (14.7) | 41 (18.7) | 3 (4.6) | 35 (15.3) | ||
| High | 5 (6.6) | 35 (16.0) | 0.06 | 5 (7.7) | 30 (13.1) | 0.02 |
| Physical activityf | ||||||
| Inactive | 18 (23.1) | 40 (18.0) | 25 (38.5) | 48 (20.6) | ||
| Insufficient | 43 (55.1) | 95 (42.8) | 23 (35.4) | 115 (49.4) | ||
| Sufficient | 17 (21.8) | 87 (39.2) | 0.02 | 17 (26.1) | 70 (30.0) | 0.01 |
| Family history of cancerg | ||||||
| No | 39 (52.0) | 132 (60.0) | 30 (47.6) | 132 (56.4) | ||
| Yes | 36 (48.0) | 88 (40.0) | 0.23 | 33 (52.4) | 102 (43.6) | 0.21 |
| Diabetesh | ||||||
| Not prevalent | 64 (82.0) | 188 (85.1) | 50 (76.9) | 198 (84.3) | ||
| Prevalent | 14 (18.0) | 33 (14.9) | 0.53 | 15 (23.1) | 37 (15.7) | 0.17 |
| Cardiovascular disease | ||||||
| Not prevalent | 60 (76.9) | 177 (79.7) | 44 (67.7) | 180 (76.6) | ||
| Prevalent | 18 (23.1) | 45 (20.3) | 0.60 | 21 (32.3) | 55 (23.4) | 0.14 |
| Systolic blood pressure (mmHg)i | 140 (18) | 140 (19) | 0.12 | 141 (17) | 141 (19) | 0.77 |
| Total cholesterol (mg/dL)j | 205.6 (54.4) | 200.5 (58.7) | 0.48 | 236.1 (38.4) | 224.8 (43.6) | 0.03 |
| Pack-yearsk | 39.2 (25.4) | 16.2 (20.2) | <0.0001 | 34.3 (22.6) | 13.4 (18.4) | <0.0001 |
aTable shows numbers (proportions) for categorical variables and means (standard deviation) for continuous variables
bChi-square test for categorical variable and Wilcoxon test for continuous variables
cData missing for 1 case and 6 controls in the training set and 12 controls in the validation set
dData missing for 1 case in the training set
eData missing for 3 cases and 3 controls in the training set and 6 controls in the validation set
fData missing for 2 controls in the training set
gData missing for 2 cases and 3 controls in the training set and 2 cases and 1 control in the validation set
hData missing for 1 control in the training set
iData missing for 4 cases and 5 controls in the training set and 2 cases and 4 controls in the validation set
jData missing for 1 controls in the training set and 2 controls in the validation set
kData missing for 2 cases and 27 controls in the training set and 3 cases and 27 controls in the validation set
Associations of methylation at AHRR, 6p21.33, and F2RL3 with lung cancer incidence in the validation set
| CpG site | Methylation levela | Controls | Cases | OR (95% CI) | ||
|---|---|---|---|---|---|---|
| Model 1b | Model 2c | Model 3d | ||||
|
| ≥85 (quartile 4) | 59 | 6 | Ref. | Ref. | Ref. |
| <85 (quartile 3) | 73 | 1 | ||||
| <80 (quartile 2) | 58 | 11 | 4.13 (1.48–11.52) | 3.70 (1.12–12.22) | 4.63 (1.27–16.80) | |
| <68 (quartile 1) | 45 | 47 | 23.93 (9.61–59.57) | 17.17 (4.91–60.03) | 15.86 (4.18–60.17) | |
| Per SD less methylation | – | 2.61 (2.02–3.37) | 2.58 (1.69–3.94) | 2.37 (1.46–3.85) | ||
|
| ≥73 (quartile 4) | 63 | 4 | Ref. | Ref. | Ref. |
| <73 (quartile 3) | 76 | 6 | ||||
| <66 (quartile 2) | 50 | 12 | 3.90 (1.52–9.98) | 3.00 (1.06–8.48) | 4.08 (1.27–13.07) | |
| <57 (quartile 1) | 46 | 43 | 15.55 (6.89–35.10) | 6.92 (2.63–18.18) | 8.12 (2.69–24.48) | |
| Per SD less methylation | – | 2.92 (2.15–3.98) | 2.11 (1.45–3.05) | 2.11 (1.39–3.19) | ||
|
| ≥81 (quartile 4) | 113 | 5 | Ref. | Ref. | Ref. |
| <81 (quartile 3) | 39 | 5 | ||||
| <78 (quartile 2) | 40 | 9 | 3.91 (1.45–10.55) | 2.75 (0.91–8.37) | 2.45 (0.72–8.31) | |
| <73 (quartile 1) | 43 | 46 | 19.25 (8.59–43.15) | 10.84 (4.03–29.19) | 10.55 (3.44–32.31) | |
| Per SD less methylation | – | 2.46 (1.90–3.19) | 1.86 (1.33–2.60) | 1.72 (1.17–2.51) | ||
Abbreviations: OR odds ratio, CI confidence interval, Ref. reference category, SD standard deviation
aQuartiles of each site among controls in the training set
bModel 1: adjusted for age and sex
cModel 2: like model 1, additionally adjusted for smoking status and pack-years
dModel 3: like model 2, additionally adjusted for educational level, BMI, physical activity, systolic blood pressure, total cholesterol, family history of cancer, prevalence of hypertension, cardiovascular disease, and diabetes
Fig. 2Dose-response curves of methylation at AHRR, 6p21.33, and F2RL3 with lung cancer incidence. a.b. present the dose-response curves for AHRR_cg05575921, respectively, in training and validation set. c.d. present the dose-response curves for 6p21.33_cg06126421, respectively, in training and validation set. e.f. present the dose-response curves for F2RL3_cg03636183, respectively, in training and validation set
Associations of smoking with lung cancer incidence in the validation set
| Smoking exposure | Controls | Cases | OR (95% CI) | |||
|---|---|---|---|---|---|---|
| Model 1a | Model 2b | Model 3c | Model 4d | |||
| Never smoker | 100 | 9 | Ref. | Ref. | Ref. | Ref. |
| Former smoker | 88 | 26 | 1.58 (0.54–4.60) | 0.94 (0.27–3.21) | 1.05 (0.33–3.30) | 1.08 (0.33–3.51) |
| Current smoker | 35 | 30 | 3.07 (0.93–10.15) | 0.81 (0.21–3.15) | 1.35 (0.36–5.06) | 1.07 (0.28–4.09) |
| Per 21 (=1SD) pack-years | – | 2.26 (1.46–3.51) | 1.55 (0.96–2.48) | 1.93 (1.21–3.07) | 1.72 (1.08–2.75) | |
Abbreviations: OR odds ratio, CI confidence interval, Ref. reference category, SD standard deviation
aModel 1: adjusted for age and sex
bModel 2: adjusted for age, sex, and methylation of AHRR_cg05575921
cModel 3: adjusted for age, sex, and methylation of 6p21.33_cg06126421
dModel 4: adjusted for age, sex, and methylation of F2RL3_cg03636183
Fig. 3Receiver operating characteristic (ROC) curves for methylation at AHRR, 6p21.33, and F2RL3 in discrimination of incident lung cancer in training set (panel a) and in validation set (panel b). ROC curves for self-reported smoking status and pack-years are shown for comparison
Fig. 4Receiver operating characteristic (ROC) curves for methylation at AHRR, 6p21.33, and F2RL3 and pack-years in discrimination of incident lung cancer among light smokers
Individual and joint discriminative performance of methylation at AHRR, 6p21.33, and F2RL3
| Group | AUC (95% CI) | |||
|---|---|---|---|---|
|
|
|
| Combinationa | |
| Overall | ||||
| Training set ( | 0.792 (0.736–0.848) | 0.662 (0.597–0.726) | 0.791 (0.735–0.846) | 0.829 (0.778–0.881) |
| Validation set ( | 0.799 (0.733–0.866) | 0.789 (0.725–0.853) | 0.812 (0.725–0.871) | 0.800 (0.737–0.861) |
| Age specific prediction | ||||
| <65 years ( | 0.789 (0.728–0.850) | 0.745 (0.687–0.803) | 0.792 (0.735–0.849) | 0.800 (0.745–0.856) |
| ≥65 years ( | 0.790 (0.726–0.856) | 0.677 (0.604–0.751) | 0.793 (0.732–0.854) | 0.817 (0.760–0.875) |
| Follow-up time-specific prediction | ||||
| Initial 5 years ( | 0.791 (0.733–0.849) | 0.696 (0.631–0.761) | 0.808 (0.758–0.857) | 0.812 (0.759–0.865) |
| Later years ( | 0.791 (0.734–0.849) | 0.730 (0.673–0.786) | 0.779 (0.722–0.837) | 0.807 (0.755–0.859) |
| Histological subtype prediction | ||||
| SCLC ( | 0.744 (0.630–0.858) | 0.651 (0.535–0.767) | 0.738 (0.632–0.843) | 0.739 (0.634–0.844) |
| NSCLC ( | 0.802 (0.758–0.847) | 0.721 (0.672–0.770) | 0.798 (0.754–0.843) | 0.823 (0.782–0.864) |
| Adenocarcinoma ( | 0.814 (0.751–0.877) | 0.730 (0.659–0.800) | 0.814 (0.751–0.876) | 0.830 (0.770–0.891) |
| Squamous cell carcinoma ( | 0.787 (0.709–0.864) | 0.731 (0.655–0.807) | 0.769 (0.699–0.839) | 0.786 (0.717–0.856) |
| Others ( | 0.775 (0.686–0.864) | 0.673 (0.576–0.770) | 0.800 (0.713–0.888) | 0.813 (0.729–0.896) |
Abbreviations: AUC areas under the curve, CI confidence interval, SCLC small cell lung cancer, NSCLC non-small cell lung cancer
aCombination formula: β1 × M + β2 × M + β3 × M + β4 × M × β3 × M = (−0.0685) × cg05575921 + 0.4673 × cg06126421 + 0.3173 × cg03636183 + (−0.00612) × cg06126421 × cg03636183, where underlined coefficients were derived from regression coefficients in training set