| Literature DB >> 32064457 |
Tamar Paz-Elizur1, Yael Leitner-Dagan1, Kerstin B Meyer1, Barak Markus1, Federico M Giorgi1, Martin O'Reilly1, Hyunjin Kim1, Yentl Evgy1, Ronen Fluss1, Laurence S Freedman1, Robert C Rintoul1, Bruce Ponder1, Zvi Livneh1.
Abstract
BACKGROUND: Improving lung cancer risk assessment is required because current early-detection screening criteria miss most cases. We therefore examined the utility for lung cancer risk assessment of a DNA Repair score obtained from OGG1, MPG, and APE1 blood tests. In addition, we examined the relationship between the level of DNA repair and global gene expression.Entities:
Year: 2019 PMID: 32064457 PMCID: PMC7012022 DOI: 10.1093/jncics/pkz067
Source DB: PubMed Journal: JNCI Cancer Spectr ISSN: 2515-5091
Characteristics of case patients and control individuals
| Characteristics | Control individuals (N = 143) | Case patients (N = 150) |
|
|---|---|---|---|
| Sex, No. (%) | .291 | ||
| Male | 79 (55.2) | 93 (62.0) | |
| Female | 64 (44.8) | 57 (38.0) | |
| Age, mean (SD) | 59.7 (10.0) | 68.6±9.6 | <.001 |
| Smoking status, No. (%) | <.001 | ||
| Current | 48 (33.6) | 62 (41.3) | |
| Ex-smoker | 50 (35.0) | 87 (58.0) | |
| Never | 45 (31.5) | 1 (0.7) | |
| Pack-years, mean (SD) | 16.8±20.3 | 43.2±27.6 | <.001 |
| Histology, No. (%) | |||
| Adenocarcinoma | — | 83 (55.3) | |
| Squamous | — | 56 (37.3) | |
| NSCLC-NOS | — | 11 (7.3) | |
| COPD, No. (%) | <.001 | ||
| None | 112 (83.6) | 56 (41.8) | |
| Mild | 9 (6.7) | 26 (19.4) | |
| Moderate | 10 (7.5) | 31 (23.1) | |
| Severe | 3 (2.2) | 16 (11.9) | |
| Severity unknown | 0 (0.0) | 5 (3.7) |
P value comparing the distribution of case patients with control individuals: For categorial values the chi-square test was used; for continuous variables the unpaired t test was used. COPD = chronic obstructive pulmonary disease; NSCLC-NOS = non–small cell lung cancer–not otherwise specified; — = not applicable; Missing values: pack-years: three control individuals; COPD: nine control individuals and 16 case patients.
Figure 1.Outline of the panel of DNA Repair assays used to determine the personal DNA Repair score. Each assay measures the activity of a specific DNA Repair enzyme, in a protein extract prepared from PBMC, to remove its target DNA damage from a synthetic short oligonucleotide, shown schematically in the upper panel. Each enzyme assay was run separately under optimized conditions. The target damaged base in each substrate oligonucleotide was 8-oxoguanine in the OGG1 assay (illustrated by a black circle), hypoxanthine in the MPG assay (orange square), and a furanyl abasic site in the APE1 assay (blue triangle). The asterisks represent a 3'-Yakima Yellow fluorescence tag. Assays were run on a robotic platform and analyzed in an AB3130XL genetic analyzer. The personal OGG1, MPG, and APE1 enzyme activities were then used to calculate the personal DNA Repair score for each participant, as described in Methods. Further description of the DNA Repair assays is provided in the Supplementary Materials (available online). APE1 = apurinic/apyrimidinic endonuclease 1; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; OMA = OGG1, MPG, and APE1; PBMC = peripheral blood mononuclear cells.
Distribution of selected characteristics and DNA Repair score* between non–small cell lung cancer case patients and control individuals
| Control individuals (n = 140) | Case patients (n = 149) | ||||
|---|---|---|---|---|---|
| Characteristics | No. | DNA Repair score mean (95% CI) |
| No. | DNA Repair score mean (95% CI) |
| All† | 140 | 4.00 (3.84 to 4.16) | 149 | 2.67 (2.50 to 2.84) | |
|
| |||||
| Histology | |||||
| SQCC | – | – | 56 | 2.69 (2.45 to 2.93) | |
| Adenocarcinoma | – | – | 82 | 2.62 (2.38 to 2.86) | |
| Unknown | – | – | 11 | 2.92 (1.87 to 3.98) | |
|
| |||||
| Age, y | |||||
| ≤55 | 43 | 4.28 (3.96 to 4.59) | 16 | 2.36 (1.82 to 2.89) | |
| 56–69 | 75 | 4.00 (3.81 to 4.20) | 64 | 2.86 (2.55 to 3.16) | |
| ≥70 | 22 | 3.45 (2.95 to 3.94) | 69 | 2.56 (2.35 to 2.78) | |
|
| |||||
| Sex | |||||
| Male | 77 | 3.82 (3.59 to 4.05) | 93 | 2.71 (2.49 to 2.93) | |
| Female | 63 | 4.21 (3.99 to 4.44) | 56 | 2.60 (2.32 to 2.88) | |
|
| |||||
| Smoking status | |||||
| Never smoked | 45 | 4.07 (3.79 to 4.36) | 1 | 4.85 | |
| Past smoker | 49 | 3.98 (3.72 to 4.24) | 87 | 2.60 (2.40 to 2.81) | |
| Current smoker | 46 | 3.95 (3.62 to 4.27) | 61 | 2.72 (2.43 to 3.02) | |
|
| |||||
|
| |||||
| COPD | |||||
| No | 109 | 4.02 (3.85 to 4.19) | 56 | 2.86 (2.55 to 3.18) | |
| Yes | 22 | 3.82 (3.30 to 4.34) | 77 | 2.57 (2.35 to 2.79) | |
|
| |||||
The DNA Repair score was defined as: 0.00621 × APE1 − 0.047 × OGG1 − 0.0223 × MPG, where APE1, OGG1, and MPG each represent the measured enzyme activity of an individual. The weights for each component used in the DNA Repair score were calculated from the logistic regression and chosen to optimize strength of association of the score with lung cancer for the observed data set. APE1 = apurinic/apyrimidinic endonuclease 1; CI = confidence interval; COPD = chronic obstructive pulmonary disease; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; SQCC = squamous cell carcinoma.
Values were missing for three control individuals (two current and one past smoker) and one case patient (current smoker).
Two-sided P value comparing the distribution of case patients with control individuals: analysis of covariance was used adjusting the comparison for continuous age, sex, and smoking status.
Two-sided P value for comparing SQCC with adenocarcinoma: analysis of covariance was used adjusting for case-control-status, continuous age, sex, and smoking status.
Two-sided P values obtained from analysis of covariance for factor of interest, adjusted for case-control status and other factors from among age, sex, and smoking status.
Two-sided P value comparing COPD vs no COPD; using analysis of covariance adjusting for continuous age and sex.
Figure 2.DNA Repair score distribution among lung cancer case patients and control individuals. The frequency distribution of DNA Repair scores is shown for 149 lung cancer patients (red curve) and 140 control individuals (blue curve). The graphs were plotted using GraphPad Prism version 5.00 (GraphPad Software, San Diego, CA), with bin width automatically chosen by the software, with second- order smoothing, with two neighbors on each size. Bin width was 0.5 units. APE1 = apurinic/apyrimidinic endonuclease 1; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; OMA = OGG1, MPG, and APE1.
Unconditional logistic regression analysis of the DNA Repair score*
| Variable | Score value | No. of control individuals (%) | No. of case patients (%) | Adjusted OR† (95% CI) |
| Cross-validation Adjusted OR‡ (95% CI) |
|
|---|---|---|---|---|---|---|---|
| Score per 1 SD decrease§ | 0.98 U | 140 (100.0) | 149 (100.0) | 2.7 (1.9 to 3.7) | <.001 | 2.5 (1.8 to 3.4) | <.001 |
| Score by tertiles‖ | |||||||
| Highest tertile | ≥4.52 | 47 (33.6) | 6 (4.0) | 1.0 (Referent) | — | 1.0 (Referent) | — |
| Middle tertile | 3.68–4.51 | 46 (32.9) | 22 (14.8) | 3.0 (1.0 to 8.7) | .048 | 1.6 (0.6 to 4.2) | — |
| Lowest tertile | ≤3.68 | 47 (33.6) | 121 (81.2) | 11.5 (4.3 to 31.2) | <.001 | 7.2 (3.0 to 17.5) | — |
DNA Repair score definition: 0.00621 × APE1 – 0.047 × OGG1 – 0.0223 × MPG. APE1 = apurinic/apyrimidinic endonuclease 1; CI = confidence interval; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; OR = odds ratio.
Unconditional logistic regression adjusted for age, sex, and smoking status (smoker, past smoker, never smoker).
Cross-validated odds ratios calculated according to the SDs and tertiles of the cross-validated DNA Repair scores.
Score was fitted in the unconditional logistic regression model as a continuous variable. 0.98 U is the 1 SD in the control group for the DNA Repair scores. For each model, the estimated odds ratios for smoking were slightly different and the range was as follows: current vs never = 57.1–59.4; current v ex = 1.27–1.29.
Tertiles of the control individuals’ values. The upper tertile was chosen as the referent.
Figure 3.Analysis of the DNA Repair scores by disease staging. The distributions of the DNA Repair scores are presented for the control individuals (C blue) and for the lung cancer patients by stage status (in red) using boxplots (whiskers 10th–90th percentile) showing the values lower than the first 10th percentile and greater than the 90th percentile as dots, with the line in the middle plotted at the median. Distributions are presented by disease state (A), T staging (B), N staging (C), and M staging (D). The number of participants in each subgroup is indicated above the plots. Importantly, the differences between the control groups and each of the earliest stagings in each category is statistically significant. C vs disease stage IA, P = .0009; C vs T1A, P = .011; C vs N0, P < .0001; C vs M0, P < .0001. All P values were adjusted for sex, age, and smoking status. See related numerical values in Supplementary Table 1 (available online), where the means and SDs are presented. The graphs were plotted using GraphPad Prism version 5.00 (GraphPad Software, San Diego, CA).
Figure 4.Receiver operating characteristic (ROC) curve of the sensitivity and specificity of the DNA repair score in lung cancer risk and examples of its added value. A) ROC curve for the DNA repair scores obtained in the current study (black curve) and after cross-validation (red curve). B) ROC curve for the covariates age, sex, and smoking status (black curve) and after adding the DNA repair score to these basic covariates before (red curve) and after (green curve) cross-validation. C) Estimated added value of the DNA repair score to lung cancer risk estimates based on the Liverpool Lung Project (LLP) Risk Model. Estimates are presented for a man and a woman, age 65 years, who are either never smokers or smoked for 30 years. Gray columns represent the 5-year risk according to the Lung Project Risk model. The effect of having a low DNA repair score of 5th percentile or less (red columns), or a high DNA repair score of 75th percentile or greater (blue columns) are presented. Data were taken from Supplementary Table 4 (available online). APE1 = apurinic/apyrimidinic endonuclease 1; AUC = area under the curve; CV = cross-validated; F = female; M = male; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; OMA = OGG1, MPG, and APE1.
Figure 5.Enrichment of immune system related pathways with low DNA Repair score using gene set enrichment analysis (GSEA). Genes ranked by the RNAseq2 analysis according to their correlation to the DNA Repair score were analyzed by GSEA to identify pathways (using gene ontology terms) that statistically significantly correlate with the DNA Repair score. The analysis was performed separately for case patients and control individuals. This leads to a list of pathways ranked according to their normalized enrichment score, the top 30 of which are shown in Supplementary Table 2 (available online). The figure presents for each tissue and health state all of the 366 immune system–related pathways as found by the GSEA analysis (according to keywords presented in Supplementary Table 2, available online). Each dot represents a pathway, with its y-axis value showing the false discovery rate, -Log(Q-value), for the enrichment score. The pathways are colored according to their false discovery rate values: gray dots Q-Value > 0.001, red dots Q-Value ≤ 0.001. Negatively and positively enriched pathways are shown for nasal and bronchial tissues, in case patients and control individuals, as indicated.