Literature DB >> 17967182

Identification of a panel of sensitive and specific DNA methylation markers for lung adenocarcinoma.

Jeffrey A Tsou1, Janice S Galler, Kimberly D Siegmund, Peter W Laird, Sally Turla, Wendy Cozen, Jeffrey A Hagen, Michael N Koss, Ite A Laird-Offringa.   

Abstract

BACKGROUND: Lung cancer is the number one cancer killer of both men and women in the United States. Three quarters of lung cancer patients are diagnosed with regionally or distantly disseminated disease; their 5-year survival is only 15%. DNA hypermethylation at promoter CpG islands shows great promise as a cancer-specific marker that would complement visual lung cancer screening tools such as spiral CT, improving early detection. In lung cancer patients, such hypermethylation is detectable in a variety of samples ranging from tumor material to blood and sputum. To date the penetrance of DNA methylation at any single locus has been too low to provide great clinical sensitivity. We used the real-time PCR-based method MethyLight to examine DNA methylation quantitatively at twenty-eight loci in 51 primary human lung adenocarcinomas, 38 adjacent non-tumor lung samples, and 11 lung samples from non-lung cancer patients.
RESULTS: We identified thirteen loci showing significant differential DNA methylation levels between tumor and non-tumor lung; eight of these show highly significant hypermethylation in adenocarcinoma: CDH13, CDKN2A EX2, CDX2, HOXA1, OPCML, RASSF1, SFPR1, and TWIST1 (p-value << 0.0001). Using the current tissue collection and 5-fold cross validation, the four most significant loci (CDKN2A EX2, CDX2, HOXA1 and OPCML) individually distinguish lung adenocarcinoma from non-cancer lung with a sensitivity of 67-86% and specificity of 74-82%. DNA methylation of these loci did not differ significantly based on gender, race, age or tumor stage, indicating their wide applicability as potential lung adenocarcinoma markers. We applied random forests to determine a good classifier based on a subset of our loci and determined that combined use of the same four top markers allows identification of lung cancer tissue from non-lung cancer tissue with 94% sensitivity and 90% specificity.
CONCLUSION: The identification of eight CpG island loci showing highly significant hypermethylation in lung adenocarcinoma provides strong candidates for evaluation in patient remote media such as plasma and sputum. The four most highly ranked loci, CDKN2A EX2, CDX2, HOXA1 and OPCML, which show significant DNA methylation even in stage IA tumor samples, merit further investigation as some of the most promising lung adenocarcinoma markers identified to date.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17967182      PMCID: PMC2206053          DOI: 10.1186/1476-4598-6-70

Source DB:  PubMed          Journal:  Mol Cancer        ISSN: 1476-4598            Impact factor:   27.401


Background

Lung cancer is expected to cause over 160,000 deaths in 2007 -killing more Americans than cancer of the prostate, breast, colon, rectum and pancreas combined [1]. Lung cancer is clinically classified into two classes: the aggressive subtype small cell lung cancer (SCLC, ~13% of cases) and non-small cell lung cancer (NSCLC, the remaining ~87%) [1]. NSCLC is histologically subdivided into four major subtypes with distinct pathological and molecular characteristics: adenocarcinoma, squamous cell lung cancer, large cell lung cancer and "other" (comprising neuroendocrine cancers, carcinoids etc.) [2]. Of these, adenocarcinoma has recently surpassed squamous cell lung cancer as the most common subtype in the United States, accounting for approximately 40% of NSCLC [3]. The incidence of lung adenocarcinoma is on the rise in many countries, in particular in women [4,5]. Adenocarcinoma is also the most common lung cancer subtype in non- and previous smokers [6]. The 5-year survival of lung cancer patients is only 15%, largely due to the fact that three quarters of lung cancer patients are diagnosed when their disease has spread regionally or distantly [7]. To make an impact on long term survival, better strategies are needed for early detection. Prior experience with chest X-ray, sputum cytology, and fiberoptic examination have failed to decrease lung cancer patient mortality, although several recent strategies show promise. Spiral computed tomography (spiral CT) is one such approach. It allows detailed imaging of the lung, and can detect very small lesions. Recent results from the Early Lung Cancer Action Project (ELCAP) indicate that this approach allows detection of early stage lung cancer [8], but in this and other studies, non-cancerous lesions far outnumber malignancies (less than 10% of lesions are cancer). In addition, it is unclear whether the early stage lung cancers identified by spiral CT represent cancers that would ultimately progress and lead to death. A recent analysis suggests spiral CT screening may not reduce lung cancer mortality [9]. Molecular analyses of plasma, sputum, and bronchial lavage fluids have also shown promise as strategies for early detection, but these methods still lack sensitivity [10]. If molecular markers with high sensitivity and specificity for cancers that will progress can be identified, such markers could be combined with spiral CT to screen high-risk individuals, allowing molecular detection and visualization of clinically relevant early lesions. This would greatly increase the chances of curative resection of lung cancer, while minimizing unnecessary and potentially life-threatening procedures in patients with benign lesions. Of the many potential molecular markers, DNA hypermethylation – an epigenetic alteration – shows great promise. DNA hypermethylation occurs in all cancers, frequently leading to gene silencing through methylation of CpG-rich regions (CpG islands) near the transcriptional start sites of genes [11]. In lung cancer patients, such hypermethylation is quantitatively detectable in a variety of samples ranging from tumor material to blood and sputum [10]. However, to date the penetrance of DNA methylation at any single locus has not been high enough to provide great clinical sensitivity. Our focus is to increase the repertoire of sensitive DNA hypermethylation markers for lung cancer, and to compose a small panel of molecular markers that could be used to detect lung cancer with high sensitivity and specificity. Given the histopathologic, clinical and molecular differences between lung cancer subtypes, we believe that markers should be developed individually for the major histological subtypes. These markers can later be combined into a lung cancer hypermethylation panel that can be used for detection of all lung cancers. Because of its increasing frequency and its preponderance in non- and previous smokers, we focused first on lung adenocarcinoma (AD). Here we describe our evaluation of 28 potential DNA methylation markers using primary human lung adenocarcinoma samples. To ensure that these markers detect cancer-specific hypermethylation changes, associated with histologically visible lung cancer (allowing surgical resection), we compared the DNA methylation profiles of the tumors with histologically normal adjacent lung tissue (AdjNTL) from lung cancer patients. We also examined non-tumor lung from non-cancer patients (NTL).

Results

Ideal DNA hypermethylation markers for lung adenocarcinoma should show a high frequency of methylation in tumors as well as DNA methylation levels that are significantly elevated in tumor compared to non-tumor lung tissue. Environmental exposures, such as those arising from tobacco smoke, could lead to higher basal levels of methylation in non-tumor lung [12], which might affect the background signal when any resulting markers are applied to non-invasive molecular analyses of bodily fluids in the future. To ensure the identification of markers that are more highly methylated in adenocarcinoma even when compared to heavily exposed but histologically cancer-free lung, we used adjacent non-tumor lung (AdjNTL) from lung cancer cases as our cancer-free comparison. The AdjNTL sections were derived from separate, histologically verified cancer-free paraffin blocks. We also examined a number of non-tumor lung (NTL) samples from patients operated for non-cancer reasons (emphysema, lung collapse, etc.). Quantitative assessment of DNA methylation levels allows a more detailed evaluation of candidate DNA methylation markers, and of their suitability for correctly identifying a cancer vs. non-cancer sample. For this reason, we used the bisulfite conversion based real-time PCR technique, MethyLight, to measure DNA methylation in tumor and control tissues [13]. Twenty-eight loci were chosen for evaluation (Table 1). The choice of loci was based on a prescreening of 114 loci carried out on a collection of human lung cancer cell lines, including 11 adenocarcinoma cell lines (unpublished data). We also included many loci that appeared promising based on previous reports describing their DNA methylation in lung cancer or other cancers, so that all markers of interest could be compared on one set of tissues using a single technique and platform. Among others, the 28 loci included CpG islands in the promoters of tumor suppressor genes and genes with important roles in cell cycle regulation, DNA repair, and apoptosis (Table 1).
Table 1

Gene name and function of the 28 loci studied

HUGO acronymaGene NamebFunctionc
APCadenomatosis polyposis coliTumor suppressor.
ATMataxia telangiectasia mutatedTumor suppressor. DNA damage and cell cycle control.
CDH1cadherin 1, type 1, E-cadherin (epithelial)Involved in cell-cell adhesions, mobility and proliferation.
CDH13cadherin 13, H-cadherin (heart)Cell-cell adhesions.
CDKN2A EX2cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4)Tumor suppressor. Cell cycle control. Involved in proliferation and apoptosis.
CDKN2Bcyclin-dependent kinase inhibitor 2B (p15, inhibits CDK4)Cell cycle control.
CDX2caudal type homeobox transcription factor 2Transciptional regulation. Involved in differentiation.
CHFRcheckpoint with forkhead and ring finger domainsCell cycle control. Involved in signaling.
CYP1B1cytochrome P450, family 1, subfamily B, polypeptide 1Electron transport pathway. Involved in development.
ESR1estrogen receptor 1Nuclear hormone receptor. Involved in the regulation of gene expression and affect proliferation and differentiation.
HMGA1high mobility group AT-hook 1Involved in the transcription regulation.
HOXA1homeobox A1Transcription factor. Involved in development.
LZTS1leucine zipper, putative tumor suppressor 1Involved in the regulation of cell growth. Cell cycle control and proliferation. May act as tumor suppressor.
MGMTO-6-methylguanine-DNA methyltransferaseDNA repair.
MT1A, MT2Ametallothionein 1A, 2ABind heavy metals.
OPCMLdopioid binding protein/cell adhesion molecule-likeInvolved in cell contact
PGRprogesterone receptorInvolved in the regulation of gene expression and cellular proliferation and differentiation.
PTENphosphatase and tensin homologTumor suppressor. Involved in cell cycle progression and cell survival. Involved in cell migration and cell spreading.
RASSF1Ras association (RalGDS/AF-6) domain family 1Potential tumor suppressor. Invovled in apoptosis, proliferation, cell cycle progression.
SFRP1, SFRP4, SFRP5secreted frizzled-related protein 1, 4, 5Role in regulating cell growth and differentiation and proliferation. Involved in development.
SLC6A20solute carrier family 6 (proline IMINO transporter), member 20Sodium- and chloride-dependent transporter.
SOCS4suppressor of cytokine signaling 4Involved in signal transduction.
SYKspleen tyrosine kinaseInvolved in B cell response.
TWIST1twist homolog 1 (acrocephalosyndactyly 3; Saethre-Chotzen syndrome) (Drosophila)Transcription factor. Involved in differentiation.
VHLvon Hippel-Lindau tumor suppressorInvolved in transcriptional repression.

aHuman Genome Organization nomenclature

bApproved gene name from Human Genome Organization website

cGene function from GeneCards website

dMethyLight amplicon also targets the HNT CpG island

Gene name and function of the 28 loci studied aHuman Genome Organization nomenclature bApproved gene name from Human Genome Organization website cGene function from GeneCards website dMethyLight amplicon also targets the HNT CpG island The results of the DNA methylation analyses for the 28 loci in 51 AD, 38 AdjNTL and 11 NTL samples are shown in Fig. 1. DNA methylation, expressed as the percentage methylated reference (PMR [14]) is visualized by color coding. Comparison of AD in Fig. 1 panel A with AdjNTL in panel B shows that a number of loci are more heavily methylated in AD. The effect appears to be even more pronounced when AD is compared to NTL from non-cancer patients. Although the exposure history of these NTL samples is unknown, their generally lower DNA methylation levels emphasize that these samples may not be the best controls when searching for loci that show cancer-specific hypermethylation. Interestingly, for one locus, LZTS1 (leftmost locus in Fig. 1), the DNA methylation pattern appeared to be reversed; NTL showed the highest level of DNA methylation, while AD samples were least methylated.
Figure 1

Graphic representation of PMR values obtained for 28 loci in AD (A), AdjNTL (B) and NTL (C). Samples are indicated at the left, loci at thetop. PMR values have been categorized as colored boxes denoting no detectable DNA methylation (blue), DNA methylation below the median of all positive samples of each locus (yellow), and DNA methylation equal to or above the median (red). The black bar at bottom indicates loci showing statistically significant differences in DNA methylation levels between tumor and non-tumor lung.

Graphic representation of PMR values obtained for 28 loci in AD (A), AdjNTL (B) and NTL (C). Samples are indicated at the left, loci at thetop. PMR values have been categorized as colored boxes denoting no detectable DNA methylation (blue), DNA methylation below the median of all positive samples of each locus (yellow), and DNA methylation equal to or above the median (red). The black bar at bottom indicates loci showing statistically significant differences in DNA methylation levels between tumor and non-tumor lung. We applied two-dimensional hierarchical clustering to examine the relationship between the loci and the tumor and non-tumor lung samples (Fig 2; VHL was omitted because it showed no DNA methylation in any samples). All but one of the tumor samples clustered together in a major branch of the dendrogram, while the majority of non-tumor lung samples grouped in a separate cluster. Nine loci, CDH13, SFRP1, OPCML, TWIST1, SFRP5, CDKN2A EX2, CDX2, HOXA1 and RASSF1, clustered together (bottom right), showing heavier DNA methylation in the tumor samples.
Figure 2

Two-dimensional hierarchical clustering of samples and loci based on DNA methylation data. In the center, DNA methylation levels are indicated by a color gradient, with the highest DNA methylation levels for each locus indicated in red and the lowest in deep blue. The Ward hierarchical clustering method was used to categorize between cancer and non-tumor samples. Sample IDs are indicated on the left, with AD samples in red, AdjNTL samples in black, and NTL samples in blue. The relationship of samples is indicated at right in the same color schematic as the labels. At bottom, the relationship of the loci is indicated. Note that all eight of the most significant loci cluster at bottom right.

Two-dimensional hierarchical clustering of samples and loci based on DNA methylation data. In the center, DNA methylation levels are indicated by a color gradient, with the highest DNA methylation levels for each locus indicated in red and the lowest in deep blue. The Ward hierarchical clustering method was used to categorize between cancer and non-tumor samples. Sample IDs are indicated on the left, with AD samples in red, AdjNTL samples in black, and NTL samples in blue. The relationship of samples is indicated at right in the same color schematic as the labels. At bottom, the relationship of the loci is indicated. Note that all eight of the most significant loci cluster at bottom right. We next analyzed the statistical significance of the differences in DNA methylation levels for individual markers and different combinations of tissue samples (Table 2): AD vs. all NTL samples, AD vs. AdjNTL, and AD vs. paired AdjNTL (32 of the 38 AdjNTL samples were derived from the AD patients in Fig. 1A). The paired AdjNTL form an exquisite control for the cancer-specific nature of the observed DNA methylation changes, as each of these samples conforms to its tumor sample in patient age, environmental exposure, and genetic background. To avoid assigning statistical significance to spurious associations, we incorporated a multiple comparisons threshold for those loci that at time of analysis lacked any prior data suggesting they might be hypermethylated in lung adenocarcinoma (Table 2, before-last column, [15] see Materials and Methods for details). Thirteen of the analyzed loci showed statistically significant differences in DNA methylation when AD samples were compared to all NTL samples: OPCML, CDX2, HOXA1, CDKN2A EX2, SFRP1, CDH13, TWIST1, LZTS1, RASSF1, SFRP4 and 5, ESR1, and CDH1. All of these except CDH1 remained significant when AD samples were compared to AdjNTL, while all except LZTS1 remained significant in the comparison of AD to paired AdjNTL. Because DNA methylation of LZTS1 is reduced in tumors it is not a candidate for a positive lung adenocarcinoma marker and it was not studied further at this time. APC methylation was found to be statistically significantly different only when paired tumor and non-tumor lung samples were compared. This suggests that basal DNA methylation is high but variable at this locus; elevated DNA methylation in tumors is likely masked by interpatient variability and only becomes visible when samples from the same patient are compared. Indeed, Waki and coworkers have observed frequent DNA methylation of APC in non-cancer lung and other organs [16].
Table 2

Frequency and median PMR values of AD, Adj NTL and NTL tissues for 28 loci

HUGOaFrequencybMedianfp-valueg
ADc n = 51Adj NTLd n = 38NTLe n = 11AD n = 51Adj NTLn = 38NTLn = 11AD vs All NTLAD vs AdjNTLAD vs matched NTLBH-MC ThresholdhImportance Measurei




OPCML987936107.158.9310.029E-158E-132E-100.00257.38
CDX2100j66943.973.400.124E-132E-108E-100.00503.49
HOXA1947136160.004.900.122E-122E-102E-10N/A8.52
CDKN2A EX210010082191.0746.1727.396E-124E-101E-10N/A5.80
SFRP194k8736132.9310.014.781E-102E-081E-090.00752.62
CDH137845039.654.9578.054E-089E-072E-08N/A2.14
TWIST182669392.166.878.031E-078E-061E-070.01002.77
LZTS1100100100107.75170.95210.955E-060.00030.02620.01251.52
RASSF169k58992.530.865.456E-050.00101E-07N/A1.83
SFRP467k4293.251.038.710.00050.00520.00860.01500.39
SFRP590k924514.385.784.590.00050.00950.00100.01750.72
ESR149k3295.721.290.630.00490.02830.0007N/A0.33
CDH194895517.6212.7410.220.00920.06860.0151N/A0.51
SLC6A20251197.810.38154.450.04260.05690.02440.02000.10
PGR145059.926.95N/A0.08500.17530.43750.02250.08
MT1A100100100104.45108.58112.440.22760.42830.74220.02500.84
MT2A88765512.3112.2215.480.26500.45680.46220.02750.37
ATM7568270.160.190.030.29540.94980.51310.03000.07
PTEN292601.210.85N/A0.33190.79760.80770.03250.07
SYK2924180.420.153.340.43140.50020.89040.03500.07
CDKN2B9897918.2910.058.480.44000.24220.53350.03750.44
CYP1B12926181.801.260.130.44810.65650.14540.04000.11
CHFR8500.491.65N/A0.45600.66771.00000.04250.04
APC80978215.104.6010.810.59990.59230.0394N/A1.89
SOCS412j13180.251.7085.450.60400.76370.31250.04500.13
MGMT16180224.539.22N/A0.71410.89880.2031N/A0.07
HMGA120j18270.190.020.120.96420.76450.62210.04750.00001
VHL000N/AN/AN/A1.00001.00001.00000.05000

a HUGO, Human Genome Organization nomenclature sorted by AD vs All NTL p-value with the most significant at the top.

bPercentage of samples with positive methylation value.

cAD, Adenocarcinoma

dAdj NTL, Adjacent non-tumor lung from adenocarcinoma patients

eNTL, Non-tumor lung from non-cancer patients

fMedian percent methylated reference calculated from positive methylation values

gStatistically significant numbers are highlighted in bold; AD vs. all NTL and AD vs. AdjNTL: Wilcoxon rank sum test; AD vs. matched NTL: Wilcoxon signed rank test

hBH-MC threshold, Benjamini- Hochberg multiple comparison threshold p-value

iImportance Measure based on random forest analysis

jn = 50

kn = 49

Frequency and median PMR values of AD, Adj NTL and NTL tissues for 28 loci a HUGO, Human Genome Organization nomenclature sorted by AD vs All NTL p-value with the most significant at the top. bPercentage of samples with positive methylation value. cAD, Adenocarcinoma dAdj NTL, Adjacent non-tumor lung from adenocarcinoma patients eNTL, Non-tumor lung from non-cancer patients fMedian percent methylated reference calculated from positive methylation values gStatistically significant numbers are highlighted in bold; AD vs. all NTL and AD vs. AdjNTL: Wilcoxon rank sum test; AD vs. matched NTL: Wilcoxon signed rank test hBH-MC threshold, Benjamini- Hochberg multiple comparison threshold p-value iImportance Measure based on random forest analysis jn = 50 kn = 49 Of the thirteen significant loci, OPCML, CDX2, HOXA1, CDKN2A EX2, SFRP1, CDH13, TWIST1 and RASSF1 show considerable promise as cancer-specific DNA methylation markers, exhibiting highly significant hypermethylation in tumors compared to paired non-tumor tissues (p ≤ 1 × 10-7, Table 2). All eight of these loci grouped together in the hierarchical clustering (Fig. 2). The ability of the top four candidates, CDKN2A EX2, CDX2, HOXA1 and OPCML (all p < 1 × 10-9), to individually identify lung cancer samples was next evaluated. Fig. 3 shows the distribution of PMR values in the examined sample collection. Note that for all four markers, the mean value in non-tumor lung from non-cancer patients is lower than that of adjacent non-tumor lung from lung cancer patients. This emphasizes the importance of using histologically normal tissue adjacent to lung cancer for comparison; such tissue may show higher basal DNA methylation levels while appearing histologically normal, and should be used for comparison with lung cancer tissue to ensure identification of cancer-specific markers. While all four markers show increased DNA methylation in adenocarcinoma compared to adjacent non-tumor tissue, the spread of DNA methylation levels differs, which would affect their sensitivity and specificity in future detection strategies. The marker potential of quantitative markers is frequently presented in the form of a receiver operating characteristic (ROC) curve, in which sensitivity vs. 1-specificity at all possible cut-off values is plotted. While these DNA methylation markers are ultimately intended for the non-invasive analysis of patient bodily fluids, a preliminary indication of their potential to sensitively and specifically detect cancer could be obtained by plotting ROC curves using the PMR values from the tumor vs. adjacent non-tumor samples. Fig. 4 shows that the area under the curve (AUC, and indicator of marker performance that would be 1 for a marker showing 100% specificity and sensitivty) is 0.87–0.95 for the four top loci.
Figure 3

The distribution of PMR values by group. Log-transformed PMR values for AD, AdjNTL and NTL are shown. The mean is shown by the wide horizontal line, and the top and bottom of the diamond indicate a 95% normal confidence interval for the sample mean.

Figure 4

Receiver operating characteristic curves for the four top markers. All AD and AdjNTL lung samples for which there was complete DNA methylation data were used for the analysis.

The distribution of PMR values by group. Log-transformed PMR values for AD, AdjNTL and NTL are shown. The mean is shown by the wide horizontal line, and the top and bottom of the diamond indicate a 95% normal confidence interval for the sample mean. Receiver operating characteristic curves for the four top markers. All AD and AdjNTL lung samples for which there was complete DNA methylation data were used for the analysis. Despite the promising AUC values, the sensitivity and specificity of these top four markers, used individually and determined using the current sample collection in a five-fold cross-validation, was limited: 67–86% and 74–82% respectively. This supports the notion that DNA hypermethylation markers are best used in the form of a panel. Because of the costs associated with quantitative molecular analyses, it would be important to limit the number of markers included in the panel. To determine which combinations of markers would be most effective to correctly identify tumor vs. non-tumor samples, we fit a random forest classifier to the data set, using 87 samples and 28 variables (2 AD samples with missing PMR data were omitted, resulting in 49 AD vs. 38 AdjNTL). Using bootstrap samples of the data, we grew a forest of 30,000 trees. Splits were determined using a random sample of five variables and trees were grown until there was only one observation in each leaf. Utilizing all 28 loci, we estimated a sensitivity of 92% and a specificity of 95%. Using the Gini index from the random forest classifier (last column, Table 2) to measure locus importance, we restricted our analysis to the most highly ranked variables. Reducing the locus number to 13 did not affect sensitivity and specificity, and limiting our markers to the top-ranked four (HOXA1, OPCML, CDKN2AEX2 and CDX2, which were also the most significant based on our statistical analysis) resulted in a sensitivity of 94% and a specificity of 90%. Thus, these four markers appear to be highly promising DNA hypermethylation markers for development into non-invasive molecular markers of lung adenocarcinoma, through examination of DNA shed into bodily fluids such as sputum, bronchioalveolar lavage, or blood. For candidate hypermethylation markers of lung adenocarcinoma, two important questions arise. First, are these markers hypermethylated in cancer samples irrespective of the subject's age, gender and racial/ethnic background? And secondly, are these markers hypermethylated even in the earliest stages of lung adenocarcinoma? While the population analyzed in the current study is small, we reasoned that an indication of the potential of our top four markers to broadly identify lung adenocarcinoma might be obtained. To address the first question, we assessed correlations to age and determined whether each of the four markers showed statistically significant hypermethylation in tumor vs. adjacent normal tissues in men, women, and all four racial/ethnic groups. We found no correlation of methylation of CDKN2A EX2, CDX2, HOXA1 and OPCML with the age. In addition, all four markers remained significantly hypermethylated in tumor vs. AdjNTL when subjects were stratified by gender or by ethnic group (p < 0.05) (Table 3). The only exception was CDKN2A EX2 methylation in Asian subjects (p = 0.11), which may be related to the small sample size but will need to be further explored.
Table 3

Performance of top four markers in samples based on gender, race/ethnicity and stage

Median PMR, Tumor TumorMedian PMR, AdjNTLp-valuea
GENDER
Malen = 28n = 19
CDKN2A EX2199.4451.885.0E-06
CDX228.404.782.0E-05
HOXA1139.515.064.6E-05
OPCML76.6211.271.6E-06
Femalen = 14n = 10
CDKN2A EX2189.1638.641.3E-04
CDX2114.713.154.0E-06
HOXA1188.131.343.1E-06
OPCML154.325.363.1E-06
RACE
White Hispanicn = 14n = 10
CDKN2A EX2165.4737.526.0E-04
CDX239.491.982.0E-04
HOXA152.981.757.7E-03
OPCML142.568.176.0E-04
White Non-Hispanicn = 14n = 10
CDKN2A EX2314.3538.005.0E-04
CDX2110.734.780.001
HOXA1231.664.904.6E-05
OPCML179.796.544.7E-05
Blackn = 11n = 7
CDKN2A EX2194.9151.880.015
CDX2129.969.110.024
HOXA1128.526.160.011
OPCML91.1514.290.005
Asiann = 6n = 4
CDKN2A EX2145.5639.810.11
CDX224.281.780.023
HOXA1117.490.990.014
OPCML88.814.670.014
STAGEb
Stage IAn = 12n = 12
CDKN2A EX2182.5747.284.9E-04
CDX299.713.154.9E-04
HOXA1143.063.250.001
OPCML189.437.654.9E-04
Stage IBn = 6n = 6
CDKN2A EX2190.6761.000.031
CDX220.6213.360.31
HOXA178.982.150.063
OPCML76.6215.160.063
Stage IIA/IIB/IIIAcn = 10n = 10
CDKN2A EX2208.0539.810.01
CDX283.221.780.002
HOXA1180.186.160.002
OPCML178.548.250.002

ap-value calculated by Mann-Whitney for gender and race and Wilcoxon signed rank test for stage, in which paired adjacent samples were used. Italics: p > 0.05; b Paired adjacent samples; c Only one paired sample was available for stages IIA and IIIA and none for IIIB, hence IIA/IIB/IIIA were pooled.

Performance of top four markers in samples based on gender, race/ethnicity and stage ap-value calculated by Mann-Whitney for gender and race and Wilcoxon signed rank test for stage, in which paired adjacent samples were used. Italics: p > 0.05; b Paired adjacent samples; c Only one paired sample was available for stages IIA and IIIA and none for IIIB, hence IIA/IIB/IIIA were pooled. To address the second question, we determined whether each of the four markers was significantly hypermethylated in early and later stage tumors, using paired samples (Table 3). We examined stages IA and IB individually, but grouped stages IIA, IIB, and IIIA (only one paired sample was available for IIA and IIIA, and none for stage IIIB). Importantly, all four markers were significantly hypermethylated in stage IA cancers. Only CDKN2A EX2 hypermethylation was significant in stage IB tumors, but this could be due to the small number of paired samples (n = 6). All markers were also significantly hypermethylated in later stage lung adenocarcinoma (Stages IIA-IIIA). These analyses indicate that the top four markers show high potential for identification of lung adenocarcinoma, even in its earliest stages, an important characteristic if these markers are to be used for early detection. To determine whether any of the four top markers might have prognostic implications, we determined whether there was any relationship between their DNA methylation level and survival. We found no significant association between DNA methylation and survival for the four loci, or any of the other 24 loci studied (data not shown).

Discussion

Based on the results of our analyses, four loci that are very strong candidates for a DNA methylation panel aimed at early lung adenocarcinoma detection have been identified: CDKN2A EX2, CDX2, HOXA1 and OPCML. CDNK2A, also referred to as p16, encodes an important cell cycle regulator that is frequently inactivated in cancer. CDKN2A is one of the first tumor suppressor genes found to be methylated in a variety of cancers, including lung cancer [17]. It is one of the most widely studied hypermethylated loci, and methylation of its promoter CpG island appears to be a very early event in the development of non-small cell lung cancer (recently reviewed in [10]). In fact, methylation of the CDKN2A promoter CpG island has been observed in the sputum of subjects at risk for lung cancer 3 years prior to diagnosis [18] and in the sputum of asymptomatic heavy smokers [19]. A recent analysis of prospectively collected sputum showed CDKN2A methylation in 39% of cases and 25% of controls; methylation of this gene was associated with an elevated risk of lung cancer [20]. It is thought that DNA methylation observed in the sputum is indicative of field cancerization of the airways and not necessarily a symptom of a present cancer [20]. Our goal was to identify cancer-specific markers, not risk markers. We had evaluated DNA methylation of the CDKN2A promoter CpG island as a cancer indicator, but found substantial DNA methylation in AdjNTL, and no significant difference between AdjNTL and cancer (data not shown). Based on the cancer-specific hypermethylation of the CDKN2A exon 2 CpG island observed in colorectal and bladder cancers [21,22], we tested this downstream island instead. We established that its level of DNA methylation is a strong indicator of lung adenocarcinoma. While substantial methylation at the exon 2 CpG island is detected in histologically normal AdjNTL, by comparison, DNA methylation in adenocarcinoma is highly significantly elevated (p ≤ 1 × 10E-10). The detection of CDKN2A methylation in a high fraction of lung cancer patient plasma samples bodes well for its application to non-invasive detection [23]. Two groups reported an association of DNA methylation of CDKN2A with poor survival in adenocarcinoma/NSCLC patients [24,25], while Divine et al (2005), like us, reported no such association [12]. The differences between the obtained results might be due to the examination of a different CpG island or a different population. DNA methylation of HOX genes, encoding homeobox transcription factors involved in embryogenesis and differentiation, had recently been observed in lung adenocarcinoma and squamous cell lung cancer. In an analysis of eight adenocarcinomas and matching adjacent lung, substantial DNA methylation of the HOXA and D clusters was observed [26]. Five cancer samples showed DNA methylation of HOXA1, while only one AdjNTL sample was methylated at this locus. In a different study, analysis of a stage I adenocarcinoma and squamous cell lung carcinoma showed DNA methylation of the HOX clusters, and examination of the HOXA and D clusters in more detail in squamous cell cancers and control tissue indicated a DNA methylation frequency of 45–80% for HOXA7-9, but methylation of HOXA1 was limited [27]. Neither of these studies examined a large number of adenocarcinomas, nor were quantitative techniques used. Here we demonstrate that HOXA1 is a very promising DNA methylation marker for lung adenocarcinoma. We have also observed DNA methylation of additional HOX genes (unpublished studies), but HOXA1 appears to be particularly informative. OPCML, encoding an opioid-binding cell adhesion molecule, has been shown to be frequently methylated in ovarian cancer [28]. Given that opioids have demonstrated growth inhibitory and pro-apoptotic effects in lung cancer cells [29-31], it is perhaps not surprising that the OPCML promoter CpG island might be a target for DNA methylation in lung cancer. Very recently, high throughput DNA methylation profiling of 11 lung adenocarcinomas and control lung identified a number of CpG dinucleotides methylated in the cancer samples [32]. One probe identified DNA methylation in the area covered by the OPCML probe used here. Although the OPCML locus was not studied in detail in the Bibikova study, the observed methylation supports the idea that OPCML is a strong candidate marker in lung adenocarcinoma. CDX2, another homeobox transcription factor, had been described to be methylated in squamous esophageal cancer [33] and colorectal carcinoma [34], but to our knowledge, its DNA methylation in lung cancer has never been examined. We find it to be methylated in 100% of lung adenocarcinomas, showing a 10-fold higher median methylation than AdjNTL tissue (Table 2). Because our primary goal is marker development, here we focused only on whether loci showed consistent hypermethylation. Whether or not this hypermethylation results in gene inactivation is not relevant for the use of these loci as DNA methylation markers, and was not determined. However, the biological consequences of the observed hypermethylation would also be worth investigating. While each of the four top-ranked loci is of interest as a DNA methylation marker, it is as a panel that they promise to be most powerful. To our knowledge, we are the first to examine CDKN2A EX2, CDX2, HOXA1 and OPCML in combination. The fact that this marker set allows identification of cancer specimens in the current tissue collection with a substantially higher sensitivity and specificity than any previously identified single markers underlines the importance of developing suitable marker panels.

Conclusion

From a starting panel of 28 DNA methylation loci, we have identified 13 that show statistically significant methylation differences between lung adenocarcinoma and non-cancer lung tissue. Of these, 8 show highly significant differences. The four most significant markers also ranked as the top four to be used in a marker panel, as determined by a random forest approach. Thus, we suggest that CDKN2A EX2, CDX2, HOXA1 and OPCML are the top candidates from the 28 tested, and should be validated as DNA methylation markers for lung adenocarcinoma. These validations should consist of examining a sufficiently large number of new subjects representing both genders and all four major ethnic/racial subgroups in the United States (Whites of non-Hispanic and Hispanic descent, Blacks, and Asians), as well as early and late stage cancer. Such studies are currently ongoing. Our analyses of the present sample collection, which contains modest numbers of representatives from all these groups, is very encouraging as they suggest that the markers function independently of subject age, gender or ethnic subgroup, and are positive in early stage cancer. The next step would entail the exploration of different methods to measure these markers non-invasively in early stage lung cancer patients. Potential "remote" media to be considered are sputum, bronchioalveolar lavage, and blood plasma, all of which we are in the process of collecting for examination. The ability of our four-marker panel to clinically detect lung cancer with high sensitivity and specificity will depend on many factors. A loss of sensitivity might be foreseen due to the small amounts of DNA shed into the blood of each patient, but at the same time, an increase in specificity might be expected if tumor DNA is shed more readily into the bloodstream than DNA from adjacent histologically normal tissue. To our knowledge, CDKN2A EX2, CDX2, HOXA1 and OPCML constitute the strongest lung adenocarcinoma DNA methylation markers identified to date, and we are working on further evaluations of their potential with great anticipation.

Methods

Study subjects

Lung adenocarcinoma and when available adjacent non-tumor lung was obtained from archival paraffin blocks from 51 subjects who had been treated at three Los Angeles hospitals: the Los Angeles County Hospital, the USC University Hospital and the Norris Comprehensive Cancer Center. Clinical information was missing for 5 patients. Of the rest, 28 were male and 18 were female, 14 were White of non-Hispanic descent, 14 White of Hispanic descent, 11 were Black, and 7 were of various Asian origins. Ages ranged from 37–82 years old at time of surgery (median: 58 years old). For 32 of these cases, a separate paraffin block containing histologically verified cancer-free lung was available. These adjacent non-tumor lung samples were supplemented with 6 additional cancer-free archival samples from lung cancer patients for which the tumor block was unavailable, and 11 archival non-tumor lung samples from patients operated for non-cancer reasons, such as pneumothorax or emphysema. All studies were institutionally approved by the University of Southern California Institutional Review Board (IRB# HS-016041, HS-06-00447), and the identities of patients were not made available to laboratory investigators.

Tissue samples and DNA extraction

Hematoxylin and eosin-stained slides were reviewed by an experienced lung pathologist (MNK) to support the original classification of the tumor and to select optimal tumor and non-tumor areas of the specimens. DNA was extracted from microdissected tumor and non-tumor lung samples via proteinase K digestion [35]. Briefly, cells were lysed in a solution containing 100 mmol/L Tris-HCl (pH 8.0), 10 mmol/L EDTA (pH 8.0), 1 mg/mL proteinase K, and 0.05 mg/mL tRNA and incubated at 50°C overnight. The DNA was bisulfite converted as previously described [13].

DNA methylation analysis

DNA methylation analysis was done by MethyLight as previously described [14]. Primer and probe sequences were as described [14,36,37]. In addition to primers and probe sets designed specifically for the gene of interest, an internal reference primer and probe set designed to analyze Alu repeats (Alu) was included in the analysis to normalize for input DNA [38]. The percentage methylated reference (PMR) was calculated as the GENE:reference ratio of a sample divided by the GENE:reference ratio of in vitro methylated (SssI-treated) human white blood cell DNA and multiplying by 100 [14]. Occasionally, PMR values over 100 were observed. This can happen when genes are very heavily methylated in the cancer sample, while the SssI-treated sample (in spite of extensive in vitro DNA methylation) is not fully methylated at that locus. This does not affect the significance of the loci identified in this study, as the same batch of SssI-treated DNA was used throughout the study.

Statistical analyses

PMR values of AD were compared to AdjNTL and NTL lung as continuous variables by means of the Wilcoxon rank sum test. For the comparison of paired AD and AdjNTL samples from the same patients, the Wilcoxon signed rank test was used. To control the false discovery rate at 5%, a multiple comparisons threshold was set. It was only applied to those 20 loci for which no previous information supporting a hypothesis of DNA methylation in lung AD was available [15]. Receiver operating characteristic (ROC) curves were plotted using the AD vs. all AdjNTL lung PMR values and JMP 6.0 software (SAS Institute, Cary, NC). The distribution of PMR values by group (AD, AdjNTL and NTL) were shown using log-transformed data in JMP 6.0. The two-dimensional hierarchical clustering was carried out using JMP 6.0 and log-transformed PMR values. VHL, which was negative in all specimens, was omitted from the clustering analysis. Associations between age, gender and race of AD cases were tested by dichotomizing the subjects either by the presence/absence of DNA methylation, or, if the samples were frequently methylated, by the median of all positive PMR values. All statistical tests were two-sided. To determine which combinations of markers would be most effective to correctly identify tumor vs. non-tumor samples, we fit a random forest classifier to the data set, using the R programming language (v 2.5 [39]) and 87 samples and 28 variables (2 AD samples with missing PMR data were omitted, resulting in 49 AD/38 AdjNTL). Using bootstrap samples of the data, we grew a forest of 30,000 trees. Splits were determined using a random sample of five variables and trees were grown until there was only one observation in each leaf. We determined error rates using the observations that were not used to generate the trees. For each observation, its outcome was predicted by having the majority vote from the trees that were generated without the original data point in their bootstrap sample. These predicted values were compared against the true tissue type to estimate prediction error.

Competing interests

IALO and PWL are shareholders of Epigenomics AG, which has a commercial interest in the development of DNA markers for disease detection and diagnosis. None of the work performed in the laboratories of the authors is or has been supported or directed by Epigenomics.

Authors' contributions

JAT was involved in marker design, experimental execution and initial analysis. JSG was involved in experimental execution and extensive data analysis, drafting the manuscript, and generation of figures. KDS oversaw statistical analysis and drafted statistical sections of the manuscript. PWL provided experimental advice and mentoring for JAT. ST helped locate and section most of the tissues and provided the linked and de-identified clinicopathological information. WC provided additional samples through the Norris Cancer Center's Los Angeles area tissue discard repository. JAH provided non-cancer lung samples and statistical discussions. MNK reviewed all slides prior to microdissection. IALO designed the study, oversaw all aspects of the project, mentored JAT and JSG, and revised manuscript drafts. All authors reviewed and commented on the manuscript during its drafting and approved the final version.
  34 in total

1.  A comparison of trends in the incidence rate of lung cancer by histological type in the Osaka Cancer Registry, Japan and in the Surveillance, Epidemiology and End Results Program, USA.

Authors:  Itsuro Yoshimi; Akira Ohshima; Wakiko Ajiki; Hideaki Tsukuma; Tomotaka Sobue
Journal:  Jpn J Clin Oncol       Date:  2003-02       Impact factor: 3.019

Review 2.  The power and the promise of DNA methylation markers.

Authors:  Peter W Laird
Journal:  Nat Rev Cancer       Date:  2003-04       Impact factor: 60.716

3.  MethyLight: a high-throughput assay to measure DNA methylation.

Authors:  C A Eads; K D Danenberg; K Kawakami; L B Saltz; C Blake; D Shibata; P V Danenberg; P W Laird
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

4.  HOX gene clusters are hotspots of de novo methylation in CpG islands of human lung adenocarcinomas.

Authors:  Masahiko Shiraishi; Azumi Sekiguchi; Adam J Oates; Michael J Terry; Yuji Miyamoto
Journal:  Oncogene       Date:  2002-05-16       Impact factor: 9.867

5.  Progressive increases in de novo methylation of CpG islands in bladder cancer.

Authors:  C Salem; G Liang; Y C Tsai; J Coulter; M A Knowles; A C Feng; S Groshen; P W Nichols; P A Jones
Journal:  Cancer Res       Date:  2000-05-01       Impact factor: 12.701

6.  Predicting lung cancer by detecting aberrant promoter methylation in sputum.

Authors:  W A Palmisano; K K Divine; G Saccomanno; F D Gilliland; S B Baylin; J G Herman; S A Belinsky
Journal:  Cancer Res       Date:  2000-11-01       Impact factor: 12.701

7.  Susceptibility of nonpromoter CpG islands to de novo methylation in normal and neoplastic cells.

Authors:  C Nguyen; G Liang; T T Nguyen; D Tsao-Wei; S Groshen; M Lübbert; J H Zhou; W F Benedict; P A Jones
Journal:  J Natl Cancer Inst       Date:  2001-10-03       Impact factor: 13.506

8.  Adenocarcinoma of the lung in young patients: the M. D. Anderson experience.

Authors:  N S Liu; M R Spitz; B L Kemp; C Cooksley; F V Fossella; J S Lee; W K Hong; F R Khuri
Journal:  Cancer       Date:  2000-04-15       Impact factor: 6.860

9.  Hierarchical clustering of lung cancer cell lines using DNA methylation markers.

Authors:  Arvind K Virmani; Jeffrey A Tsou; Kimberly D Siegmund; Linda Y C Shen; Tiffany I Long; Peter W Laird; Adi F Gazdar; Ite A Laird-Offringa
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2002-03       Impact factor: 4.254

10.  Age-related methylation of tumor suppressor and tumor-related genes: an analysis of autopsy samples.

Authors:  Takayoshi Waki; Gen Tamura; Makoto Sato; Teiichi Motoyama
Journal:  Oncogene       Date:  2003-06-26       Impact factor: 9.867

View more
  46 in total

Review 1.  Epigenetics of lung cancer.

Authors:  Scott M Langevin; Robert A Kratzke; Karl T Kelsey
Journal:  Transl Res       Date:  2014-03-12       Impact factor: 7.012

2.  Promoter methylation of TCF21 may repress autophagy in the progression of lung cancer.

Authors:  Baokun Chen; Chao Zeng; Yiwang Ye; Da Wu; Zhimin Mu; Jixian Liu; Yuancai Xie; Hao Wu
Journal:  J Cell Commun Signal       Date:  2017-10-30       Impact factor: 5.782

3.  Study on the Interaction of the CpG Alternating DNA with CdTe Quantum Dots.

Authors:  Morteza Hosseini; Freshteh Khaki; Ehsan Shokri; Hossein Khabbaz; Mehdi Dadmehr; Mohammad Reza Ganjali; Mina Feizabadi; Davood Ajloo
Journal:  J Fluoresc       Date:  2017-08-25       Impact factor: 2.217

4.  Deletions of 11q22.3-q25 are associated with atypical lung carcinoids and poor clinical outcome.

Authors:  Dorian R A Swarts; Sandra M H Claessen; Yvonne M H Jonkers; Robert-Jan van Suylen; Anne-Marie C Dingemans; Wouter W de Herder; Ronald R de Krijger; Egbert F Smit; Frederik B J M Thunnissen; Cornelis A Seldenrijk; Aryan Vink; Aurel Perren; Frans C S Ramaekers; Ernst-Jan M Speel
Journal:  Am J Pathol       Date:  2011-07-16       Impact factor: 4.307

5.  Validation of SCT Methylation as a Hallmark Biomarker for Lung Cancers.

Authors:  Yu-An Zhang; Xiaotu Ma; Adwait Sathe; Junya Fujimoto; Ignacio Wistuba; Stephen Lam; Yasushi Yatabe; Yi-Wei Wang; Victor Stastny; Boning Gao; Jill E Larsen; Luc Girard; Xiaoyun Liu; Kai Song; Carmen Behrens; Neda Kalhor; Yang Xie; Michael Q Zhang; John D Minna; Adi F Gazdar
Journal:  J Thorac Oncol       Date:  2015-12-25       Impact factor: 15.609

6.  Expression of ovarian tumour suppressor OPCML in the female CD-1 mouse reproductive tract.

Authors:  Jean S Fleming; H James McQuillan; Melanie J Millier; Grant C Sellar
Journal:  Reproduction       Date:  2009-01-27       Impact factor: 3.906

Review 7.  The EphA2 receptor and ephrinA1 ligand in solid tumors: function and therapeutic targeting.

Authors:  Jill Wykosky; Waldemar Debinski
Journal:  Mol Cancer Res       Date:  2008-12       Impact factor: 5.852

8.  Multiplexed methylation profiles of tumor suppressor genes and clinical outcome in lung cancer.

Authors:  Mónica Castro; Laura Grau; Patricia Puerta; Liliana Gimenez; Julio Venditti; Silvia Quadrelli; Marta Sánchez-Carbayo
Journal:  J Transl Med       Date:  2010-09-17       Impact factor: 5.531

Review 9.  Genomic and proteomic biomarkers for cancer: a multitude of opportunities.

Authors:  Michael A Tainsky
Journal:  Biochim Biophys Acta       Date:  2009-05-04

Review 10.  Twist: a molecular target in cancer therapeutics.

Authors:  Md Asaduzzaman Khan; Han-chun Chen; Dianzheng Zhang; Junjiang Fu
Journal:  Tumour Biol       Date:  2013-07-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.