Literature DB >> 18992152

Gene expression profiling for molecular distinction and characterization of laser captured primary lung cancers.

Astrid Rohrbeck1, Judith Neukirchen, Michael Rosskopf, Guillermo G Pardillos, Helene Geddert, Andreas Schwalen, Helmut E Gabbert, Arndt von Haeseler, Gerald Pitschke, Matthias Schott, Ralf Kronenwett, Rainer Haas, Ulrich-Peter Rohr.   

Abstract

METHODS: We examined gene expression profiles of tumor cells from 29 untreated patients with lung cancer (10 adenocarcinomas (AC), 10 squamous cell carcinomas (SCC), and 9 small cell lung cancer (SCLC)) in comparison to 5 samples of normal lung tissue (NT). The European and American methodological quality guidelines for microarray experiments were followed, including the stipulated use of laser capture microdissection for separation and purification of the lung cancer tumor cells from surrounding tissue.
RESULTS: Based on differentially expressed genes, different lung cancer samples could be distinguished from each other and from normal lung tissue using hierarchical clustering. Comparing AC, SCC and SCLC with NT, we found 205, 335 and 404 genes, respectively, that were at least 2-fold differentially expressed (estimated false discovery rate: < 2.6%). Different lung cancer subtypes had distinct molecular phenotypes, which also reflected their biological characteristics. Differentially expressed genes in human lung tumors which may be of relevance in the respective lung cancer subtypes were corroborated by quantitative real-time PCR. Genetic programming (GP) was performed to construct a classifier for distinguishing between AC, SCC, SCLC, and NT. Forty genes, that could be used to correctly classify the tumor or NT samples, have been identified. In addition, all samples from an independent test set of 13 further tumors (AC or SCC) were also correctly classified.
CONCLUSION: The data from this research identified potential candidate genes which could be used as the basis for the development of diagnostic tools and lung tumor type-specific targeted therapies.

Entities:  

Mesh:

Year:  2008        PMID: 18992152      PMCID: PMC2613386          DOI: 10.1186/1479-5876-6-69

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   5.531


Background

Lung cancer represents a heterogeneous group of diseases in terms of their biology and the clinical course. The diagnosis and classification of lung cancers are primarily based on the histological morphology and immunohistological methods for distinguishing between small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC) [1]. The molecular pathogenesis of lung cancer, as far as it has been deciphered, consists of genetic and epigenetic alterations, including the activation of proto-oncogenes and inactivation of tumor suppressor genes [2-4]. This leads to a malignant phenotype, resulting in changes in cell structure, adhesion and cell proliferation [5]. Oligonucleotide microarray studies are commonly used to extend the knowledge of the differences in the biology of lung tumors and to identify new candidate genes with diagnostic, prognostic and therapeutic value [6-9]. Several gene expression profiling studies in lung cancer have been published, however, it is still difficult to compare these studies due to the differences in methodologies, array platforms, normalization of the data and biostatistical analyses approaches, which may influence the reproducibility and comparability [10-12]. Such differences could have led to divergent results, with limited overlap of described genes. Another crucial step in the field of oligonucleotide microarray studies is the preparation of the solid tumor sample itself. It contains a variable amount of mesenchymal stroma cells, blood vessels, fibroblasts, tumor-invading lymphocytes and necrotic areas next to the tumor cells themselves. Analyzing the complete tumor sample without efficient separation of the tumor cell confounds the true gene expression profile of the tumor. In order to overcome these methodological limitations, we followed the guidelines from the Microarray Gene Expression Data Society [13] and the MicroArray Quality Control (MAQC) Consortium [14,15], the External RNA Controls Consortium (ERCC) [16] as well as the European consensus guidelines for gene expression experiments [17]. The purification of the tumor cells was carried out by laser capture microdissection (LCM), which has been shown to greatly improve the sample preparation for microarray expression analysis [18]. Few reports on LCM and microarray gene expression analysis have been published to date, comparing all distinct lung cancer subtypes to normal lung tissue [19-21]. In this report, we performed a comparison of gene expression profiles, using microarray analysis and LCM, according to the methodological quality consensus guidelines for microarray experiments, with the aim of identifying genes that are differentially expressed in the major histological lung cancer subtypes, as compared to normal lung tissue. In addition, 14 differentially expressed genes in human lung tumors were corroborated by quantitative real-time PCR. Furthermore, using genetic programming, we found a subset of 40 genes, that could be utilized for the classification of different types of lung tumors.

Materials and methods

Lung tumor samples

Samples of lung tumors were obtained using bronchoscopy or CT-guided needle aspiration from 29 patients, newly diagnosed patients with lung cancer. The samples that were immediately fixed in RNA-later consisted of 10 adenocarcinomas, 10 squamous cell carcinomas and 9 small cell lung carcinomas. Control samples of normal lung tissue were obtained from 5 patients with suspected tuberculosis or sarcoidosis, without presence of malignant lung tumors. The histopathological diagnosis was based on routinely processed hematoxylin-eosin stains and confirmed by immunohistochemical staining looking for pan-cytokeratin, cytokeratin 5 and 7, chromogranin A, synaptophysin and tissue-transcripion-factor-1. For validation of the classificator from genetic programming, 13 lung cancer samples were selected as a test-set from patients with advanced NSCLC lung cancers. All patients gave their informed consent and the study was approved by the ethics committee of the Heinrich-Heine University, Duesseldorf.

Laser capture microdissection

From each frozen tumor tissue, we prepared 8-μm thick sections. The sections were fixed in methanol/acetic acid and stained in hematoxylin. The tumor cells were identified and ascertained in the sample by an experienced pathologist using the Autopix 100 automated LCM system and collected on a CapSure HS LCM Cap (Arcturus, Mount View, CA). Following microdissection, total RNA-extraction was performed with the RNeasy Micro Kit (QIAamp DNA MicroKit Qiagen, Santa Clarita, CA, USA), according to the manufacturer's instruction. A standard quality control of the total RNA was performed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, USA).

RNA isolation, cRNA labeling and hybridization to microarrays

The described procedures strictly adhered to the guidelines from the Microarray Gene Expression Data Society and the MicroArray Quality Control (MAQC) Consortium, the External RNA Controls Consortium (ERCC), as well as the European consensus guidelines for gene expression experiments [13-17]. The full description of the Extraction protocol, labeling and labeling protocol, hybridization protocol and data processing is obtainable in the GEO DATA base under (accession number GSE6044). Total RNA (median: 375 ng; range: 250 – 500 ng) was used to generate biotin-labeled cRNA (median: 6,5 μg; range: 3–10 μg) by means of Message Amp aRNA Amplification Kit (Ambion, Austin, TX). Quality control of RNA and cRNA was performed using a bioanalyzer (Agilent 2001 Biosizing, Agilent Technologies). Following fragmentation, labeled cRNA of each individual patient sample was hybridized to Affymetrix HG-Focus GeneChips, covering 8793 genes, and stained according to the manufacturer's instructions.

Quantification, normalization and statistical analysis

The quality control, normalization and data analysis, were assured with the affy package of functions of statistical scripting language 'R' integrated into the Bioconductor project , as described previously [22]. Using histograms of perfect match intensities, 5' to 3' RNA degradation side-by-side plots, or scatter plots, we estimated the quality of samples and hybridizations. To normalize raw data, we used a method of variance stabilizing transformations (VSN) [23]. To compare the normalized data from AC, SCC, SCLC and normal lung tissue, we used the Significance Analysis of Microarrays (SAM) algorithm v2.23 which contains a sliding scale for false discovery rate (FDR) of significantly up- and downregulated genes [24]. All data were permuted 1000 times by using the two classes, unpaired data mode of the algorithm. As a cut-off for significance, an estimated FDR of 2.6% was chosen by the tuning parameter delta of the software. The significance level of each gene was given by the q-value describing the lowest FDR in multiple testing [25], and a cut-off for fold-change of differential expression of 2 was used. Hierarchical clustering analysis (HCA) was used to determine components of variation in the data in this study. For these analyses we used the unsupervised complete linkage algorithm. The data points were organized in a phylogenetic tree with the branch lengths represent the degree of similarity between the values. Significantly expressed genes were uploaded to KEGG (Kyoto Encyclopedia of Genes and Genomes) and functional annotation was performed. Genes that were not listed or could be classified in more than one functional group were reviewed for the function based on the literature available using Pubmed, OMIM and GENE available in .

Quantitative real-time PCR

Corroboration of RNA expression data was performed by realtime PCR using the ABI PRISM 7900 HT Sequence Detection System Instrument (Applied Biosystems, Applera Deutschland GmbH, Darmstadt, Germany). Total RNA, ranging between 600 – 1000 ng, underwent reverse transcription using a High capacity cDNA Archive Kit according to the manufacturer's instruction (Applied Biosystems, Applera Deutschland GmbH, Darmstadt, Germany). PCRs were performed according to the instructions of the manufacturer, using commercially available assays-on-demand (Applied Biosystems, Applera Deutschland GmbH, Darmstadt, Germany). Ct values were calculated by the ABI PRISM software, and relative gene expression levels were expressed as the difference in Ct values of the target gene and the control gene ribosomal protein S11(RPS11). RPS11 was selected as reference gene for the quantification analyses, because the expression levels of the gene were similar between the examined tumor samples and normal tissue.

Classification using genetic programming

In order to generate a classifier that distinguishes between AC, SCC and SCLC, as well as the normal lung tissue, a Genetic Programming (GP) approach was used. The software DISCIPULUS which implements GP [26] was utilized. A leave-one-out cross validation (LOOCV) was performed, whereby one sample is removed from the training set. The other samples are reduced to those 50 genes with the highest signal-to-noise ratio, which are used as a training set in a training series. A training series generates a number of classifiers. After each series, the 30 best resulting classifiers are applied to that sample removed before, and the number of exact predictions were counted. The procedure was iterated, so that every sample was outside the training set once. The percentages of exact predictions for all samples of a class using 1020 classifiers (34 tissue samples and 30 classifiers = 34 * 30 = 1020 classifiers) were calculated. Each classifier used 50 different genes of a sample, queried their expression values and made the decision of "part of the class" or "outside the class". For each classifier and LOOCV iteration, the frequency of a gene (how often a gene occurs as appropriate classifier) was determined. The frequency was used as a quality criterion. The 10 genes with the highest frequency in each of the four classes were chosen in order to generate a final classifier of 40 genes. The accuracy of correct classification of the tissue is calculated as percentage using 30 classifiers of all left-out samples.

Results

Expression profiles and hierarchical cluster analysis

In this study, we examined gene expression profiles of untreated tumor cells from 29 patients with lung cancer (10 adenocarcinomas, 10 squamous cell carcinomas, 9 small cell lung cancer) in comparison to 5 normal lung tissues. The original data set and the patients characteristics are available in the GEO DATA base under (accession number GSE6044). Comparing AC, SCC and SCLC to normal lung tissue using significance analysis of microarrays (SAM), we found 205, 335 and 404 genes with an at least 2-fold different expression level and an estimated false discovery rate (FDR) of <2.6%. For an overview, a Venn diagram shows the overlaps of the three among groups (Figure 1) and the differentially expressed genes were further grouped in 14 functional classes (Table 1). Following SAM analysis, an unsupervised complete linkage clustering algorithm for cluster analyses was performed. The closest pair of the highest expression values of 198 differentially expressed genes was grouped together and a clear segregation of the analyzed groups (adenocarcinomas, squamous cell carcinomas, small cell lung cancer and normal lung tissue) was obtained (Figure 2)
Figure 1

Venn Diagramm of significantly regulated genes comparing adenocarcinomas, squamous cell carcinomas and small cell lung cancer to normal lung tissue (NT). The 3 genes that were overexpressed in all 3 tumor types were chromosome condensation protein G (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 2.2, 2.1 and 3.2-fold, respectively); collagen, type I, alpha 1 (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 7.98, 3.24 and 2.4-fold, respectively) and mesoderm specific transcript homolog (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 2.9, 4.5 and 14.8-fold, respectively).

Table 1

Classification of significantly deregulated lung cancer genes in comparison to normal lung tissue with respect to cancer subtype and biological functions.

functional classAdenocarcinoman = 205squamous cell carcinoman = 335small cell lung cancern = 404
proliferation1 gene ↑1 gene ↓41 genes ↑56 genes ↑1 gene ↓

DNA-repair1 gene ↓8 genes ↑14 genes ↑

oncogenes/tumor related genes1 gene ↑11 genes ↓9 genes ↑11 genes ↓13 genes ↑10 genes ↓

cell adhesion3 genes ↑6 genes ↓2 genes ↑6 genes ↓8 genes ↑7 genes ↓

cell structure7 genes ↑20 genes ↓14 genes ↑15 genes ↓19 genes ↑18 genes ↓

metabolism11 genes ↑25 genes ↓34 genes ↑26 genes ↓22 genes ↑38 genes ↓

immune system6 genes ↑13 genes ↓3 genes ↑19 genes ↓2 genes ↑25 genes ↓

signal transduction13 genes ↓7 genes ↑3 genes ↓18 genes ↑10 genes ↓

transcription1 gene ↑8 genes ↓10 genes ↑5 genes ↓22 genes ↑4 genes ↓

transport2 genes ↑12 genes ↓11 genes ↑15 genes ↓8 genes ↑15 genes ↓

development1 gene ↑9 genes ↓5 genes ↑6 genes ↓12 genes ↑4 genes ↓

calcium-binding3 genes ↑4 genes ↓3 genes ↓4 genes ↓

apoptosis1 gene ↓3 genes ↑1 gene ↓

unknown7 genes ↑38 genes ↓28 genes ↑54 genes ↓26 genes ↑44 genes ↓
Figure 2

Consensus tree of hierarchical clustering of AC (A1–A10), SCC (P1–P10), SCLC (K1–K10) and lung tissue samples (NT) as a control without cancer (N1–N5) using the genes with the higherst differential expression according to the fold change from the comparison of AC vs. NT, SCC vs. NT and SCLC vs. NT. Data are displayed by a color code. Green, transcript levels below the median; black, equal to the median; red, greater than median. The effective length of the dash after sample separation visualizes the degree of similarity of the different samples.

Classification of significantly deregulated lung cancer genes in comparison to normal lung tissue with respect to cancer subtype and biological functions. Venn Diagramm of significantly regulated genes comparing adenocarcinomas, squamous cell carcinomas and small cell lung cancer to normal lung tissue (NT). The 3 genes that were overexpressed in all 3 tumor types were chromosome condensation protein G (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 2.2, 2.1 and 3.2-fold, respectively); collagen, type I, alpha 1 (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 7.98, 3.24 and 2.4-fold, respectively) and mesoderm specific transcript homolog (overexpression in AC vs. NT, SCC vs. NT and SCLC vs. NT was 2.9, 4.5 and 14.8-fold, respectively). Consensus tree of hierarchical clustering of AC (A1–A10), SCC (P1–P10), SCLC (K1–K10) and lung tissue samples (NT) as a control without cancer (N1–N5) using the genes with the higherst differential expression according to the fold change from the comparison of AC vs. NT, SCC vs. NT and SCLC vs. NT. Data are displayed by a color code. Green, transcript levels below the median; black, equal to the median; red, greater than median. The effective length of the dash after sample separation visualizes the degree of similarity of the different samples.

Adenocarcinomas

We found 205 deregulated genes in AC; 43 were upregulated and 162 were downregulated. Looking at oncogenes and tumor-associated genes, only the paraneoplastic antigen MA2 gene was upregulated. Focusing on genes involved in cell structure, 7 genes were upregulated 2 to 7.9-fold, compared to normal lung tissue. Next to the intermediary filament keratin 7 gene, 3 genes were involved in the actin metabolism such as thymosin beta-10, actin-related protein 2/3 complex subunit 1B and plastin 3. Four downregulated genes, involved in cell structure, were found in AC compared to normal lung tissue. These genes were tubulin alpha 3 and beta 2 involved in the assembly of microtubules and intermediary filaments, as well as keratin 5 and 15. We also looked for genes involved in cell adhesion and migration and found integrin alpha 3, integrin beta 2 and intercellular adhesion molecule 1 to be upregulated in adenocarcinomas, while 6 genes, including the desmosomal cadherins desmoglein 3 and desmocollin 3, were significantly downregulated compared to normal lung tissue. Examining the genes involved in cell cycle control and proliferation, we found only 2 genes differentially expressed. The chromosome condensation protein G was upregulated, while cyclin A1 was downregulated in comparison to normal lung tissue. Looking at genes involved in DNA repair, only the DNA mismatch repair gene mutS homolog 3 was downregulated (Table 2).
Table 2

Selection of significantly differentially expressed genes in adenocarcinomas focusing on cell structure, cell adhesion and oncogenesis.

Gene SymbolGene NameFold ChangeAC vs. NTq-value
cell structure

COL1A1collagen, type I, alpha 17.981.22

KRT7keratin 75.110.53

PLS3plastin 32.430.53

TMSB10thymosin, beta 102.010.89

ARPC1Bactin related protein 2/3 complex, 1B2.381.22

TUBA3tubulin, alpha 30.391.22

TUBB2tubulin, beta 20.330.53

KRT15keratin 150.060.53

KRT5keratin 50.050.53

cell adhesion

ICAM1intercellular adhesion molecule 15.800.89

ITGB2integrin, ί-22.481.22

ITGA3integrin α-32.021.53

DSG3desmoglein 30.480.53

DSC3desmocollin 30.320.53

oncogenesis

SERPINH1serine (or cysteine) proteinase inhibitor, clade H3.270.89

PNMA2paraneoplastic antigen MA23.130.89

MSH3mutS homolog 30.440.53

MYBv-myb Avian Myeloblastosis viral oncogene homolog0.390.53

RABL2Brab-like 2B0.340.53

RABL2Arab-like 2A0.270.53

AC = adenocarcinoma, NT = normal lung tissue

Selection of significantly differentially expressed genes in adenocarcinomas focusing on cell structure, cell adhesion and oncogenesis. AC = adenocarcinoma, NT = normal lung tissue

Squamous cell carcinomas

In SCC, we found 335 deregulated genes, including 172 upregulated and 163 downregulated genes. Looking at oncogenes and tumor-associated genes, 4 genes of the RAS associated gene family; the oncogenes v-myc myelocytomatosis viral oncogene homolog, v-maf musculoaponeurotic fibrosarcoma oncogene homolog and pituitary tumor-transforming 1 were upregulated. Examining genes involved in cell structure and cell adhesion, we found 5 types of collagen genes, in particular the genes encoding for collagen type I alpha-1 and 2, type V alpha-2, type VI alpha-3 and type XI alpha-1 to be upregulated in comparison to normal lung tissue. Further, gap junction protein alpha 1 (43 kDa), a member of the connexin gene family and neuronal cell adhesion molecule, a member of the immunoglobulin superfamily were upregulated, while 6 other genes involved in cell adhesion such as the tight junction protein 3 and claudin 10 were downregulated in comparison to normal lung tissue. In SCCs, 41 genes involved in cell cycle regulation were upregulated between 2 to 4.3-fold. Looking at key molecules for progression of cell cycle, the cyclines A2 and B2, cyclin-dependent kinase 4 and the cell division cycle 2 genes were upregulated. In the group of genes involved in DNA repair, we found genes with key functions for mismatch and double-strand DNA repair such as proliferating cell nuclear antigen, mutS homolog 6 replication factor C 4 and C5, RAD51 associated protein 1, which were overexpressed in comparison to normal lung tissue (Table 3).
Table 3

Selection of significantly differentially expressed genes in squamous cell carcinomas focusing on proliferation, cell structure and oncogenesis.

Gene SymbolGene NameFold ChangeSCC vs. NTq-value
Proliferation

RFC4replication factor C (activator 4)5.730.37

CCNB2Cyclin B 24.340.37

PRC1protein regulator of cytokinesis 13.541.21

CENPAcentromere protein A3.290.61

MAD2L1mitotic arrest deficient-like 13.200.61

CDK4cyclin-dependent kinase 42.890.37

CDC2cell division cycle 22.890.37

BUB1Bbudding uninhibited by benzimidazoles 1 homolog beta2.671.21

PCNAproliferating cell nuclear antigen2.620.37

PIR51RAD51 associated protein 12.480.61

HEC1kinetochore associated 22.150.84

MSH6mutS homolog 62.101.06

RFC5replication factor C (activator 5)2.090.37

CCNA2Cyclin A 22.051.06

CCNA1Cyclin A 10.570.61

cell structure

COL11A1collagen, type XI, alpha 17.940.84

COL1A1collagen, type I, alpha 13.241.21

TMSNBthymosin, beta, 4X3.242.53

COL5A2collagen, type V, alpha 22.990.37

COL1A2collagen, type I, alpha 22.890.84

PLS3plastin 32.460.37

COL6A3collagen, type VI, alpha 32.281.34

FSCN1fascin, homolog 12.201.06

oncogenesis

NMBglycoprotein (transmembrane) nmb4.011.34

RANBP1RAN binding protein 13.230.37

MAFv-maf Avian Musculoaponeurotic Fibrosarcoma oncogene2.631.34

RACGAP1Rac GTPase activating protein 12.530.61

PTTG1pituitary tumor-transforming 12.491.34

MYCv-myc Avian Myelocytomatosis viral oncogene homolog2.431.90

RALAv-ral simian leukemia viral oncogene homolog A2.421.34

RAP2BRas-Related Protein 2B2.300.84

RANRas-Related Nuclear Protein2.021.90

KITv-KIT Hardy-Zuckerman 4 Feline Sarcoma viral oncogene homolog0.460.37

RABL2BRab-like 2B0.350.37

RABL2ARab-like 2A0.331.34

SCC = squamous cell carcinomas, NT = normal lung tissue

Selection of significantly differentially expressed genes in squamous cell carcinomas focusing on proliferation, cell structure and oncogenesis. SCC = squamous cell carcinomas, NT = normal lung tissue

Small cell lung cancer

In SCLC, we found 404 differential expressed genes, including 223 upregulated and 181 downregulated genes. Looking at oncogenes and tumor-associated genes, 4 genes of the rat sarcoma viral oncogene homolog associated gene family, FYN oncogene related to SRC and pituitary tumor-transforming 1 were upregulated, respectively. Of interest, the three tumor-related genes: tumor protein D52, melanoma antigen family D 4, stathmin 1/oncoprotein 18 and two oncogenes DEK oncogene and forkhead box G1 were upregulated which has not been described in the context of lung cancer so far. In comparison to normal lung tissue, a different pattern of cell adhesion molecules was found in SCLC, showing 8 genes up- and 7 genes downregulated between 2 to 12.8-fold and 2.1 to 4.6-fold, respectively. In particular, the neural cell adhesion molecule 1 and the neuronal cell adhesion molecule, both members of the immunoglobulin superfamily, were overexpressed. Looking for genes involved in cell cycle regulation, we found 56 genes upregulated between 2.1 to 5.1-fold compared to normal lung tissue among them the key molecules for progression of cell cycle, the cyclines A2 and B2, cyclin-dependent kinase 4 and the cell division cycle 2 genes and cyclin E. The expression patterns of genes of the centromer/kinetochore complex and genes involved in DNA repair were similar to the expression patterns in SCC (Table 4).
Table 4

Selection of significantly differentially expressed genes in small cell lung cancer focusing on proliferation, oncogenesis and cell adhesion.

Gene SymbolGene NameFold ChangeSCLC vs. NTq-value
proliferation

RFC4replication factor C (activator 4)5.560.09

p16/CDKN2Acyclin-dependent kinase inhibitor 2A5.150.23

CENPAcentromere protein A4.680.09

CCNECyclin E4.480.09

CCNB2Cyclin B24.430.09

PRC1protein regulator of cytokinesis 14.230.28

CENPFcentromere protein F4.010.23

MAD2L1mitotic arrest deficient-like 13.780.09

HEC1kinetochore associated 23.660.09

PIR51RAD51 associated protein13.600.09

BUB1Bbudding uninhibited by benzimidazoles 1 homolog beta3.510.09

CDC2cell division cycle 23.420.28

PCNAproliferating cell nuclear antigen3.120.09

RFC3replication factor C (activator 3)2.580.09

RFC5replication factor C (activator 5)2.550.09

CDK4cyclin-dependent kinase 42.550.09

NEK2never in mitosis gene a-related kinase 22.480.09

CDK2cyclin-dependent kinase 22.410.09

BUB1budding uninhibited by benzimidazoles 1 homolog2.380.09

CENPEcentromere protein E2.380.09

ENC1ectodermal-neural cortex 12.320.23

MSH2mutS homolog 22.310.09

FANCGFanconi anemia, complementation G2.220.09

MSH6mutS homolog 62.090.09

CCNA2Cyclin A22.070.09

oncogenesis

FOXG1BForkhead BOX G1B, QIN oncogene16.760.09

STMN1stathmin 1/oncoprotein 184.980.16

FYNfyn oncogene related to src, fgr, yes3.910.16

MAGED4melanoma antigen, family D, 43.510.28

PTTG1pituitary tumor-transforming 12.560.09

RACGAP1Rac GTPase activating protein 13.300.09

DEKDEK oncogene (DNA binding)2.490.16

RANRas-Related Nuclear Protein2.390.16

RANBP1RAN binding protein 12.390.16

TPD52tumor protein D522.080.16

RABL2BRab-like 2B0.440.09

RABL2Rab-like 2A0.400.09

Cell adhesion

NCAM1neural cell adhesion molecule 112.750.16

NRCAMneuronal cell adhesion molecule9.430.09

CDH2N-cadherin5.410.09

CELSR3cadherin, EGF LAG seven-pass G-type receptor 32.460.09

TJP3tight junction protein 3, zona occludens 30.450.09

SCLC = small cell lung cancer, NT = normal lung tissue

Selection of significantly differentially expressed genes in small cell lung cancer focusing on proliferation, oncogenesis and cell adhesion. SCLC = small cell lung cancer, NT = normal lung tissue

Corroboration of array data by quantitative real-time (RT) PCR

Quantitative RT-PCR was used to verify the microarray data for 13 genes found to be differentially regulated in the different histologic lung cancer subgroups as compared to normal lung tissue. The 13 tested genes that were selected from different functional classes, with focus on the genes presented in tables 2–4, were: CASK (calcium/calmodulin-dependent serine protein kinase), CCNB2 (cyclin B2), COL1A1 (collagen, type I, alpha 1), IFNGR2 (interferon gamma receptor 2), PCNA (proliferating cell nuclear antigen), PRKCI (protein kinase C, iota), PLS3 (plastin 3), PTTG1 (pituitary tumor-transforming 1), PTTG1-IP (pituitary tumor-transforming 1 binding protein), UBE2C (ubiquitin-conjugating enzyme E2C), MAGED4 (melanoma antigen family D 4), FOX (forkhead box G1) and FYN (FYN oncogene related to SRC) and NRCAM (neuronal cell adhesion molecule). The expression data generated by the oligonucleotide array and RT-PCR were highly concordant, supporting the reliability of the array analysis (Figure 3). Of interest, the pituitary tumor-transforming gene 1 was 2.56 and 2.49-fold significantly differentially expressed in SCLC and SCC, respectively, in comparison to normal lung tissue using microarray analysis. In AC, the difference of expression was not significant in microarray analysis. However, using RT-PCR for corroboration, the pituitary tumor-transforming gene 1 was 5.7, 8.0 and 8.3 overexpressed in SCLC, SCC and AC, respectively, in comparison to normal lung tissue. In a previously conducted immunohistochemical study, we have demonstrated a strong pituitary tumor-transforming gene 1 expression in SCLC, adenocarcinomas, as well as in SCC, whilst a weak expression was only found in the luminal layer of normal lung epithelia, thus supporting the data of RT-PCR [27].
Figure 3

Corroboration of the results from microarray analysis using RT-PCR. The gene expression levels were normalized to a housekeeping gene (RPS11) for calculating ΔΔCt values. A ΔΔCt value of 1 corresponds to a Fold Change of 2. A) adenocarcinomas (AC), B) squamous cell carcinomas (SCC) and C) small cell lung cancer (SCLC).

Corroboration of the results from microarray analysis using RT-PCR. The gene expression levels were normalized to a housekeeping gene (RPS11) for calculating ΔΔCt values. A ΔΔCt value of 1 corresponds to a Fold Change of 2. A) adenocarcinomas (AC), B) squamous cell carcinomas (SCC) and C) small cell lung cancer (SCLC).

Class prediction using genetic programming

In order to identify genes that enable accurate distinction between AC, SCC and SCLC, as well as normal lung tissue, a genetic programming data analysis was performed. The percentages of exact predictions for all samples of a class using 1020 classifiers (34 tissue samples and 30 classifiers = 34 * 30 = 1020 classifiers) are shown in Table 5 and the 10 genes with the highest frequency in each of the four classes were chosen in order to generate a final classifier of 40 genes. Using microarray training set of 34 samples (10 AC, 10 SCC, 9 SCLC and 5 normal lung tissues), a minimal set of 40 genes (Table 6) provided a classification accuracy for division into the 4 different cell tissues. For external validation, the test set included 13 different NSCLC samples from pretreated patients (9 recurrent AC and 4 recurrent SCC). All test set samples were correctly classified using the 40 genes found with genetic programming.
Table 5

Leave-one-out Cross Validation (LOOCV) accuracy for all one vs. rest experiments.

ExperimentAccuracy of all 30 classifiers
AC vs. rest87.75%
SCLC vs. rest93.92%
SCC vs. rest82.25%
NT vs. rest93.24%
Table 6

Genes found by genetic programming for discrimination between SCLC, NSCLC (AC and SCC) and normal lung tissue.

Gene SymbolGene NameLocation
Discrimination AC vs. rest

CLCA2chloride channel, calcium activated, family member 21p31-p22

EFSembryonal Fyn-associated substrate14q11.2-q12

FGGfibrinogen, gamma polypeptide4q28

GPCR5AG protein-coupled receptor, family C, group 5, member A12p13-p12.3

KRT7keratin 712q12-q13

KRT5keratin 5 (epidermolysis bullosa simplex)12q12-q13

PTPRZ1protein tyrosine phosphatase, receptor-type, Z polypeptide 17q31.3

SEMA3Fsema domain, immunoglobulin domain (Ig), (semaphorin) 3F3p21.3

SERPINA1serine (or cysteine) proteinase inhibitor, clade A member 114q32.1

SLC39A8solute carrier family 39 (zinc transporter), member 84q22-q24

Discrimination SCC vs. rest

ADCY3adenylate cyclase 32p24-p22

ATP2B1ATPase, Ca++ transporting, plasma membrane 112q21.3

CASKcalcium/calmodulin-dependent serine protein kinaseXp11.4

CHST2carbohydrate (N-acetylglucosamine-6-O) sulfotransferase 23q24| 7q31

DGKAdiacylglycerol kinase, alpha 80kDa12q13.3

GOLPH2golgi phosphoprotein 29q21.33

KIF13Bkinesin family member 13B8p12

RAB17RAB17, member RAS oncogene family2q37.3

RAB40BRAB40B, member RAS oncogene family17q25.3

SCNN1Asodium channel, nonvoltage-gated 1 alpha12p13

Discrimination SCLC vs. rest

CELSR3cadherin, EGF LAG seven-pass G-type receptor 33p24.1-p21.2

CTSHcathepsin H15q24-q25

DLK1delta-like 1 homolog (Drosophila)14q32

ERBB2v-erb-b2 erythroblastic leukemia viral oncogene homolog 217q11.2-q12

FANCAFanconi anemia, complementation group A16q24.3

ID4inhibitor of DNA binding 4, dominant helix-loop-helix protein6p22-p21

ISL1ISL1 transcription factor, LIM/homeodomain, (islet-1)5q11.2

MGC13024hypothetical protein MGC1302416p11.2

POU4F1POU domain, class 4, transcription factor 113q21.1-q22

XYLT2xylosyltransferase II17q21.3-17q22

Discrimination NT vs. rest

ALOX15arachidonate 15-lipoxygenase17p13.3

ANKMY1ankyrin repeat and MYND domain containing 12q37.3

C18orf43chromosome 18 open reading frame 4318p11.21

DNAI1dynein, axonemal, intermediate polypeptide 19p21-p13

GSTA3glutathione S-transferase A36p12.1

LRRC6leucine rich repeat containing 68q24.22

MIPEPmitochondrial intermediate peptidase13q12

NKX3-1NK3 transcription factor related, locus 1 (Drosophila)8p21

RTDR1rhabdoid tumor deletion region gene 122q11.2

VNN3vanin 36q23-q24

AC = adenocarcinoma, SCC = squamous cell carcinoma, SCLC = small cell lung cancer, NT = normal lung tissue

Leave-one-out Cross Validation (LOOCV) accuracy for all one vs. rest experiments. Genes found by genetic programming for discrimination between SCLC, NSCLC (AC and SCC) and normal lung tissue. AC = adenocarcinoma, SCC = squamous cell carcinoma, SCLC = small cell lung cancer, NT = normal lung tissue

Discussion

In this study, a comparison of the expression pattern of the 3 major histological lung cancer subtypes, as measured by array analysis, is presented. In comparison to the normal lung tissue, 205, 335 and 404 genes in AC, SCC and SCLC were found to be at least 2-fold differentially expressed. Fourteen genes of different gene families were corroborated using RT-PCR. In AC, we found an up-regulation of keratin 7, a characteristic finding for pathologists to diagnose this subtype of lung cancer. On the other hand, keratin 5 was downregulated in AC. The differential expression is already described as a separator between AC and SCC, in line with our results [28,29]. Looking at adhesion molecules in AC, a down-regulation of the desmosomes desmoglein 3 and desmocollin 3 was found. In this context, it was shown that the invasive behavior of cells is inhibited when transfected with desmosomal components [30], suggesting that down-regulation of the desmosomes in adenocarcinomas of the lung plays a role in the loss of cell to cell contact and tumor spreading. The extracellular cell matrix receptors integrin alpha-3 and integrin beta-2 as well as the collagen binding protein-1 (SERPHINH1) were upregulated in AC. These genes have a high affinity to collagen IV and laminin, both essential components of the basement membrane [31], possibly mediating adhesion and invasion. Additionally, we found intercellular adhesion molecule 1 (ICAM1), a cell-adhesion molecule also binding to integrin beta-2 and promoting metastasis due to tumor cell adhesion to endothelium overexpressed in AC [32,33]. Looking at the oncogenes in SCC, we found genes of the RAS associated gene family, the myc myelocytomatosis viral oncogene homolog (MYC) and musculoaponeurotic fibrosarcoma oncogene (MAF) upregulated. MAF encodes for nuclear transcriptional regulating proteins with a leucine zipper motif, and was identified in the genome of the acute transforming avian retrovirus AS42, which induces fibrosarcomas and has the ability to transform chicken embryo fibroblasts [34]. It is noteworthy that in SCC 5 members of the collagen family type I, V, VI, and XI were upregulated. An increased collagen synthesis might be associated with carcinogenesis, as in patients with breast cancer the emerging fibrotic focus is regarded as an indicator of tumor angiogenesis and independent predictor of early metastasis [35]. SCLCs show an up-regulation of 3 proto-oncogenes, which have not been described in this context so far. The DEK oncogene encodes for a 375 amino acid chromatin binding protein, which introduces supercoiling in DNA. It has been described to be upregulated in other tumor types, such as bladder cancer, glioblastoma, melanoma and leukemia [36]. The Qin oncogene, originally isolated from avian sarcoma virus, causes oncogenic transformation. Qin is the avian orthologue of mammalian brain factor-1 or forkhead box G1 (FOXG1B), a gene which belongs to the human forkhead-box gene family [37]. Possibly related to the neuroendocrine differentiation of SCLC, forkehead box G1 is essential for the proliferation and survival of cerebro-cortical progenitor cells [38]. Further, we found the Fyn oncogene upregulated in SCLC. Fyn is a member of the src family which is activated in colorectal cancer, and has also been identified in melanoma cells with elevated cell motility and spreading ability [39,40]. With regard to adhesion molecules, the overexpressed neural cell adhesion molecule 1 is useful for the diagnosis of SCLC [41,42]. Next to neural cell adhesion molecule 1 we found other genes significantly upregulated such as the Purkinje cell protein 4, secretory granule neuroendocrine protein 1, synaptotagmin 1 and the neuronal cell adhesion molecule (NRCAM) that seems to reflect the neuronal heritage of this particular lung tumor subtype. NRCAM belongs to the L1 family immunoglobulin-like CAMs, which are involved in the guidance, growth and fasciculation of neuronal cells [43]. Neuronal cell adhesion molecule has also been described in 2006 by Taniwaki and colleagues', who performed comprehensive gene expression profiles of pure SCLC cells derived from laser-microdissected tissue samples [44]. In order to confirm the overexpression of the neuronal cell adhesion molecule using a different technique, we corroborated the result of microarray analysis using RT-PCR, showing a 9.3-fold overexpression of the neuronal cell adhesion molecule in SCLC in comparison to lung tissue. The imbalance of activated oncogenes and lost tumor suppressor genes, found in different types of lung cancer, may be associated with the different tumor growth kinetics. SCLC is the fastest growing lung tumor with a median tumor doubling time of 50 days [45]. This is reflected by our data with regard to the number and strength of upregulated cell cycle genes affecting growth rate. Several cyclines, their associated cyclin-dependent kinases and cell division cycle (CDC) genes controlling cell cycle progression, such as cyclin A2, B2 and E2, and cyclin-dependent kinase 2 and 4, as well as cell division cycle 2, 20 and 25B were upregulated [46]. The activation level of different cell cycle genes may be relevant with regard to new antitumor agents, which selectively target cell cycle proteins. For example, flavopiridol has the ability to induce cell cycle arrest by binding and inhibiting different cyclin-dependent kinase such as 2 and 4 [47,48]. Both CDKs are significantly upregulated in SCLC. On the other hand, the upregulation of cyclin-dependent kinase 2, that is critical for cell entry and progression through S phase of the cell cycle, is missing in NSCLC. Preclinical data support this finding since most NSCLC cell lines are resistant to flavopiridol-induced apoptosis unless they were treated during S phase. Furthermore, the IC 50 of flavopiridol-treated cells in SCLC cell lines is three times lower compared to NSCLC cell lines [49]. Consequently, this drug might be more promising in patients with SCLC. We have further shown that genes involved in mismatch repair, such as mutS homolog 2 or 6, were upregulated in SCLCs, which is in line with other reports showing that these gene transcripts and proteins are present [50,51], in contrast to NSCLCs, where high resolution deletion mapping reveals frequent allelic losses at the DNA mismatch repair loci mutS homolog 3 [52]. Similar to the latter report, we have observed a downregulation of mutS homolog 3 in ACs. After outlining potentially important molecular differences in different subtypes of lung cancer to normal lung tissue, we were interested in defining how many and which genes are necessary for correct classification of the lung tumor subtype. Using genetic programming (GP), a training set of 34 tissue samples was applied. With an evolutionary algorithm of GP, 40 genes were sufficient for a correct discrimination between all lung tumor tissue types and normal lung tissue. The 40 selected genes, identified using GP, were a subset of the genes, which were previously identified to be differentially expressed using cluster analysis. Following identification of the 40 genes with GP, further 13 tissue samples of previously treated patients NSCLC lung cancers were correctly classified with 100% prediction accuracy. It is important to note that the samples of the training set were from treatment naïve patients, while the test set came from those that were previously treated for their cancer using platinum-based chemotherapy. Nevertheless, the presented genes for distinction seem to maintain their value, independent from whether or not the patient had been treated. However, caution must be applied, since in the test set did not contain additional SCLC samples and larger sample size is needed which includes samples from all lung cancer subtypes in order to confirm the predictor.

Conclusion

Our data show the different gene expression profiles in dependence from the histological type of lung cancer, which reflects the specific biological characteristics of the respective tumor subtype. These data may form the basis for a molecular classification system and allows a further insight into the altered genomic progress of the lung cancer cell, which may help to develop molecularly targeted drugs.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

AR, UPR, GP, RH and RK were involved in the design and/or conduct of the experiments as, well as the preparation of the manuscript. HG and HEG were involved in the histopathological review of the tumor samples. AS was involved in the tumor sample collection. AVH and MR were involved in the statistical analysis of the data. JN, GG and MS were involved in the data collection of the patients, and in the review of the manuscript.
  50 in total

1.  MUC1 mediates transendothelial migration in vitro by ligating endothelial cell ICAM-1.

Authors:  Jennifer J Rahn; Jeffrey W Chow; Garnet J Horne; Brian K Mah; Joanne T Emerman; Pat Hoffman; Judith C Hugh
Journal:  Clin Exp Metastasis       Date:  2005       Impact factor: 5.150

2.  Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs.

Authors:  Takefumi Kikuchi; Yataro Daigo; Toyomasa Katagiri; Tatsuhiko Tsunoda; Koichi Okada; Soji Kakiuchi; Hitoshi Zembutsu; Yoichi Furukawa; Masafumi Kawamura; Koichi Kobayashi; Kohzoh Imai; Yusuke Nakamura
Journal:  Oncogene       Date:  2003-04-10       Impact factor: 9.867

3.  Smoking and cancer-related gene expression in bronchial epithelium and non-small-cell lung cancers.

Authors:  M Woenckhaus; L Klein-Hitpass; U Grepmeier; J Merk; M Pfeifer; Pj Wild; M Bettstetter; P Wuensch; H Blaszyk; A Hartmann; F Hofstaedter; W Dietmaier
Journal:  J Pathol       Date:  2006-10       Impact factor: 7.996

4.  Nonsmall cell lung carcinoma with neuroendocrine differentiation--an entity of no clinical or prognostic significance.

Authors:  Diana N Ionescu; Diana Treaba; Cyril B Gilks; Samuel Leung; Daniel Renouf; Janessa Laskin; Richard Wood-Baker; Allen M Gown
Journal:  Am J Surg Pathol       Date:  2007-01       Impact factor: 6.394

5.  Expression of DNA mismatch repair gene MSH2 in cytological material from lung cancer patients.

Authors:  George Kanellis; Ioulia Chatzistamou; Helen Koutselini; Ekaterini Politi; Athanasios Gouliamos; Lambros Vlahos; Antonis Koutselinis
Journal:  Diagn Cytopathol       Date:  2006-07       Impact factor: 1.582

6.  Combined depletion of cell cycle and transcriptional cyclin-dependent kinase activities induces apoptosis in cancer cells.

Authors:  Dongpo Cai; Vaughan M Latham; Xinxin Zhang; Geoffrey I Shapiro
Journal:  Cancer Res       Date:  2006-09-15       Impact factor: 12.701

7.  Gene expression profiles of small-cell lung cancers: molecular signatures of lung cancer.

Authors:  Masaya Taniwaki; Yataro Daigo; Nobuhisa Ishikawa; Atsushi Takano; Tatsuhiko Tsunoda; Wataru Yasui; Kouki Inai; Nobuoki Kohno; Yusuke Nakamura
Journal:  Int J Oncol       Date:  2006-09       Impact factor: 5.650

Review 8.  Chromosomal imbalances in human lung cancer.

Authors:  Binaifer R Balsara; Joseph R Testa
Journal:  Oncogene       Date:  2002-10-07       Impact factor: 9.867

9.  Expression of the cell adhesion molecules ICAM-1 and VCAM-1 in the cytosol of breast cancer tissue, benign breast tissue and corresponding sera.

Authors:  P A Regidor; R Callies; M Regidor; A E Schindler
Journal:  Eur J Gynaecol Oncol       Date:  1998       Impact factor: 0.196

10.  The influence of the pituitary tumor transforming gene-1 (PTTG-1) on survival of patients with small cell lung cancer and non-small cell lung cancer.

Authors:  Nina Rehfeld; Helene Geddert; Ulrich-Peter Rohr; Abedelsalam Atamna; Astrid Rohrbeck; Guillermo Garcia; Slawek Kliszewski; Judith Neukirchen; Ingmar Bruns; Ulrich Steidl; Roland Fenk; Helmut E Gabbert; Ralf Kronenwett; Rainer Haas
Journal:  J Carcinog       Date:  2006-01-20
View more
  34 in total

1.  An Expression Signature as an Aid to the Histologic Classification of Non-Small Cell Lung Cancer.

Authors:  Luc Girard; Jaime Rodriguez-Canales; Carmen Behrens; Debrah M Thompson; Ihab W Botros; Hao Tang; Yang Xie; Natasha Rekhtman; William D Travis; Ignacio I Wistuba; John D Minna; Adi F Gazdar
Journal:  Clin Cancer Res       Date:  2016-06-28       Impact factor: 12.531

2.  Lung cancer transcriptomes refined with laser capture microdissection.

Authors:  Juan Lin; Gabrielle Marquardt; Nandita Mullapudi; Tao Wang; Weiguo Han; Miao Shi; Steven Keller; Changcheng Zhu; Joseph Locker; Simon D Spivack
Journal:  Am J Pathol       Date:  2014-08-14       Impact factor: 4.307

3.  Smoking-Related Gene Expression in Laser Capture-Microdissected Human Lung.

Authors:  Xiang-Lin Tan; Tao Wang; Shengli Xiong; Shalini V Kumar; Weiguo Han; Simon D Spivack
Journal:  Clin Cancer Res       Date:  2009-12-15       Impact factor: 12.531

4.  High expression of CASK correlates with progression and poor prognosis of colorectal cancer.

Authors:  Jin-Lai Wei; Zhong-Xue Fu; Min Fang; Qiu-Yuan Zhou; Qing-Ning Zhao; Jin-Bao Guo; Wei-Dong Lu; Hao Wang
Journal:  Tumour Biol       Date:  2014-06-14

Review 5.  Tumor protein D52 (TPD52) and cancer-oncogene understudy or understudied oncogene?

Authors:  Jennifer A Byrne; Sarah Frost; Yuyan Chen; Robert K Bright
Journal:  Tumour Biol       Date:  2014-05-06

6.  Genetic and pharmacologic inhibition of EPHA2 promotes apoptosis in NSCLC.

Authors:  Katherine R Amato; Shan Wang; Andrew K Hastings; Victoria M Youngblood; Pranav R Santapuram; Haiying Chen; Justin M Cates; Daniel C Colvin; Fei Ye; Dana M Brantley-Sieders; Rebecca S Cook; Li Tan; Nathanael S Gray; Jin Chen
Journal:  J Clin Invest       Date:  2014-04-08       Impact factor: 14.808

7.  A drug repositioning approach identifies tricyclic antidepressants as inhibitors of small cell lung cancer and other neuroendocrine tumors.

Authors:  Nadine S Jahchan; Joel T Dudley; Pawel K Mazur; Natasha Flores; Dian Yang; Alec Palmerton; Anne-Flore Zmoos; Dedeepya Vaka; Kim Q T Tran; Margaret Zhou; Karolina Krasinska; Jonathan W Riess; Joel W Neal; Purvesh Khatri; Kwon S Park; Atul J Butte; Julien Sage
Journal:  Cancer Discov       Date:  2013-09-26       Impact factor: 39.397

8.  Venetoclax Is Effective in Small-Cell Lung Cancers with High BCL-2 Expression.

Authors:  Timothy L Lochmann; Konstantinos V Floros; Mitra Naseri; Krista M Powell; Wade Cook; Ryan J March; Giovanna T Stein; Patricia Greninger; Yuki Kato Maves; Laura R Saunders; Scott J Dylla; Carlotta Costa; Sosipatros A Boikos; Joel D Leverson; Andrew J Souers; Geoffrey W Krystal; Hisashi Harada; Cyril H Benes; Anthony C Faber
Journal:  Clin Cancer Res       Date:  2017-11-08       Impact factor: 12.531

9.  Prediction of lung cancer histological types by RT-qPCR gene expression in FFPE specimens.

Authors:  Matthew D Wilkerson; Jason M Schallheim; D Neil Hayes; Patrick J Roberts; Roy R L Bastien; Michael Mullins; Xiaoying Yin; C Ryan Miller; Leigh B Thorne; Katherine B Geiersbach; Kenneth L Muldrew; William K Funkhouser; Cheng Fan; Michele C Hayward; Steven Bayer; Charles M Perou; Philip S Bernard
Journal:  J Mol Diagn       Date:  2013-05-20       Impact factor: 5.568

10.  HLungDB: an integrated database of human lung cancer research.

Authors:  Lishan Wang; Yuanyuan Xiong; Yihua Sun; Zhaoyuan Fang; Li Li; Hongbin Ji; Tieliu Shi
Journal:  Nucleic Acids Res       Date:  2009-11-09       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.