Literature DB >> 21888658

Preferential expression of potential markers for cancer stem cells in large cell neuroendocrine carcinoma of the lung. An FFPE proteomic study.

Masaharu Nomura1, Tetsuya Fukuda, Kiyonaga Fujii, Takeshi Kawamura, Hiromasa Tojo, Makoto Kihara, Yasuhiko Bando, Adi F Gazdar, Masahiro Tsuboi, Hisashi Oshiro, Toshitaka Nagao, Tatsuo Ohira, Norihiko Ikeda, Noriko Gotoh, Harubumi Kato, Gyorgy Marko-Varga, Toshihide Nishimura.   

Abstract

BACKGROUND: Large cell neuroendocrine carcinoma (LCNEC) of the lung, a subtype of large cell carcinoma (LCC), is characterized by neuroendocrine differentiation that small cell lung carcinoma (SCLC) shares. Pre-therapeutic histological distinction between LCNEC and SCLC has so far been problematic, leading to adverse clinical outcome. We started a project establishing protein targets characteristic of LCNEC with a proteomic method using formalin fixed paraffin-embedded (FFPE) tissues, which will help make diagnosis convincing.
METHODS: Cancer cells were collected by laser microdissection from cancer foci in FFPE tissues of LCNEC (n = 4), SCLC (n = 5), and LCC (n = 5) with definite histological diagnosis. Proteins were extracted from the harvested sections, trypsin-digested, and subjected to HPLC/mass spectrometry. Proteins identified by database search were semi-quantified by spectral counting and statistically sorted by pair-wise G-statistics. The results were immunohistochemically verified using a total of 10 cases for each group to confirm proteomic results.
RESULTS: A total of 1981 proteins identified from the three cancer groups were subjected to pair-wise G-test under p < 0.05 and specificity of a protein's expression to LCNEC was checked using a 3D plot with the coordinates comprising G-statistic values for every two group comparisons. We identified four protein candidates preferentially expressed in LCNEC compared with SCLC with convincingly low p-values: aldehyde dehydrogenase 1 family member A1 (AL1A1) (p = 6.1 × 10-4), aldo-keto reductase family 1 members C1 (AK1C1) (p = 9.6x10-10) and C3 (AK1C3) (p = 3.9x10-10) and CD44 antigen (p = 0.021). These p-values were confirmed by non-parametric exact inference tests. Interestingly, all these candidates would belong to cancer stem cell markers. Immunohistochmistry supported proteomic results.
CONCLUSIONS: These results suggest that candidate biomarkers of LCNEC were related to cancer stem cells and this proteomic approach via FFPE samples was effective to detect them.

Entities:  

Year:  2011        PMID: 21888658      PMCID: PMC3178477          DOI: 10.1186/2043-9113-1-23

Source DB:  PubMed          Journal:  J Clin Bioinforma        ISSN: 2043-9113


Introduction

Lung cancer is the leading cause of cancer-related death worldwide [1]. In Japan, annual deaths from lung cancer have been increasing and reached about 70,000 [2] and in USA reached 160,000 even with a recent decreasing trend [3]. Generally, lung cancer is divided into two histological subgroups, non-small cell lung carcinoma (NSCLC) and small cell lung carcinoma (SCLC). NSCLC mainly consists of adenocarcinoma (AC), squamous cell carcinoma (SC) and large cell carcinoma (LCC). AC and SC are differentiated with the features of normal cells but LCC is undifferentiated without such features. The prognosis of lung cancer depends on pathological stages and histological types; in NSCLC, AC is the best, while LCC the worst [4]. Travis et al. [5] proposed a new subtype of LCC, named large cell neuroendocrine carcinoma (LCNEC) in 1991, and the World Health Organization finally adopted it for the revised pathological classification of lung cancer in 1999. LCNEC exhibits morphology similar to LCC but neuroendocrine differentiation like SCLC that could be judged by expression of at least one of three representative neuroendocrine proteins, CD56, synaptophysin (Syn) and chromogranin A (CGA). Among subtypes of LCC, the prognosis of LCNEC was poorer than others even if at early stages [6,7] like SCLC. However therapeutic strategies of LCNEC and SCLC differ from each other. The former needs surgery as the first choice but the latter chemotherapy. It is therefore important to distinguish LCNEC from SCLC definitely but common morphological growth patterns characteristic of neuroendocrine tumors sometimes hinder clear pathologic distinction between the two neuroendocrine cancers. It follows that new biomarkers should be developed for definite diagnosis of those cancers, even if histopathology has long been the golden standard for diagnosis and determination of disease progression. Genomic and immunohistochemical analyses for such a purpose have been reported [8,9] but there have still been no biomarkers specific to LCNEC. Recent advancements in shotgun sequencing and quantitative mass spectrometry for protein analyses could make proteomics amenable to clinical biomarker discovery [10]. In addition, selective collection of target cells from formalin fixed paraffin embedded (FFPE) tissues by laser microdissection can permit to access to tissues of a variety of cancer types with definite diagnosis. We have used these methods for exploring stage-related proteins on non-metastatic lung AC by both global and multiple reaction monitoring (MRM) mass spectrometry-based proteomics [11,12]. In this study, we applied them to detect the potential protein markers characteristic of LCNEC by label-free semi-quantitative shotgun proteomics using spectral counting.

2. Materials and methods

2. 1. Sample Preparation for FFPE Tissue Specimens

Surgically removed lung tissues were fixed with a buffered formalin solution containing 10-15% methanol, and embedded by a conventional method. Archived paraffin blocks of formalin-fixed tissues obtained from four LCNEC cases, five LCC and five SCLC, which were retrieved with the approval from Ethical Committee of Tokyo Medical University Hospital and used with patients' consents. Patients' characteristics are listed in Table 1. Paraffin blocks were cut into 4 μm sections for diagnosis and 10 μm sections for proteomics. The 10 μm sections were stained with only haematoxylin. Three pathologists (M.N., H.O., and T.N.) independently made a diagnosis using the 4 μm sections stained with haematoxylin and eosin according to the WHO classification. LCNEC has its characteristic cancer cells with relatively larger cytoplasm, less fine chromatin and more distinct nucleoli than those of SCLC. The sections of patients diagnosed unequivocally were used in this study.
Table 1

Patients' Characteristics.

Cancer groupsPatient No.GenderAgeTNM*Staging
LCNEC1F68T1N0M0IA
2M73T2N0M0IB
3M58T1N1M0IIA
4M70T2N0M0IB

5M76T2N2M0IIIA
6M69T3N3M0IIIB
7M64T2N1M0IIB
8M60T2N2M0IIIA
9F77T1N0M0IA
10M69T1N2M0IIIA

SCLC1F62T2N0M0IB
2M77T2N1M0IIB
3M57T2N1M0IIB
4M76T1N1M0IIA
5M64T1N1M0IIA

6F70T1N1M0IIA
7M69T1N1M0IIA
8M77T2N0M0IB
9M73T1N0M0IA
10M73T2N1M0IIB

LCC1M52T2N1M0IIB
2M71T1N0M0IA
3F57T1N0M0IA
4M51T4N2M0IIIB
5M72T1N1M0IIA

6M67T1N1M0IIA
7M67T2N0M0IB
8M58T1N0M0IA
9M67T2N0M0IB
10M66T1N0M0IA

*Ref. [31]

Patients' Characteristics. *Ref. [31]

2. 2. Immunohistochemical Staining

The neuroendocrine nature of tumors was confirmed with the three representative antibodies, monoclonal mouse anti CD56 antibody (Novocastra, Newcastle upon Tyne, U.K.), polyclonal rabbit anti CGA antibody (DAKO Japan, Kyoto, Japan) and monoclonal mouse anti SYN antibody (DAKO Japan, Kyoto, Japan). The staining of these antibodies was performed automatically on a Ventana Benchmark® XT (Ventana Japan, Tokyo, Japan). Expression of four proteomics-identifying proteins specific to LCNEC was tested with the following commercially available antibodies according to the manufacturer's protocols: monoclonal rabbit anti AL1A1 antibody (Abcom Japan, Tokyo, Japan), polyclonal anti AK1C1 antibody (GeneTex, Irvine, CA, USA), monoclonal anti AK1C3 antibody (Sigma Japan, Tokyo, Japan) and monoclonal mouse anti CD44 antibody (Abcom Japan, Tokyo, Japan). Briefly, sections were incubated with xylene, rehydrated with graded ethanol solutions and incubated with methyl alcohol containing 3% hydrogen peroxide to remove endogenous peroxidase activity. After washing thoroughly with PBS, sections were incubated with adequately diluted primary antibodies and then with Histofine simple stain® (Nichirei Bioscience, Tokyo, Japan), and finally visualized with products of the peroxidase and diaminobenzidien reaction.

2. 3. Laser Capture and Protein Solubilization

Cancerous lesions were identified on serial sections of NSCLC tissues stained with hematoxylin-eosin (HE). For proteomic analysis, a 10 μm thick section prepared from the same tissue block was attached onto DIRECTOR™ slides (Expression Pathology, Rockville, MD, USA), de-paraffinized twice with xylene for 5 min., rehydrated with graded ethanol solutions and distilled water and stained by only hematoxylin. Those slides were air-dried and subjected to laser microdissection with a Leica LMD6000 (Leica Micro-systems GmbH, Ernst-Leitz-Strasse, Wetzlar, Germany). At least 30,000 cells (8.0mm2) were collected directly into a 1.5mL low-binding plastic tube. Proteins were extracted and digested with trypsin using Liquid Tissue™ MS Protein Prep kits (Expression Pathology, Rockville, MD, USA) according to the manufacturer's protocol.

2. 4. Liquid Chromatography-Tandem Mass Spectrometry

We here adopted label-free semi-quantitation using spectral counting by liquid chromatography (LC)-tandem mass spectrometry (MS/MS) to a global proteomic analysis. The digested samples were analyzed in triplicates by LC-MS/MS using reversed-phase liquid chromatography (RP-LC) interfaced with a LTQ-Orbitrap hybrid mass spectrometer (Thermo Fisher Scientific, Bremen, Germany) via a nano-electrospray device as described in details previously [13]. Briefly, the RP-LC system consisted of a peptide Cap-Trap cartridge (0.5 × 2.0 mm) and a capillary separation column (an L-column Micro of 0.2 × 150 mm packed with reverse phase L-C18 gels of 3 μm in diameter and 12 nm pore size, (CERI, Tokyo, Japan)) connected an emitter tip (FortisTip of 20 μm ID and 150 μm OD with a perfluoropolymer-coated blunt end, OmniSeparo-TJ, Hyogo, Japan) to the outlet. An autosampler (HTC-PAL, CTC Analytics, Switzerland) loaded an aliquot of samples onto the trap, which then was washed with solvent A (98% distilled water with 2% acetonitrile and 0.1% formic acid) for concentrating peptides on the trap and desalting. Subsequently, the trap was connected in series to the separation column, and the whole columns were developed for 70 min. with a linear acetonitrile concentration gradient made from 5 to 40% solvent B (10% distilled water and 90% acetonitrile containing 0.1% formic acid) at the flow-rate of 1 μL/min. An LTQ was operated in the data-dependent MS/MS mode to automatically acquire up to three successive MS/MS scans in the centroid mode. The three most intense precursor ions for these MS/MS scans could be selected from a high-resolution MS spectrum (a survey scan) that an Orbitrap previously acquired during a predefined short time window in the profile mode at the resolution of 30 000 in the m/z range of 400 to 1600. The sets of acquired high-resolution MS and MS/MS spectra for peptides were converted to single data files and they were merged into Mascot generic format files for database searching.

2.5 Database Searching and Semi-quantification with Spectral Counting

Mascot software (version 2.1.1, Matrix Science, London, UK) was used for database search against Homo sapiens entries in the UniProtKB/Swiss-Prot database (Release 56.6, 20413 entries). Peptide mass tolerance was 10ppm, fragment mass tolerance 0.8Da, and up to two missed cleavages were allowed for errors in trypsin specificity. Carbamidomethylation of cysteines was taken as fixed modifications, and methionine oxidation and formylation of lysine, arginine and N-terminal amino acids as variable modifications. A p-value being < 0.05 was considered significant, and the score cutoff was 44. The lists of identified proteins were merged into a master file where the primary accession numbers and entry names from UniProtKB were used. The false positive rates for protein identification were estimated using a decoy database created by reversing the protein sequences in the original database; the estimated false positive rate of peptide matches was 0.45% under protein score threshold conditions (p < 0.005). Mascot search results were processed through Scaffold software (version 2.02.03, Proteome Software, Portland, OR) to semi-quantitatively analyze differential expression levels of proteins in LCNEC, LCC and SCLC by spectral counting as described [11]. The number of peptide MS/MS spectra with high confidence (Mascot ion score, p < 0.005) was used for calculating spectral counts. Fold changes of expressed proteins in the base 2 logarithmic scale (R) were calculated using spectral counting as described [11]. Candidate proteins between two groups were chosen so that their Rsatisfy >1 or <−1, which correspond to their fold changes >2 or <0.5. G-test was used for evaluating differential protein expression in pair-wise cancer groups [14]. In this study we mainly focus on LCNEC vs. SCLC comparison, but the other pairs were considered. The results are illustrated in a three-dimensional plot to judge whether a protein is specifically expressed in a given cancer group. Although G-test does not require replicates, spectral counts for each protein from triplicates were pooled and used for G-statistic calculation using a two-way contingency table arranged in two rows for a target protein and any other proteins, and two columns for cancer groups on an Excel macro. Statistical significance should be p < 0.05. The Yates correction for continuity is applied to the 2 × 2 tables. The correction could enable us to handle the data containing small spectral counts including zero. Statisticians, however, showed that the results of G-test using a contingency table containing small counts are not so convincing because it is assumed that the G statistic asymptotically obey a χ2 distribution with one degree of freedom. To validate the G-test results, we calculated exact p-values for some significant proteins without making any assumptions of statistical distribution based on the permutational distribution of the test statistic, i.e., Fisher's exact test and Mann-Whitney U test for the contingency tables using a R package.

3. Results

3. 1. Patient groups and pathological classification

To explore protein markers to distinguish LCNEC from SCLC, we investigated cancer cells prepared by laser microdissection from FFPE sections of LCNEC, SCLC, and LCC with a shotgun proteomic method. The LCNEC group consisted of four independent patients and other two groups consisted of five independent ones. For immunohistochemistry, we added more patients so as to amount to 10 patients for each group. Patients were divided into those cancer groups according to the WHO classification and by immunohistochemistry with antibodies raised against established neuroendocrine markers, CD56, CGA and Syn (Table 1 and Figure 1). All LCNEC and SCLC tissues used in this study are positively stained with at least one of these antibodies consistent with the neuroendocrine nature of those cancers. LCC tissues were not stained immunohistochemically except for 2 cases with faintly positive for Syn but histopathological differentiation from SC, AC and SCLC was required for its definite diagnosis. The patient profiles including the TNM pathological classification and staging are summarized in Table 1. There was no difference between the ages for each group (p = 0.076 by ANOVA, mean age + SD: 68.4 + 6.3 for LCNEC, 69.8 + 6.8 for SCLC, and 62.8 + 7.7 for LCC) and the number of male accounts for over 80% for all groups. The majority of patients remained at stages from IA to IIB and accordingly had the extent of the primary tumor (T1 and T2) and of regional lymph node involvement (N0 and N1) except for the most advanced stage IIIA or IIIB in a LCC patient (patient 4) and additional four patients of LCNEC for immunohistochemistry (patients 5, 6, 8, and 10). All patients had no distant metastasis (M0). All the patients but patient 5 (carboplatin + irinotecan) in LCNEC and patient 4 (carboplatin + pacritaxel) in LCC have not undergone pre-operative chemotherapy.
Figure 1

Immunohistochemistry with antibodies raised against established neuroendocrine markers, CD56, CGA, and Syn.

Immunohistochemistry with antibodies raised against established neuroendocrine markers, CD56, CGA, and Syn.

3. 2. LC-MS/MS protein identifications and semi-quantification by spectral counting

Trypsin-digests from laser-microdissected samples typically containing ~30,000 cells were analyzed in triplicate by LC-MS/MS as described in "Materials and Methods". Under the database search settings used, we identified significant proteins as follows: LCNEC contained a total of 1,124 proteins including 410 unique, 168 in the overlap only between LCNEC and SCLC, 93 in the overlap only between LCNEC and LCC, and 453 in the overlap among three groups; SCLC contained a total of 1,096 including 362 unique, 100 in the overlap only between SCLC and LCC and the overlapped proteins described above; LCC contained a total of 1,083 including 450 unique and the overlapped proteins described earlier. The spectral counts were calculated for these proteins and those from triplicate experiments were pooled, thereby improving the performance of G-test and decreasing false positive rates significantly [14]. There was no significant difference among the total spectral counts of each group (p = 0.248 by ANOVA; mean counts + SD: 1916 + 571 for LCNEC, 1879 + 457 for SCLC, 2491 + 645 for LCC). Next, the values of Rsc that is a measure of fold changes for protein expression levels were calculated as described in "Materials and Methods" using the spectral counts of these proteins. The pooled counts for each protein were also subjected to pair-wise G-test between cancer groups. Table 2 shows the identified proteins that are significantly up- or down-regulated in LCNEC compared with SCLC as judged by G test under p <0.05. The proteins are listed in descending order of the Rsc values; the larger the Rsc value of a given protein, the greater its expression level in LCNEC compared with SCLC and vice versa. Representative proteins up-regulated in LCNEC were AL1A1, AK1C1, AK1C3, brain-type fatty acid-binding protein (FABP) and β-enolase. On the other hand, those in SCLC were brain acid soluble protein 1 (BASP), secretagogin (SEGN), fascin and neural cell adhesion molecule (CD56).
Table 2

Significant changes in protein expression levels as judged with G-test under p < 0.05 for an LCNEC vs. SCLC pair.

NoEntry nameAccession numberProteinsGPRscSpectral counts

LCNEC SCLC
1AK1C3P42330Aldo-keto reductase family 1 member C339.13.93E-104.91250
2AK1C1Q04828Aldo-keto reductase family 1 member C137.49.56E-104.86240
3FABP7O15540Fatty acid-binding protein, brain21.92.89E-064.22150
4ENOBP13929Beta-enolase22.22.50E-063.61181
5AL1A1P00352Retinal dehydrogenase 111.86.07E-043.5590
64F2P081954F2 cell-surface antigen heavy chain11.86.07E-043.5590
71C12P30508HLA class I histocompatibility antigen, Cw-12 alpha chain precursor11.86.07E-043.5590
8TBA4AP68366Tubulin alpha-4A chain11.86.07E-043.5590
9LG3BPQ08380Galectin-3-binding protein precursor20.65.77E-063.54171
101C03P04222HLA class I histocompatibility antigen, Cw-3 alpha chain precursor10.11.48E-033.4080
11TKTP29401Transketolase8.463.62E-033.2470
12VTNCP04004Vitronectin precursor6.858.87E-033.0560
13G6PDP11413Glucose-6-phosphate 1-dehydrogenase6.858.87E-033.0560
14PRDX4Q13162Peroxiredoxin-46.858.87E-033.0560
15VDAC1P21796Voltage-dependent anion-selective channel protein 16.858.87E-033.0560
161B15P30464HLA class I histocompatibility antigen, B-15 alpha chain precursor6.858.87E-033.0560
17VILIP09327Villin-16.858.87E-033.0560
18DESPP15924Desmoplakin11.198.24E-042.96111
19AHSA1O95433Activator of 90 kDa heat shock protein ATPase homolog 15.272.18E-022.8450
20COPBP53618Coatomer subunit beta5.272.18E-022.8450
21TMEDAP49755Transmembrane emp24 domain-containing protein 10 precursor5.272.18E-022.8450
22CD44P16070CD44 antigen precursor5.272.18E-022.8450
23COPAP53621Coatomer subunit alpha5.272.18E-022.8450
24TBB4QQ99867Putative tubulin beta-4q chain5.272.18E-022.8450
25THILP24752Acetyl-CoA acetyltransferase, mitochondrial precursor5.272.18E-022.8450
26EFTUP49411Elongation factor Tu, mitochondrial precursor14.881.14E-042.47194
27IDHPP48735Isocitrate dehydrogenase [NADP], mitochondrial precursor13.552.32E-042.39184
28LRC47Q8N1G4Leucine-rich repeat-containing protein 475.412.01E-022.3971
29CO6A1P12109Collagen alpha-1(VI) chain precursor4.084.34E-022.2061
30PSAP55786Puromycin-sensitive aminopeptidase4.084.34E-022.2061
31IMB1Q14974Importin subunit beta-15.951.47E-022.1792
32PSA2P25787Proteasome subunit alpha type-24.722.99E-022.0282
33FASP49327Fatty acid synthase9.931.63E-031.93186
34A1ATP01009Alpha-1-antitrypsin precursor4.054.41E-021.62104
35ROA1P09651Heterogeneous nuclear ribonucleoprotein A16.431.12E-021.58167
36FINCP02751Fibronectin precursor8.014.64E-031.57209
37TRAP1Q12931Heat shock protein 75 kDa, mitochondrial precursor6.261.24E-021.50178
38MYH14Q7Z406Myosin-146.261.24E-021.50178
39ANXA2P07355Annexin A25.461.94E-021.49157
40PHB2Q99623Prohibitin-24.673.07E-021.49136
41GSTP1P09211Glutathione S-transferase P10.631.12E-031.383217
42PDIA1P07237Protein disulfide-isomerase precursor8.962.76E-031.332916
431433GP6198114-3-3 protein gamma8.114.40E-031.282816
44ACTN4O43707Alpha-actinin-48.822.98E-031.263118
45PCBP2Q15366Poly(rC)-binding protein 25.032.49E-021.162113
46TPISP60174Triosephosphate isomerase6.451.11E-021.122818
47TRFEP02787Serotransferrin precursor7.177.41E-031.123120
48ARF1P84077ADP-ribosylation factor 15.002.53E-021.092315
49PCBP1Q15365Poly(rC)-binding protein 14.313.79E-021.012316
50CO6A3P12111Collagen alpha-3(VI) chain precursor5.162.31E-020.913224
51EF1A1P68104Elongation factor 1-alpha 14.722.98E-020.833528
52PDIA6Q15084Protein disulfide-isomerase A6 precursor4.014.52E-020.813125
53G3PP04406Glyceraldehyde-3-phosphate dehydrogenase14.211.64E-040.7711395
54ENPLP14625Endoplasmin precursor5.761.64E-020.666256
55TBB2AQ13885Tubulin beta-2A chain5.801.61E-020.636964
56VIMEP08670Vimentin5.921.49E-020.607773
57HBBP68871Hemoglobin subunit beta4.882.72E-02-0.5353110
58TBB5P07437Tubulin beta chain10.899.64E-04-0.6381179
59TBA1AQ71U36Tubulin alpha-1A chain14.381.49E-04-0.6793211
60TBB2BQ9BVA1Tubulin beta-2B chain5.931.49E-02-0.693582
61H2B1BP33778Histone H2B type 1-B6.451.11E-02-0.832565
62LMNB1P20700Lamin-B17.277.03E-03-0.872567
63HBAP69905Hemoglobin subunit alpha5.771.63E-02-0.921748
64CALMP62158Calmodulin3.904.82E-02-1.00928
65HNRH1P31943Heterogeneous nuclear ribonucleoprotein H4.363.67E-02-1.011031
66NUMA1Q14980Nuclear mitotic apparatus protein 14.832.80E-02-1.011134
67LAP2AP42166Lamina-associated polypeptide 2 isoform alpha5.312.13E-02-1.051135
68H31TQ16695Histone H3.1t6.231.26E-02-1.061341
69GDIAP31150Rab GDP dissociation inhibitor alpha7.655.67E-03-1.091548
70TBA1CQ9BQE3Tubulin alpha-1C chain14.231.62E-04-1.122786
71TBA1BP68363Tubulin alpha-1B chain35.272.88E-09-1.1663202
72K1C19P08727Keratin, type I cytoskeletal 1910.641.11E-03-1.191758
73HSP76P17066Heat shock 70 kDa protein 66.461.10E-02-1.23933
74H12P16403Histone H1.27.595.87E-03-1.31935
75TBB4P04350Tubulin beta-4 chain12.663.73E-04-1.501148
76MOESP26038Moesin4.513.36E-02-1.51316
77KU70P12956ATP-dependent DNA helicase 2 subunit 122.322.31E-06-1.641675
78DYHC1Q14204Cytoplasmic dynein 1 heavy chain 18.543.48E-03-1.67527
79RBBP4Q09028Histone-binding protein RBBP46.581.03E-02-1.74319
80PGS1P21810Biglycan precursor3.994.59E-02-1.81110
81ROA1LQ32P51Heterogeneous nuclear ribonucleoprotein A1-like protein7.306.89E-03-1.81320
82HNRPFP52597Heterogeneous nuclear ribonucleoprotein F4.772.90E-02-1.93111
83RUXGP62308Small nuclear ribonucleoprotein G6.401.14E-02-2.15113
841433SP3194714-3-3 protein sigma4.194.08E-02-2.2107
85PEG10Q86TG7Retrotransposon-derived protein PEG104.194.08E-02-2.2107
86CAYP1Q13938Calcyphosin4.194.08E-02-2.2107
87GBB1P62873Guanine nucleotide-binding protein G(I)/G(S)/G(T) subunit beta-14.194.08E-02-2.2107
88NCA11P13591Neural cell adhesion molecule 1, 140 kDa isoform precursor4.194.08E-02-2.2107
89FSCN1Q16658Fascin5.112.38E-02-2.3708
90ROA0Q13151Heterogeneous nuclear ribonucleoprotein A08.982.74E-03-2.43116
91MDHCP40925Malate dehydrogenase, cytoplasmic7.008.13E-03-2.66010
92H2A1DP20671Histone H2A type 1-D8.942.79E-03-2.89012
93SEGNO76038Secretagogin15.9156.63E-05-3.51019
94MAP1BP46821Microtubule-associated protein 1B16.9263.89E-05-3.58020
95BASPP80723Brain acid soluble protein 124.0679.30E-07-3.99027

Proteins are listed in descending order of Rvalues, pooled spectral counts are listed, and "_HUMAN" are removed from UniProtKG entry names.

Significant changes in protein expression levels as judged with G-test under p < 0.05 for an LCNEC vs. SCLC pair. Proteins are listed in descending order of Rvalues, pooled spectral counts are listed, and "_HUMAN" are removed from UniProtKG entry names.

3. 3. Biomarker Candidates for LCNEC

To illustrate the specificity of protein expression toward LCNEC more clearly, we made a 3D scatter plot with an × axis indicating G-statistic values (G values) for LCNEC vs. LCC analysis, a y axis for LCC vs. SCLC, and a z axis for LCNEC vs. SCLC (Figure 2). When the spectral counts of a target protein are zero for both groups in question, it is hereafter defined as G = 0. The proteins expressed specifically to LCNEC will therefore be present in the region (x>3.84, z>3.84 corresponding to p < 0.05 each) on the x-z plane, those in SCLC in the region (y>3.84, z>3.84) on the y-z plane and those in LCC in the region (x>3.84, y>3.84) on the x-y plane. We used 1,918 proteins for this plotting. Close inspection of the 3D plot shows that AK1C3 at a point (40.8, 0, 39.1), AK1C1 at a point (39.0, 0, 37.4), AL1A1 at a point (8.75, 2.6 × 10-5, 11.8) and CD44 antigen precursor (CD44) at a point (5.56, 0, 5.27) are very near or on the x-z plane with convincingly low p-values (3.9 × 10-10, 9.6 × 10-10, 6.1 × 10-4, and 0.021, respectively) from LCNEC vs. SCLC comparisons and thus specific to LCNEC. Interestingly, AK1C1, AK1C3, AL1A1, and CD44 have been reported to be biomarkers of cancer stem cells (see Discussion). In Table 2 BASP and SEGN are significantly up-regulated in SCLC compared with LCNEC, which are indeed located on the y-z plane at the respective points (0, 32.2, 24.1) and (0, 21.5, 15.9), and specific to SCLC. Major vault protein (MVP) is at a point (23.8, 34.1, 0) on the x-y plane, indicating an LCC-specific protein. One of well known proteins related to SCLC, γ-enolase (ENOG) is detectable at a point (0.55, 7.23, 2.84) in the 3D G-statistic space which indicates that it is expressed significantly in SCLC compared to in LCC. The G-statistic is assumed to obey a χ2-distribution with one degree of freedom and the p-values based on G-values obtained with the contingency tables containing small counts should be handled with caution. Therefore we calculated exact p-values for the 2 × 2 tables with the non-parametric Fisher's exact test and Mann-Whitney U test. The results were fully consistent with those obtained with the G-test; the exact p-values for LCNEC vs. SCLC were 3.40 × 10-4 for AL1A1, 5.53 × 10-10 for AK1C1, 2.27 × 10-10 for AK1C3, and 0.012 for CD44. The G-test analyses of three cancer group pairs (LCNEC vs. SCLC, LCNEC vs. LCC, and LCC vs. SCLC) under p < 0.05 retrieved the respective 95, 186 and 237 proteins that showed significant changes in expression levels. These proteins were subjected to gene ontology (GO) analysis, highlighting their biological and molecular functions and cellular localization. As Figure 3 shows, the molecular functions and cellular localization of proteins preferentially expressed in the LCNEC vs. SCLC pair were quite different from those of the other pairs.
Figure 2

Marker candidates' extraction by pairwise G statistics. In the 3D scatter plot, X, Y, Z-axis shows G-values (X: LCNEC vs. LCC; Y: LCC vs. SCLC; Z: LCNEC vs. SCLC). Data point sets from 1,918 proteins were plotted with circles. AK1C1 and AK1C3 (orange), AL1A1 (purple) and CD44 (red) Proteins being located very near or on X-Z plane are isolated as candidates of specific LCNEC markers. SEGN (yellow) were located on Y-Z plane, which was already known as one of SCLC-specific markers.

Figure 3

Gene ontology (GO) analysis on the molecular functions and cellular localization of proteins preferentially expressed in three cancer group pairs (LCNEC vs. SCLC, LCNEC vs. LCC, and LCC vs. SCLC). A) Molecular functions: 1, antioxidant activity; 2, auxiliary transport protein activity; 3, binding; 4, catalytic activity; 5, chemoattractant activity; 6, electron carrier activity; 7, enzyme regulator activity; 8, molecular function; 9, molecular transducer activity; 10, motor activity; 11, structural molecule activity; 12, transcription regulator activity; 13, translation regulator activity; 14, transporter activity. B) Cellular localizations: 1, Golgi apparatus; 2, cytoplasm; 3, cytoskeleton; 4, endoplasmic reticulum; 5, endosome; 6, extracellular region; 7, intracellular organelle; 8, membrane; 9, mitochondrion; 10, nucleus; 11, organelle membrane; 12, organelle part; 13, plasma membrane; 14, ribosome.

Marker candidates' extraction by pairwise G statistics. In the 3D scatter plot, X, Y, Z-axis shows G-values (X: LCNEC vs. LCC; Y: LCC vs. SCLC; Z: LCNEC vs. SCLC). Data point sets from 1,918 proteins were plotted with circles. AK1C1 and AK1C3 (orange), AL1A1 (purple) and CD44 (red) Proteins being located very near or on X-Z plane are isolated as candidates of specific LCNEC markers. SEGN (yellow) were located on Y-Z plane, which was already known as one of SCLC-specific markers. Gene ontology (GO) analysis on the molecular functions and cellular localization of proteins preferentially expressed in three cancer group pairs (LCNEC vs. SCLC, LCNEC vs. LCC, and LCC vs. SCLC). A) Molecular functions: 1, antioxidant activity; 2, auxiliary transport protein activity; 3, binding; 4, catalytic activity; 5, chemoattractant activity; 6, electron carrier activity; 7, enzyme regulator activity; 8, molecular function; 9, molecular transducer activity; 10, motor activity; 11, structural molecule activity; 12, transcription regulator activity; 13, translation regulator activity; 14, transporter activity. B) Cellular localizations: 1, Golgi apparatus; 2, cytoplasm; 3, cytoskeleton; 4, endoplasmic reticulum; 5, endosome; 6, extracellular region; 7, intracellular organelle; 8, membrane; 9, mitochondrion; 10, nucleus; 11, organelle membrane; 12, organelle part; 13, plasma membrane; 14, ribosome.

3. 4. Extended immunohistochemical validation of the proteomics results

From this proteomic study we identified AL1A1, AK1C1, AK1C3 and CD44 as biomarker candidates for LCNEC. The results were immunohistochemically verified using a total of 10 cases for each group. We assessed immunoreactivity with the percentage of immunopositive area and staining intensity compared to those of positive-control samples at the maximal cut-surface of tumors (Figure 4). All SCLC cases showed no immunoreactivity with AK1C1, AK1C3 and CD44 and the reactivity of all antibodies with LCNEC sections differed impressively from that of SCLC, supporting the proteomic results. Notably, nine cases of LCNEC including four used for the proteomic experiments were AL1A1 positive in the extent of 30 to 90%. The most intense staining (90% positive area) was observed in patient 2 of LCNEC (Table 1 and Figure 4A). On the other hand, LCC and SCLC sections with typical histology were AL1A1 negative (Figure 4A). There were four cases with weak immunoreactivity (30-80% area) which would contain the small areas mimicking some LCNEC morphology. In LCNEC four were immuno-positive (30-100% positive area) to both AK1C1 and AK1C3, and there was one more AK1C3 positive case. In LCC group one case was AK1C1 positive and four cases were AK1C3 positive; these cases showed small areas with neuroendocrine tendency in the tissue structure. Immnoreactivity of LCNEC cells to CD44 were the same as that of LCC.
Figure 4

Imunohistochemical identification of proteomics-identifying proteins. A) Histological appearances of LCNEC, SCLC and LCC, and immunohistochemical staining of AL1A1, AK1C1 and AK1C3. Magnification, x200. B) Immunoreactivitiy with AL1A1, AK1C1, AK1C3, and CD44. The immunoreactivity was indicated as the percentage of immunopositive area at the maximal cut-surface of tumors.

Imunohistochemical identification of proteomics-identifying proteins. A) Histological appearances of LCNEC, SCLC and LCC, and immunohistochemical staining of AL1A1, AK1C1 and AK1C3. Magnification, x200. B) Immunoreactivitiy with AL1A1, AK1C1, AK1C3, and CD44. The immunoreactivity was indicated as the percentage of immunopositive area at the maximal cut-surface of tumors.

4. Discussions

This study aimed at developing the way of proteomic distinction between LCNEC and SCLC, which will assist pathologic distinction that has not sometimes been straightforward, leading to therapeutic inefficiency. We have been focusing our attention on using laser-microdissection sampling from FFPE sections for proteomics to explore disease-related protein markers. We have already applied this method to both global semi-quantitative shotgun proteomics using spectral counting and MRM-based quantitative proteomics and successfully identified stage-related proteins on lung AC [11,12]. In this study, we used the same global shotgun method for comparison of three cancer groups (LCNEC, SCLC, and LCC) by spectral counting and explicitly interpreted three sets of pairwise G test results in the 3D G-statistic space (Figure 2). This resulted in identifying four proteins AL1A1, AK1C1 AK1C3 and CD44 that were expressed in LCNEC more than in SCLC and LCC with high probabilities. These proteomic findings using the limited scale of patients were confirmed by routine immunohistochemitry with additional patients. Moreover we identified other proteins related to these cancer groups in the present study, further demonstrating the technical feasibility of this FFPE proteomic method. The identified four proteins physiologically take part in known metabolic processes. AL1A1, AK1C1 and AK1C3 are cytosolic oxidoreductases that are involved in reduction of progesterone to the inactive form 20-alpha-hydroxy-progesterone, metabolism of steroids and prostaglandins with multi-specificity, oxidation of retinal to retinoic acid and the precursor of the storage form vitamin A, respectively. CD44 is one of cell-surface glycoproteins which relates to cell-cell interactions including adhesion and migration, and thus to tumor growth and progression [15]. When we have considered the properties common to these proteins that have apparently no functional relationship with one another, we noticed that AL1A1 [16,17], AK1C1 [18], AK1C3 [19] and CD44 [20] have been proposed to be the markers of cancer stem cells. Their expression in tumor cells could correlate with their aggressive biological behavior, drug resistance and poor prognosis, which are common characteristics of LCNEC and SCLC. The preferential expression of the cancer stem cell markers in LCNEC over SCLC suggests that the mechanism of increasing the extent of malignancy in LCNEC differs from that in SCLC. Previous studies suggested that these redox enzymes were present in a variety of malignant tumor cells. In particular, AK1C1, and AK1C3 are reported in human non-small cell lung carcinoma (A549) cells [21], and a high expression of AL1A1 in lung cancer cell lines, especially in AC cell lines compared to LCC and SCLC cell lines [22-24]. To our knowledge, however, this is the first report of the statistically significant proteomic detection of AL1A1, AK1C1 and AK1C3 in clinical samples of lung cancers, especially in LCNEC. Out of the top five LCNEC-specific proteins, brain-type FABP7 is present in highly infiltrative malignant glioma and associated with enhanced cell migratory activity and thus with poor prognosis [25], suggesting for its involvement in the aggressive nature of LCNEC. Out of the top five down-regulated LCNEC proteins compared with SCLC, BASP is a potential tumor suppressor [26], consistent with its down-regulation in LCNEC, and its specific expression in SCLC suggests that different mechanisms of tumor growth could operate between LCNEC and SCLC. Another SCLC-specific SEGN is a novel neuroendocrine marker that has a distinct expression pattern from the conventional ones used in this study, consistent with being negative in LCNEC, and with the reported rate for positive staining in SCLC (26 out of 31) [27]. The role of AL1A1 in lung cancers is still unknown, but it is recently reported that AL1A1 plays an important role in Notch pathway [28]. Though there has been no effective chemotherapy to LCNEC, Sorafenib, a tyrosine kinase inhibitor in the MAP kinase pathway, is effective to malignant tumor cells with AL1A [29]. AL1A1 would be not only cancer stem cell markers, but also an attractive target of treatment of LCNEC. In addition to statistically sorting protein expression levels by spectral counting, GO mapping of significant proteins on pairwise comparison (p < 0.05) provides insights into overall differences from pair and pair in their biological and molecular functions, and cellular components. Gene ontology distributions of molecular function and cellular components in neuroendocrine vs. non-neuroendocrine comparisons, i.e., LCNEC vs. LCC and SCLC vs. LCC, did not significantly differ from each other. On the other hand, those distributions in comparison within neuroendocrine groups, LCNEC vs. SCLC, differed greatly from those of the other pairs. This does encourage us to go ahead with further studies in this line and will promise to get target proteins of LCNEC eventually in future. We checked the rate of positive immuno-reaction of relevant antibodies with proteomics-identifying proteins for ten patients of each group (Figure 4B). Differences between the rates for all target proteins in LCNEC and SCLC are fully consistent with the proteomic results, confirming the specificity to LCNEC. The preferential expression of AL1A1 and AK1C1 in LCNEC over LCC was also immunochemically confirmed, and the rate of AL1A1 positive cases in LCC (20%) agreed with the previous results (25%, 1 of 4) [16]. In contrast, the positive staining rates of AK1C3 and CD44 in LCNEC and LCC were similar to each other. Close inspection of HE sections showed that the positive cases in LCC had small areas with neuroendocrine tendency in the tissue structure as pointed out above. Almost all sections of LCC exhibited no immunoreactivity with the neuroendocrine markers used except for weak reactivity (20 or 30%) in only two cases. This suggests that the LCNEC like structure observed in small portions of LCC sections does not necessarily contain enough secretory granules, but presumably contain LCNEC specific AK1C3 and CD44. Confirmatory conclusion of this issue should await proof by electron micrographic immunohitochemistry. A previous study indicated that CD44 was expressed more in SC (97%) and AC (71%) compared to LCC (29%) and SCLC (0%) [30] in agreement with the present positive rates for LCC (30%) and SCLC (0%).

5. Conclusions

We concluded that AL1A1, AK1C1, AK1C3, and CD44 were specific for the LCNEC phenotype in relation to SCLC and LCC through proteomics of FFPE samples. They were useful targets to immunohistochemically distinguish LCNEC from SCLC and LCC. Though we need a variety of studies with more extensive experimental and clinical data to assess the precise function of these marker candidates and confirm them as real biomarkers, this proteomic analysis was effective to detect them and will be applied to other phenotype of malignancies.

Abbreviations

NSCLC: non-small cell lung carcinoma; LCNEC: large cell neuroendocrine carcinoma; LCC: large cell carcinoma; SCLC: small cell lung carcinoma; CSC: cancer stem cell; LC: liquid chromatography; MS: mass spectrometry; FFPE: formalin-fixed paraffin embedded; LMD: laser microdissection; MS/MS: tandem mass spectrometry; ISIS: in-sample internal standard; AL1A1: aldehyde dehydrogenase 1 family, member A 1; AK1C1: aldo-keto reductase family 1, member C1; AK1C3: aldo-keto reductase family 1, member C3; HE: hematoxylin-eosin

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

MN coordinated the clinical and experimental parts of study and drafted the manuscript. TF and KF performed protein analysis through mass spectrometry. TK carried out proteomic data analysis. HT performed statistical analysis and helped to draft the manuscript. MK performed statistical analysis of G-test. YB helped us to use FFPE technique. AG suggested some important points of pathological diagnosis of LCNEC. MT offered clinical samples from patients. HO and TN pathologically diagnosed all samples independently. TO and NI supported us clinically and financially. NG supported us experimentally and financially. HK supported us clinically. GMV and TN coordinated FFPE project and assessed the results. All authors read and approved the final manuscript.
  29 in total

1.  Synergistic activity of sorafenib and sulforaphane abolishes pancreatic cancer stem cell characteristics.

Authors:  Vanessa Rausch; Li Liu; Georgios Kallifatidis; Bernd Baumann; Jürgen Mattern; Jury Gladkich; Thomas Wirth; Peter Schemmer; Markus W Büchler; Margot Zöller; Alexei V Salnikov; Ingrid Herr
Journal:  Cancer Res       Date:  2010-06-08       Impact factor: 12.701

2.  Proteomics and disease revisited: the challenge of providing proteomic tools into clinical practice.

Authors:  Thomas E Fehniger; György Marko-Varga
Journal:  J Proteome Res       Date:  2010-03-05       Impact factor: 4.466

Review 3.  CD44 and the adhesion of neoplastic cells.

Authors:  Z Rudzki; S Jothy
Journal:  Mol Pathol       Date:  1997-04

4.  Class 1 and class 3 aldehyde dehydrogenase levels in the human tumor cell lines currently used by the National Cancer Institute to screen for potentially useful antitumor agents.

Authors:  L Sreerama; N E Sladek
Journal:  Adv Exp Med Biol       Date:  1997       Impact factor: 2.622

5.  The IASLC Lung Cancer Staging Project: proposals for the revision of the TNM stage groupings in the forthcoming (seventh) edition of the TNM Classification of malignant tumours.

Authors:  Peter Goldstraw; John Crowley; Kari Chansky; Dorothy J Giroux; Patti A Groome; Ramon Rami-Porta; Pieter E Postmus; Valerie Rusch; Leslie Sobin
Journal:  J Thorac Oncol       Date:  2007-08       Impact factor: 15.609

6.  Distinct expression levels and patterns of stem cell marker, aldehyde dehydrogenase isoform 1 (ALDH1), in human epithelial cancers.

Authors:  Shan Deng; Xiaojun Yang; Heini Lassus; Shun Liang; Sippy Kaur; Qunrui Ye; Chunsheng Li; Li-Ping Wang; Katherine F Roby; Sandra Orsulic; Denise C Connolly; Youcheng Zhang; Kathleen Montone; Ralf Bützow; George Coukos; Lin Zhang
Journal:  PLoS One       Date:  2010-04-21       Impact factor: 3.240

7.  Proteomic analysis of laser-microdissected paraffin-embedded tissues: (1) Stage-related protein candidates upon non-metastatic lung adenocarcinoma.

Authors:  Takeshi Kawamura; Masaharu Nomura; Hiromasa Tojo; Kiyonaga Fujii; Hiroko Hamasaki; Sayaka Mikami; Yasuhiko Bando; Harubumi Kato; Toshihide Nishimura
Journal:  J Proteomics       Date:  2009-11-27       Impact factor: 4.044

8.  Aldehyde dehydrogenase 1 is a tumor stem cell-associated marker in lung cancer.

Authors:  Feng Jiang; Qi Qiu; Abha Khanna; Nevins W Todd; Janaki Deepak; Lingxiao Xing; Huijun Wang; Zhenqiu Liu; Yun Su; Sanford A Stass; Ruth L Katz
Journal:  Mol Cancer Res       Date:  2009-03-10       Impact factor: 5.852

9.  Improvements in surgical results for lung cancer from 1989 to 1999 in Japan.

Authors:  Teruaki Koike; Yasushi Yamato; Hisao Asamura; Ryosuke Tsuchiya; Yasunori Sohara; Kenji Eguchi; Masaki Mori; Yoichi Nakanishi; Tomoyuki Goya; Yoshihiko Koshiishi; Etsuo Miyaoka
Journal:  J Thorac Oncol       Date:  2009-11       Impact factor: 15.609

10.  Cancer statistics, 2009.

Authors:  Ahmedin Jemal; Rebecca Siegel; Elizabeth Ward; Yongping Hao; Jiaquan Xu; Michael J Thun
Journal:  CA Cancer J Clin       Date:  2009-05-27       Impact factor: 508.702

View more
  7 in total

Review 1.  Toward deciphering proteomes of formalin-fixed paraffin-embedded (FFPE) tissues.

Authors:  Sameh Magdeldin; Tadashi Yamamoto
Journal:  Proteomics       Date:  2012-04       Impact factor: 3.984

2.  Evaluation of SLOG/TCI-III pediatric system on target control infusion of propofol.

Authors:  Wan-hua Yang; Hong-bin Gu; Bing Chen; Juan Li; Qiu-wei Fan; Yong-fang Yuan; Xiangdong Wang
Journal:  J Transl Med       Date:  2011-11-01       Impact factor: 5.531

3.  Cancer Phenotype Diagnosis and Drug Efficacy within Japanese Health Care.

Authors:  Toshihide Nishimura; Harubumi Kato; Norihiko Ikeda; Makoto Kihara; Masaharu Nomura; Yasufumi Kato; György Marko-Varga
Journal:  Int J Proteomics       Date:  2012-05-22

4.  Clinical initiatives linking Japanese and Swedish healthcare resources on cancer studies utilizing Biobank Repositories.

Authors:  Toshihide Nishimura; Takeshi Kawamura; Yutaka Sugihara; Yasuhiko Bando; Shigeru Sakamoto; Masaharu Nomura; Norihiko Ikeda; Tatsuo Ohira; Junichiro Fujimoto; Hiromasa Tojo; Takao Hamakubo; Tatsuhiko Kodama; Roland Andersson; Thomas E Fehniger; Harubumi Kato; György Marko-Varga
Journal:  Clin Transl Med       Date:  2014-11-22

5.  A selected reaction monitoring mass spectrometric assessment of biomarker candidates diagnosing large-cell neuroendocrine lung carcinoma by the scaling method using endogenous references.

Authors:  Tetsuya Fukuda; Masaharu Nomura; Yasufumi Kato; Hiromasa Tojo; Kiyonaga Fujii; Toshitaka Nagao; Yasuhiko Bando; Thomas E Fehniger; György Marko-Varga; Haruhiko Nakamura; Harubumi Kato; Toshihide Nishimura
Journal:  PLoS One       Date:  2017-04-27       Impact factor: 3.240

6.  A proteomic profile of synoviocyte lesions microdissected from formalin-fixed paraffin-embedded synovial tissues of rheumatoid arthritis.

Authors:  Junji Hayashi; Makoto Kihara; Harubumi Kato; Toshihide Nishimura
Journal:  Clin Proteomics       Date:  2015-08-06       Impact factor: 3.988

7.  Quantitative evaluation of aldo-keto reductase expression in hepatocellular carcinoma (HCC) cell lines.

Authors:  Lei Yang; Ju Zhang; Shenyan Zhang; Weiwei Dong; Xiaomin Lou; Siqi Liu
Journal:  Genomics Proteomics Bioinformatics       Date:  2013-04-11       Impact factor: 7.691

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.