Vlad Serafim1, Ajit Shah2, Maria Puiu1, Nicoleta Andreescu1, Dorina Coricovac3, Alexander Nosyrev4, Demetrios A Spandidos5, Aristides M Tsatsakis6, Cristina Dehelean3, Iulia Pinzaru3. 1. Center of Genomic Medicine, 'Victor Babes' University of Medicine and Pharmacy, Timisoara 300041, Romania. 2. Department of Natural Sciences, Middlesex University, London NW4 4BT, UK. 3. Department of Toxicology, Faculty of Pharmacy, 'Victor Babes' University of Medicine and Pharmacy, 300041 Timisoara, Romania. 4. Central Chemical Laboratory of Toxicology, I.M. Sechenov First Moscow State Medical University, 119991 Moscow, Russia. 5. Laboratory of Clinical Virology, Medical School, University of Crete, 71003 Heraklion, Greece. 6. Department of Forensic Sciences and Toxicology, Faculty of Medicine, University of Crete, 71003 Heraklion, Greece.
Abstract
Over the past decade, matrix-assisted laser desorption/ionization time‑of‑flight mass spectrometry (MALDI‑TOF MS) has been established as a valuable platform for microbial identification, and it is also frequently applied in biology and clinical studies to identify new markers expressed in pathological conditions. The aim of the present study was to assess the potential of using this approach for the classification of cancer cell lines as a quantifiable method for the proteomic profiling of cellular organelles. Intact protein extracts isolated from different tumor cell lines (human and murine) were analyzed using MALDI‑TOF MS and the obtained mass lists were processed using principle component analysis (PCA) within Bruker Biotyper® software. Furthermore, reference spectra were created for each cell line and were used for classification. Based on the intact protein profiles, we were able to differentiate and classify six cancer cell lines: two murine melanoma (B16‑F0 and B164A5), one human melanoma (A375), two human breast carcinoma (MCF7 and MDA‑MB‑231) and one human liver carcinoma (HepG2). The cell lines were classified according to cancer type and the species they originated from, as well as by their metastatic potential, offering the possibility to differentiate non‑invasive from invasive cells. The obtained results pave the way for developing a broad‑based strategy for the identification and classification of cancer cells.
Over the past decade, matrix-assisted laser desorption/ionization time‑of‑flight mass spectrometry (MALDI‑TOF MS) has been established as a valuable platform for microbial identification, and it is also frequently applied in biology and clinical studies to identify new markers expressed in pathological conditions. The aim of the present study was to assess the potential of using this approach for the classification of cancer cell lines as a quantifiable method for the proteomic profiling of cellular organelles. Intact protein extracts isolated from different tumor cell lines (human and murine) were analyzed using MALDI‑TOF MS and the obtained mass lists were processed using principle component analysis (PCA) within Bruker Biotyper® software. Furthermore, reference spectra were created for each cell line and were used for classification. Based on the intact protein profiles, we were able to differentiate and classify six cancer cell lines: two murinemelanoma (B16‑F0 and B164A5), one humanmelanoma (A375), two humanbreast carcinoma (MCF7 and MDA‑MB‑231) and one humanliver carcinoma (HepG2). The cell lines were classified according to cancer type and the species they originated from, as well as by their metastatic potential, offering the possibility to differentiate non‑invasive from invasive cells. The obtained results pave the way for developing a broad‑based strategy for the identification and classification of cancer cells.
The development and application of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) for the identification of microorganisms has been revolutionary. Microbial proteomics has attracted attention worldwide and this has resulted in the development of novel identification methods which are rapid, robust and relatively inexpensive (1).For the identification of microorganisms, MALDI-TOF MS analysis typically concentrates on intact proteins with a molecular weight in the range of 2–20 kDa. The proteins in this range are mainly ribosomal proteins which represent approximately 30% of the total proteins from a microbial cell (2). Following mass spectral analysis, the raw spectra are processed using a peak recognition algorithm. The data generated following MALDI-TOF MS analysis consist of two large sets of values: mass-to-charge (m/z) and intensity. Each m/z value has a corresponding intensity value. The signal extraction, also known as peak picking, is often corrupted by noise. Therefore, various algorithms have been developed for obtaining peaks that correspond to true peptide/proteins signals (3). This is a very important step in data analysis as different peak recognition algorithms may have a considerable effect on the peak list, and therefore should be adjusted with care (4). Once extracted, the peak lists are compared to a dedicated database which contains reference mass spectra of known microbial strains. The first such platform 'MALDI Biotyper' was developed by Bruker Daltonics. Another platform combines the Shimadzu mass instrumentation and software 'Launchpad' with a centralized database 'SARAMIS' provided by BioMerieux (Marcy l'Etoile, France) (1,5).Although mainly used for microbial identification, intact protein profiles have also been used successfully for the characterization and identification of mammalian cell lines (6–8). Karger et al (6) identified 66 cell lines from 34 species using reference spectra library created by MALDI Biotyper, while Povey et al (8) used partial least squares discriminant analysis model to predict the phenotype of recombinant mammalian cell lines.The methods currently used for the identification and characterization of cancer cells, namely DNA fingerprinting, immunohistochemistry and flow cytometry (9–12), require specific reagents which limit the degree of multiplexing. In addition, these methods require laborious sample preparation which leads to an increased analysis time. The proteomic approach can achieve a level of multiplexing where several cell lines can be analyzed without changing the method parameters and without using specialized materials and reagents. The aim of this study was to assess the potential of using MALDI-TOF MS for the classification of cancer cell lines. To achieve this, the procedure for the taxonomic classification of microorganisms was adapted. Six cancer cell lines (murine and human) were used in this study: B16-F0 and B164A5 (murinemelanoma cells), A375 (humanmelanoma), HepG2 (humanliver carcinoma), MCF7 (humanbreast carcinoma) and MDA-MB-231 (humanbreast carcinoma). The statistical analysis was processed using MALDI Biotyper software. These data were used for a better observation of differences regarding the species and metastatic potential, differences well-defined between the two humanbreast carcinoma cell lines (MCF7 and MDA-MB-231) and two murinemelanoma cell lines (B16-F0 and B164A5). As an end point, a different cell line was applied for an upgraded picture: HepG2 (humanliver carcinoma).
Materials and methods
Cell lines and reagents
The cancer cell lines used in the present study, B16-F0 [murinemelanoma; CRL-6322™, American Type Culture Collection (ATCC), Manassas, VA, USA], B16 melanoma 4A5 (murinemelanoma; 94042254; Sigma-Aldrich Chemie GmbH, Munich, Germany), A375 (humanmelanoma; CRL-1619™; ATCC), HepG2 (humanliver carcinoma; HB8065™; ATCC), MCF7 (humanbreast carcinoma; HTB22™; ATCC) and MDA-MB-231 (humanbreast carcinoma; HTB26™; ATCC), were acquired from Sigma-Aldrich Chemie GmbH and ATCC as frozen items.The specific reagents for cell culture [Dulbecco's modified Eagle's medium (DMEM); Eagle's Minimum Essential Medium (EMEM)], fetal bovine serum (FBS), antibiotic mixture of penicillin/streptomycin, phosphate-buffered saline (PBS), Trypsin/EDTA and trypan blue were acquired from Sigma-Aldrich Chemie GmbH and ATCC.Ethanol, formic acid, trifluoroacetic acid and acetonitrile were acquired from Sigma-Aldrich Chemie GmbH. α-cyano-4-hydroxycinnamic acid (Bruker HCCA matrix) and protein I calibration standard were acquired from Bruker Daltonics (Bremen, Germany).
Cell culture
The murinemelanoma (B164A5 and B16-F0), humanmelanoma (A375) and humanbreast carcinoma (MDA-MB-231) cell lines were cultured in DMEM with 4.5 g/l glucose, 2 mM L-glutamine and supplemented with 10% FBS and antibiotic mixture (100 U/ml penicillin and 100 µg/ml streptomycin). The humanliver carcinoma (HepG2) and humanbreast carcinoma (MCF7) were cultured in EMEM, supplemented with 10% FBS and antibiotic mixture. The cells were kept in standard conditions as follows: a humidified atmosphere with 5% CO2 at a temperature of 37°C and were passaged every 2–3 days.
Scratch assay
The migratory character of the tumor cells used in this study was examined by the means of a scratch assay. In brief, 2×105 cells/well were seeded in 12-well plates in specific culture medium and when the confluence was appropriate (85–90%) a gap/scratch was drawn in the middle of the well with a 10 µl tip (13). The capacity of the cells to migrate and fill the gap was monitored for 24 h by acquiring images at different time points, namely 0, 3 and 24 h using an Optika Microscopes Optikam Pro Cool 5 and Optika View (Optika, Ponteranica, Italy).
Protein extraction for MALDI-TOF MS analysis
Sample preparation for MALDI-TOF MS analysis was performed as follows: the culture medium was discarded from the flasks; the cells were washed with 10 ml PBS and were subsequently incubated with 3 ml 0.025% trypsin/EDTA for 3–5 min. The reaction was terminated by the addition of 10 ml cell culture medium. The cells were stained with trypan blue and the cell number was established using the Neubauer cell counting chamber. The suspension was subsequently centrifuged at 1,700 × g for 5 min. The supernatant was discarded and the cell pellet was mixed with 1 ml of ethanol. Following centrifugation at 1,700 × g for 5 min, the ethanol was removed and the pellet was reconstituted in 70% formic acid at a ratio of 20 µl/1×106 cells. The mixture was left at room temperature for 2 min, and then an equal volume of acetonitrile was added. The samples were then centrifuged at 10,000 × g for 5 min. An aliquot (1 µl) of the supernatant was spotted in duplicate onto the MTP 384 ground steel MALDI target plate (Bruker Daltonics). The sample was allowed to dry at ambient temperature and then overlaid with 1 µl of HCCA (5 mg/ml in a mixture of acetonitrile, water and trifluoroacetic acid 50:47.5:2.5% v/v).
MALDI-TOF MS analysis
The MALDI-TOF MS instrument (Bruker Daltonics) was calibrated in a positive ion linear mode in a mass range of 5,000–20,000 m/z, using Bruker Protein Calibration Standard I containing insulin, ubiquitin I, cytochrome c and myoglobin. Flex Control® (version 3.4) software was used to acquire the data and set the method parameters. The following settings were used: laser frequency, 2,000 Hz; smartbeam, '4_large'; sample rate and digitiser settings, 1.25 GS/sec; accelerator voltage, 20.07 kW; extraction voltage, 18.87 kW; lens voltage, 5.58 kW; and delayed extraction, 250 nsec. The laser intensity was adjusted such that the highest peak in the spectrum was in the range of 104 arbitrary units. A 1,000 laser shots were used for each individual spectrum and minimum of 10 individual spectra (10,000 laser shots) were cumulated and saved.
Mass spectra processing
Mass spectra were processed using MALDI Biotyper Offline Client® software (version 3.1; Bruker Daltonics). The MALDI-TOF MS spectra were analyzed using principle component analysis (PCA). The following parameters were used for pre-processing the spectra: mass adjustment from 5,000 to 20,000 m/z; resolution, 1; processing method, spectra compressing; compressing factor, 10; smoothing method, 'Savitsky-Golay' with a frame size of 25 kDa; baseline subtraction method, multipolygon with a search window of 5 kDa in 2 runs; spectra normalization, maximum; peak piking method, 'Local maximum' with 200 maximum peaks, signal-to-noise threshold, 0.01 and window width, 25 kDa. The PCA creation parameters were set as follows: mass from 9,500–12,500 m/z; resolution, 2 and number of principal components set to automatic.For creating main spectra (MSP) the pre-processing parameters used were the same as those described for PCA analysis except that the mass range was 9,500 to 12,500 m/z. For the MSP creation method, the following parameters were used: maximum mass errors of each single spectrum, 2,000; desired mass error for the MSP, 200; desired peak frequency minimum, 25%; maximum desired peak number for the MSP, 10. For the identification method, the following parameters were used: frequency threshold for spectra adjusting, 50; frequency threshold for score calculation, 5; maximum mass error of raw spectrum, 2,000; desired mass tolerance of the adjusted spectrum, 250; furthermore accepted mass tolerance of a peak 600, parameter of the intensity correction function, 0.25. For creating the MSP dendrogram, the following parameters were used: distance measure, correlation; linkage, average; score threshold value for a single organism, 300; and score threshold value for a related organism, 0.
Results
Characteristics of tumor cell appearance in culture
The cell lines employed in the present study in order to be classified and differentiated at proteomic level by the means of MALDI-TOF MS are illustrated in Fig. 1. It can be observed that the cells have different shapes, even though they originate from the same species [Fig. 1C (B16-F0) and D (B164A5) murinemelanoma] or cancer type [Fig. 1E (MCF7) and F (MDA-MB-231) humanbreast carcinoma]. A375 (Fig. 1A) are human amelanotic cells with an epithelial morphology, whereas B16-F0 and B164A5 (murine) are melanin producing cells with a mixture of spindle-shaped and epithelial-like cells (B16-F0) or a fibroblast-like morphology (B164A5) (Fig. 1C and D, granules of melanin can also be observed).
Figure 1
The appearance of tumor cells in culture. (A) A375 (human melanoma cells); (B) HepG2 (human liver carcinoma cells); (C) B16-F0 (murine melanoma cells); (D) B164A5 (murine melanoma cells); (E) MCF7 (breast carcinoma cells); and (F) MDA-MB-231 (breast carcinoma cells).
The humanliver carcinoma cell line, HepG2, has an epithelial morphology and the cells are strongly bonded (Fig. 1B). In the case of the two breast human carcinoma cell lines, MCF7 and MDA-MB-231, some differences were observed in terms of shape (an elongated shape in the case of MDA-MB-231 and a more round one in the case of MCF7 cells), albeit both are described to have an epithelial morphology (Fig. 1E and F).To delineate the differences regarding the migratory character of the tumor cells studied, we performed a wound healing assay/scratch assay. As shown in Fig. 2, the A375 and HepG2 cells were not able to fill the gap drawn within 24 h, what indicates a low migratory capacity.
Figure 2
The migratory/invasive capacity of human melanoma cells (A375) and liver carcinoma cells (HepG2) over a period of 24 h. The contrast of the images was increased for better visualization.
B164A5 cells (murinemelanoma) proved to be highly invasive, with the gap being completely filled after 24 h. The B16-F0 cells also exhibited a potent migratory capacity, although to a lesser extent than the B164A5 cells (Fig. 3).
Figure 3
The migratory capacity of murine melanoma cells (B16-F0 and B164A5) over a period of 24 h. The contrast of the images was increased for better visualization.
The comparison between the two breast carcinoma cell lines in terms of migratory capacity indicated that the MDA-MB-231 cells were more invasive (Fig. 4).
Figure 4
The migratory capacity of human breast carcinoma cells (MCF7 and MDA-MB-231) over a period of 24 h. The contrast of the images was increased for better visualization.
PCA analysis
PCA three-dimensional clustering (Fig. 5) revealed that the three human carcinoma cell lines (HepG2, MDA-MB-231 and MCF7) formed a distinct cluster, while the human (A375) and murinemelanoma (B164A5 and B16-F0) cell lines formed another two separate clusters.
Figure 5
PCA spectra clustering using 9,500–12,500 m/z interval. The top left score plot is a 3D plot of PC1 vs. PC2 vs. PC3 while on right is a 2D plots of PC1 vs. PC2, bottom left is PC1 vs. PC3 and bottom right PC2 vs. PC3. Each dot represents one spectra: 1, A375 (human melanoma); 2, HepG2 (human liver carcinoma); 3, B16-F0 (murine melanoma); 4, B164A5 (murine melanoma); 5, MCF7 (human breast carcinoma); and 6, MDA-MB-231 (human breast carcinoma). PCA, principle component analysis.
Within the humancarcinoma cluster, and also in the PCA dendrogram (Fig. 6) the distance level that separated the breast carcinoma (MDA-MB-231) cell line from the liver carcinoma (HepG2) cell line was shorter than the distance level between the two breast carcinoma cell lines. The contribution of PC1, PC2 and PC3 was found to be 66, 24 and 7%, respectively (Fig. 7).
The contribution of three principal components to the PCA classification. The PC1 contribution was approximately 66% while that of PC2 and PC3 was approximately 24 and 7%, respectively. PCA, principle component analysis.
Creation and use of MSP
All MSP (Fig. 8) were used to confirm the classification of cell lines. This yielded a score for the level of similarity (Table I).
Figure 8
MSP spectra of the cancer cells. MSP were created using the 9,500–12,500 m/z interval (Int). Each green bar represents one reference peak. Ten reference peaks were assigned for each reference spectrums. MSP, main spectra.
Table I
The classification results obtained by comparing the MSP in rapport with each other.
Individual MSP score
Group MSP score
Cell line
Score
Cell group/type
Score
A375
A375
3
Human melanoma
3
MDA-MB-231
1.832
Human carcinoma
2.27
B164A5
1.405
Murine melanoma
0.641
HepG2
1.397
B16-F0
1.389
MCF7
1.344
HepG2
HepG2
3
Human carcinoma
2.505
MDA-MB-231
2.451
Human melanoma
1.397
MCF7
2.109
Murine melanoma
1.35
A375
1.397
B16-F0
1.386
B164A5
1.003
B16-F0
B16-F0
3
Murine melanoma
2.782
B164A5
2.479
Human carcinoma
1.406
HepG2
1.498
Human melanoma
1.389
A375
1.389
MDA-MB-231
1.329
MCF7
1.304
B164A5
B164A5
3
Murine melanoma
2.659
B16-F0
2.48
Human melanoma
1.405
A375
1.405
Human carcinoma
<0
MCF7
1.354
HepG2
1.003
MDA-MB-231
0.641
MCF7
MCF7
3
Human carcinoma
2.116
HepG2
2.109
Human melanoma
1.344
MDA-MB-231
1.464
Murine melanoma
1.331
B164A5
1.354
A375
1.344
B16-F0
1.304
MDA-MB-231
MDA-MB-231
3
Human carcinoma
2.443
HepG2
2.449
Human melanoma
1.832
A375
1.832
Murine melanoma
0.574
MCF7
1.466
B164A5
1.393
B16-F0
1.343
The maximum score is 3. The higher the scores, the higher the relationship. MSP, main spectrum.
This was also illustrated as an MSP-based dendrogram (Fig. 9). A score value of 3 represents the maximum similarity, while 0 represent no relatedness. For individual MSP, the results revealed a similarity between the same types of cancer. The only exception was the A375 MSP, where the second hit was a breast carcinoma. In the MSP dendrogram however, all three human carcinoma cell lines (HepG2, MDA-MB-231 and MCF7) were grouped in a separate subclade from human carcinomaA375 cells. The murinemelanoma cell lines (B164A5 and B16-F0) formed separate clades from all the human cells. In addition, within the human carcinoma clades, the distance level that separated the breast carcinoma cell line (MDA-MB-231) and the liver carcinoma cell line (HepG2) was shorter than the distance level between the two breast carcinoma cell lines (MCF7 and MDA-MB-231).
Figure 9
The MSP dendrogram. 1, A375 (human melanoma); 2, HepG2 (human liver carcinoma); 3, B16-F0 (murine melanoma); 4, B164A5 (murine melanoma); 5, MCF7 (human breast carcinoma); and 6, MDA-MB-231 (human breast carcinoma). MSP, main spectra.
Discussion
MALDI mass spectrometry was firstly announced in the late 1980's by Hillenkamp and Karas (14) who developed a novel ionization technique that allowed the characterization of large biomolecules as proteins by mass spectrophotometer (15). Further improvements of the initial method propelled MALDI-TOF MS into the top of renowned tools for proteomic research. This technique is characterized by a high sensitivity, an ease of operation and automated capabilities, and can be used for the analysis of small organic or large biochemical compounds obtained from various sources (15,16). Apart from its great usability in microbial identification, MALDI-TOF MS has proven to be a valuable cancer diagnostic tool, being applied to detect biomarkers for a plethora of cancer types, such as gastrointestinal cancer, lung cancer, renal and bladder cancer, ovarian cancer, leukemia (17), melanoma (18), breast carcinoma (19) and hepatic carcinoma (20).The present study aimed to use the established Bruker Biotyper protocol with minimum modifications for the classification and differentiation of several human and murinecancer cell lines. The choice of HCCA matrix was also supported by evidence from a previous study on cell lines (6). The sample preparation process was very simple, rapid and inexpensive, and it was found that using 70% formic acid at a ratio of 20 µl for 1×106 cells, provided the optimal spectra quality. Following the addition of acetonitrile, the final ratio was 0.025×106 cells/ml. A higher concentration of cells led to ion suppression, which resulted in an increased noise level and fewer assigned peaks, a fact previously reported by Povey et al (8). The formic acid extraction requires only 2 min, a method that is faster compared with a previous report by Povey et al (8), where a 4-h-long incubation with the matrix at 3°C was used. Analysis can therefore be performed using a reduced number of cells (250,000 cells). The analysis was performed in manual mode and the laser intensity was adjusted manually to obtain the highest signal-to-noise ratio. The method proved to have a very good shot-to-shot and spot-to-spot reproducibility correlated to the new application of this method in personalized medicine and tumor heterogeneity observation (21).The most challenging step in this study was to establish a spectra processing method able to transform raw spectra (Fig. 10) into peak lists that, once processed using PCA algorithm, provided a realistic classification of the cell lines. This process involved three steps: smoothing, baseline subtraction and peak picking. For both smoothing and baseline subtraction the Bruker standard parameters were used. However, for peak picking the 'local maximum' method was used. This method is based on using the local maxima within a defined window width (in our case 25 kDa) rather than the differentiation of the entire spectrum as used in the standard Biotyper method. This is motivated by the fact that the noise has different intensities at different mass ranges. A maximum spectra normalization was used; in this approach the most intense peak from each spectrum is set to a value of 1 and the intensity of all other signals is calculated relative to this (22). The signal-to-noise threshold was set to 0.01. Thus, in order to be assigned, a peak should have a relative intensity of at least 1% of the highest peak. By combining all these parameters, we were able to eliminate most of the noise and artefacts from raw spectra which otherwise resulted in unrealistic classification of the cell lines.
Figure 10
Raw spectra of the cancer cells: view that displays the entire acquisition range of 5,000–20,000 m/z.
The MALDI mass spectra consist of a type of multivariate data, with each mass signal defining one molecular dimension, and the process of evaluating mass spectra requires multivariate statistical methods that grant the differentiation of samples (16). A multivariate statistical method used to analyze MALDI spectra is the PCA. The principle of this method is represented by the extraction of variance within a data set, reducing the dimensionality of the data set while retaining the information present in the original data set what leads to spectra with similar variation characteristics that can be clustered together and the differences between samples groups can be easily visualized in the system (16).PCA analysis could be applied successfully on cancer cells by using their group spectra for the differentiation of pathology with a high sensitivity and selectivity (23). We began by performing PCA analysis using the whole mass range of 5,000–20,000 m/z. As in the MSP classification, this mass range proved to comprise too many groups of variables which contain redundant information and interfere with a realistic classification. By selecting the m/z interval of 9,500–12,500, a minimized profile was created which retains more meaningful peaks, and at the same time eliminates peaks with similar behavior. The restricted mass range also decreased the number of contributory principle components to three (Fig. 7).Since the PC3 contribution to the dataset variance is low, the experiment could disregard it and take only the PC1/PC2 two-dimensional clustering into consideration, thus reducing the dataset to an even lower dimensional space. The PC1/PC2 showed a closer clustering of the three human carcinomas and of the two murinemelanomas. The humanmelanoma is plotted at a significant distance from both murinemelanoma and human carcinoma groups (Fig. 5).MALDI-imaging mass spectrometry (IMS) is also used with PCA analysis in a similar manner as it was used in our study (24). In MALDI-IMS, the PCA is used to classify cells in a single tissue section, while in our study it was used to classify different cell lines (25). The present approach can be used for cancer studies in combination with MALDI-IMS. The combined proteomic profiles from these two approaches can be compared revealing further information about tumor complexity.PCA analysis has been used in conjunction with Raman spectroscopy in a cancer study by Bodanese et al (23) for the successful discrimination of basal cell carcinoma and melanoma from normal skin biopsies. Raman spectroscopy is a spectroscopic technique which typically uses electromagnetic radiation emitted by a sample when treated with a laser (26). The major difference from MALDI-TOF MS is that Raman microscopy does not use a laser to ablate the proteins from the target. Although Raman spectroscopy has proven to be a valuable technique, MALDI-TOF MS is a more widely established as a routine technique in many laboratories (27). It does not require complex sample preparation and requires minimal training, particularly after the technique was standardized and validated (5).MSP are used as an identification basis by the MALDI Biotyper software. MSP are a reference spectra which contain one or several peak lists assigned to a bacteria species or strain. In the Biotyper identification workflow, one MSP should contain a certain number of peaks. This will provide an unambiguous identification of an unknown organism when its mass spectrum is compared to the reference MSP stored in a database. The software generates scoring values between 0 and 3. The higher the score value, the higher the degree of similarity between the sample and the reference MSP. Apart from identification, the generated MSPs can be used for sample classification instead of raw spectra. We decided to use this system to compare the obtained scores to the PCA classification. In our case, the cumulated spectra (10 individual spectra of 10,000 laser shots) were used for creating two types of MSPs: individual MSPs which contain the peak list for one single cell line and three group MSPs which contained the cumulated peak list of the three main groups of cancers cells: humanmelanoma (A375), human carcinoma (MDA-MB-231, HepG2 and MCF7) and murinemelanoma (B164A5 and B16-F0) cells. In this study, a combination of parameters was tested and the optimum was used for MSP creation. The use of the whole mass range of 5,000–20,000 m/z, did not provide a realistic classification according to the type of cancer and the originating species. Therefore, several mass ranges were selected, as well as different numbers of peaks to be included in the MSP. A number of ten peaks from the 9,500–12,500 m/z interval proved to be more appropriate in providing realistic results.When compared to the group MSPs, all cell lines were correctly attributed to their corresponding group (Table I). The differences between the first and second hit were substantial providing an unambiguous classification.The MSP-based dendrogram (Fig. 9) revealed similar results to the PCA-based one, proving consistency between the two approaches. These results are in agreement with those described by Geiger et al (2012) in a previous study in the way that species relationship prevails to organ relationship, data obtained by using an LTQ-Orbitrap family mass spectrometer with a 'high field' Orbitrap mass analyzer (28). In the present study, the species-based relationship was stronger than the relationship between different types of cancer from which the cells originated, a fact that was revealed for the first time by the means of the MALDI-TOF MS technique. In addition, the restricted mass range of 9,500–12,500 m/z was not previously reported.The results revealed a closer relationship between HepG2 (liver carcinoma) and MDA-MB-231 (breast carcinoma) than between the two breast carcinomas cell lines. This can be explained by a high degree of similarity between the proteomic profiles of the different humancancer cell lines, a fact previously reported (28). Another hypothesis for this similarity is the adaptation of the cells to in vitro culture which could lead to a clone selection based on their proliferative potential with the loss of the functions that are not vital (29). Although there were similarities between the HepG2 and MDA-MB-231 cell lines, the MSP-based scoring system revealed that the protein expression was clearly distinct between the cell lines.The distance level between the MCF7 and MDA-MB-231 cell lines revealed by MSP analysis could be explained by the fact that, even though these cell lines have the same origin (pleural effusions of metastatic mammary patients), the MCF7 cell line (non-invasive cells) expresses specific markers for luminal epithelial phenotype and is employed as a model for estrogen receptor-positive tumors, whereas the MDA-MB-231 cell line (invasive cells) expresses mesenchymal markers (vimentin) and is used for estrogen receptor-negative breast cancers (30). Moreover, there were observed significant differences between the MCF7 and MDA-MB-231 proteomic profiles (31).In our previous study, it was shown that there are differences between B16 murinemelanoma cell sublines as regard molecular profile and proliferation behavior, B164A5 being described as a highly metastatic cell line (32). Moreover, the humanmelanoma cell line, A375, proved to be a valuable asset in inducing reproducible animal models with primary melanoma (33). The classification of melanoma cells in different clades could be explained by the fact that they originate from different species (human and murine), and in addition, A375 cells are an amelanotic primary melanoma cell line and their metastatic potential is lower compared to that of the murinemelanoma cells, B164A5 and B16-F0 (32,33).The differences between the cancer cell lines tested observed by applying MALDI-TOF MS assert the data that were presented after microscopic evaluation in terms of shape and invasive capacity.In conclusion, although different, the two statistical approaches (PCA and MSP) provided similar results. The MSP-based approach can be used for building groups which contain one cancer type or one cancer type from a given species. We were able to differentiate the cells according to cancer type (carcinoma versus melanoma) and the same cancer type originating from two different species (human versus micemelanomas). This proves that the MSP approach can be used not only to identify unknown samples, but also for their classification.The results highlighted the necessity to establish minimized profiles comprising a restricted mass range. The MALDI-TOF acquisition method could also be improved to obtain maximum resolution for the newly establish mass range of 9,500–12,500 m/z. We do not exclude the possibility of other discriminative peaks outside this range. However, this could not be achieved using Biotyper software which does not allow direct peak list editing.
Authors: Giuseppe Corona; Elisa De Lorenzo; Caterina Elia; Maria Paola Simula; Claudio Avellini; Umberto Baccarani; Francesco Lupo; Claudio Tiribelli; Alfonso Colombatti; Giuseppe Toffoli Journal: Int J Oncol Date: 2010-01 Impact factor: 5.650