Literature DB >> 16626496

Transcription profiles of non-immortalized breast cancer cell lines.

Mariana Fernandez-Cobo1, James F Holland, Beatriz G T Pogo.   

Abstract

BACKGROUND: Searches for differentially expressed genes in tumours have made extensive use of array technology. Most samples have been obtained from tumour biopsies or from established tumour-derived cell lines. Here we compare cultures of non-immortalized breast cancer cells, normal non-immortalized breast cells and immortalized normal and breast cancer cells to identify which elements of a defined set of well-known cancer-related genes are differentially expressed.
METHODS: Cultures of cells from pleural effusions or ascitic fluids from breast cancer patients (MSSMs) were used in addition to commercially-available normal breast epithelial cells (HMECs), established breast cancer cell lines (T-est) and established normal breast cells (N-est). The Atlas Human Cancer 1.2 cDNA expression array was employed. The data obtained were analysed using widely-available statistical and clustering software and further validated through real-time PCR.
RESULTS: According to Significance Analysis of Microarray (SAM) and AtlasImage software, 48 genes differed at least 2-fold in adjusted intensities between HMECs and MSSMs (p < 0.01). Some of these genes have already been directly linked with breast cancer, metastasis and malignant progression, whilst others encode receptors linked to signal transduction pathways or are otherwise related to cell proliferation. Fifty genes showed at least a 2.5-fold difference between MSSMs and T-est cells according to AtlasImage, 2-fold according to SAM. Most of these classified as genes related to metabolism and cell communication.
CONCLUSION: The expression profiles of 1176 genes were determined in finite life-span cultures of metastatic breast cancer cells and of normal breast cells. Significant differences were detected between the finite life-span breast cancer cell cultures and the established breast cancer cell lines. These data suggest caution in extrapolating information from established lines for application to clinical cancer research.

Entities:  

Mesh:

Year:  2006        PMID: 16626496      PMCID: PMC1524972          DOI: 10.1186/1471-2407-6-99

Source DB:  PubMed          Journal:  BMC Cancer        ISSN: 1471-2407            Impact factor:   4.430


Background

The search for differentially expressed genes in tumours has made extensive use of array technology. Most studies have involved tumour biopsy samples or established tumour-derived cell lines [1]. Differentially expressed genes may help to identify tumours with high metastatic potential and pathways that might be therapeutic targets. As noted by Dairkee et al. [2], one drawback of using established cell lines is that the process of immortalization in vitro can bias the expression profile when compared to native tissues. Another problem is that a number of widely-used cell lines have sub-lines that differ in their karyotypes and RNA expression levels [3-5]. Biopsies from tumours, unless they are microdissected, contain heterogeneous cell types. A molecular signature (or expression profile) of the whole tumour can be misleading since it is a composite of all the cells, normal and malignant, present in it. Although tumours may consist of several populations of cells that deviate slightly from the clonal strain of origin, metastases may involve only the subline(s) with specific genetic changes that favour metastatic behaviour. Cultures from freshly-derived ascites or pleural effusions may therefore be relatively homogeneous before they acquire the further mutations that immortalize cell lines with unlimited growth potential. In this communication we report which genes, within a defined set of well-known cancer-related genes, are differentially expressed in freshly-derived breast cancer cell cultures compared with primary normal, established normal and breast cancer cultures. We compared expression patterns in cultures of finite life-span cells from pleural effusions or ascites of breast cancer patients (Mount Sinai School of Medicine: MSSM) with commercially-available finite life-span normal breast epithelial cells (HMECs). To ensure comparability, cultures were used at similar passage numbers and the karyotypes were analyzed. We also compared the expression profiles of some of the most widely-used established breast cancer cell lines and three putatively normal established breast cell lines.

Methods

Cultures of finite life-span cells

a): HMECs: Different batches of normal human mammary epithelial cells cultures were obtained (Biowhittaker Molecular Applications Inc., NJ). HMEC specimens were enumerated serially and maintained with MEGM (Clonetics, MD) supplemented with 2.5% FCS (Gibco Invitrogen, CA). b): MSSMs: Breast cancer cells obtained from ascitic fluids or pleural effusions of patients with breast cancer were seeded and maintained in our laboratory. They were designated MSSM 3 through MSSM 14 and maintained in MEGM supplemented with 2.5% of the corresponding original fluid when available and 2.5% FCS (Gibco Invitrogen, CA). The full characterization of these cultures will be published elsewhere (manuscript in preparation). HMECs were passaged only twice or thrice in our laboratory. MSSMs were harvested at the 5th or 6th passage after initial plating.

Cultures of established cell lines

a): Normal-established (N-est.): the cell lines used were derived from normal tissues: MTSV1-7 (a gift from J. Taylor-Papadimitriou), MCF10A (from R Mira y Lopez) and MCF10F (ATCC, MD). b): Tumour-established (T-est.): cell lines established from breast cancer pleural effusions or tumour tissues: MDA-MB231, MDA-MB453, MDA-MB468, two strains of MCF 7 (designated MCF 7N and MCF 7P), T47D, BT20 and BT474 (all from ATCC, MD). They were maintained in medium and supplements as recommended by ATCC. All cell cultures were harvested 48–72 h after plating at about 80% confluence. Sources and markers are given in Table 1.
Table 1

Cell origin and RNA expression of some frequently used markers

cell lineorigin (a)KT14KT18VimERERBB2ERBB3MGB1MGB2
HMEC #1mammary epithelial+++++/-+++/-+
HMEC #2mammary epithelial+++NDNDNDNDND
HMEC #3mammary epithelialNDNDND-++-+
HMEC #6mammary epithelial++++-++-+
HMEC #7mammary epithelial+++++/-++-+
HMEC #8mammary epithelialNDNDND+/-++++
HMEC #9mammary epithelial+++++++++
HMEC #10mammary epithelial+++++/-++++
HMEC #11mammary epithelial+++++++-+
HMEC #12mammary epithelial++++++-+
HMEC #13mammary epithelial+++-++-+
MSSM 3ascites-++++++/---
MSSM 4pleural effusion-+++-+--+
MSSM 5pleural effusion-++++++/--+
MSSM 6pleural effusion-++++/-++-+
MSSM 7pleural effusion++++-+-+-
MSSM 8pleural effusion-++++/-++++
MSSM 9ascites-++++/-+-++
MSSM 10pleural effusion-+++-++/-++
MSSM 11pleural effusion-++++/-++++
MSSM 14ascites-++++/-+-++
MTSV1-7from milk-+++-++/---
MCF 10Amammary gland, adherent+++++/-++++
MCF 10Fmammary gland, floating+++/-+++-+
MDA-MB 231pleural effusion-+++-++-+
MDA-MB 453effusion-++-+ (c)+++
MDA-MB 468pleural effusion+++-+++-
MCF 7 Npleural effusion-++/-+++-+
MCF 7 Ppleural effusion-+++++++
T47Dpleural effusion-+-+++++
BT20prim tumor+++/-+ (b)++++
BT474invasive ductal carcNDNDND-+ND++

Determination of the presence of mRNA for the KT and VIM markers shows the results of the array experiment. Determination for the receptors was done in triplicate by RT-PCR. +/-: 1/3 show the specific product of the PCR, -: none of the three, +: all of them, ND: not done.

(a) HMECs and established cell lines according to the providers (Clonetics, ATCC).

(b) It is known that BT20 transcribes an mRNA for this gene with a deletion in the fifth exon and hence the cell does not express the protein (ATCC).

(c) MDA-MB 453 showed a 27.2 fold up-regulation when compared to the HMECs (29.3 with MCF 10A), consistent with its amplified ERBB2 gene.

Arrays

The Atlas Human Cancer 1.2 cDNA expression array (Clontech, CA) is a nylon membrane printed with 200–600 bp fragments of 1176 characterized genes involved in cancer, 9 housekeeping genes and 6 negative controls. RNAs were extracted and labelled with the Atlas pure total RNA labelling system and hybridized according to the manufacturer's instructions. All the cell lines used for the arrays (9 HMECs, 10 MSSMs, 3 N-est and 7 T-est) were probed twice in separate assays. The accuracy of the duplicates was assessed by Pearson's correlation coefficient based on the adjusted intensities of all the genes spotted on the membrane, which ranged from 0.93 to 0.99. Hybridizations with 30 μg of total RNA were performed according to the manufacturer's instructions. The hybridized membranes were exposed to a phosphorimager screen and were read at 100 μm resolution in a Storm Phosphorimaging system (Molecular Dynamics, CA). The scanned images were transformed to TIFF files with a PC bit order and then aligned and analyzed using AtlasImage 2.01 software (Clontech, CA). To average or compare the samples, the adjusted intensity signal was normalized using the global normalization mode featured in the software. We report only (a) those genes with significant (p < 0.01) differential expression of 2-fold or more, and (b) genes that were undefined for all the cell lines belonging to one type of sample, but were detected in other types with a difference at least equivalent to one background (540 units in intensity). (Undefined genes are those for which the intensity was below the signal threshold). The AtlasImage software compares only two samples at a time. When we used it to determine the differences between cell classes, we first averaged the cell lines in the four classes (HMEC, MSSM, N-est and T-est) and then performed the comparisons as instructed by the manufacturer. Statistical analyses (correlation and two-tailed p values) were performed using Microsoft Excel 2000. Further analyses were performed with Significance Analysis of Microarrays (SAM) [6], Prediction Analysis for Microarrays (PAM) (Stanford Univ., USA), FatiGO [7], Pomelo tool and SVM (Bioinformatics unit, CNIO, Spain [8]) and GoMiner [9].

Quantitative real-time PCR (Q-PCR)

To validate the results of the cDNA array experiments, some of the genes found to be differentially expressed were further examined by real-time PCR in 10 HMECs, 9 MSSMs, 3 N-est and 8 T-est cell lines. Five μg of total RNA (corresponding to about 100 ng of mRNA) were reverse-transcribed with oligo(dT) (SuperScript II system, Invitrogen, CA) in a 20 μl reaction volume, and after 125-fold dilution, 1.25 μl were used for a 40-cycle PCR on an ABI Prism 7900 thermal cycler. The reaction was carried out in a 384-well plate with a QuantiTect SYBR Green PCR kit (Qiagen Inc, CA) at an annealing temperature of 63°C and detection at 2–5°C below the Tm of the product as determined from its dissociation curve. Product size was confirmed by agarose gel electrophoresis. The efficiency of each pair of primers for amplification was determined and expression of each gene relative to G3PDH was assessed by the program Qgene [10]. Primers were designed using the program PrimerQuest or Primer3, unless otherwise stated. Primer sequences, lengths and Tms of the products are given in Additional file 4. Samples were tested twice in triplicate. Pearson correlation coefficients for the duplicate Q-PCR results ranged between 0.89 and 0.99.

Results

Overall gene expression and class prediction

All the MSSM cultures displayed similar growth rates and had no or minimal chromosomal changes (data not shown). Table 1 shows the main characteristics of these cells. Each cell line was probed twice. Averages of the duplicates after normalization of the adjusted intensities (as described in the Atlas manual and briefly in Materials and Methods) were used to obtain the expression values for further analysis. The overall gene expression profile, as determined by Pearson's correlation coefficient, discriminates between HMECs and cells derived from metastasis of breast cancer (MSSMs). As seen in Table 2 (and detailed in Additional file 1) the correlation within cell classes was > 0.90; the correlation between cell classes was < 0.82.
Table 2

Correlation between cell lines.

medianrange
IntragroupHMECs0.910.83–0.96
MSSMs0.940.81–0.97
Normal-est0.820.80–0.88
Tumour-est0.850.78–0.95
IntergroupHMECs vs MSSMs0.740.61–0.82
HMECs vs Normal-est0.850.69–0.93
HMECs vs Tumour-est0.770.67–0.85
MSSMs vs Normal-est0.770.68–0.87
MSSMs vs Tumour-est0.750.64–0.84
Normal-est vs Tumour-est0.820.77–0.88

Significance of HMECs or MSSMs to HMECs vs MSSMs by Wilcoxon-Mann-Whitney test two-sided, p value = p < 0.0001

Hybridization signals of 928 genes (78.9%) were represented in at least three cell lines. (One class, N-est, comprised three cell lines; the other three classes, HMEC, MSSM and T-est, comprised more.) We used this subset of 928 genes, plus the 9 control genes, to construct a classification model using two prediction programs: PAM and SVM. Since these algorithms do not work well when the numbers of members differ among classes, we trained the SVM program using comparable numbers of lines from three classes (8 HMECs, 9 MSSMs and 7 T-est) and treated the three N-est cell lines and the remaining HMECs and MSSMs as unknowns to be classified. The model thus generated had a classification accuracy of 100% (24/24) by the leave-one-out cross-validation method. The prediction for the three N-est lines was T-est, suggesting that these cells have an expression profile resembling those of the other established cell lines rather than the cultures of finite life-span cells. The remaining HMEC and MSSM cell cultures were correctly assigned to their respective classes (see Additional file 5). The same analysis with PAM, applying a threshold of 3.5, which gives the maximum number of significant genes yielding no misclassification error upon cross-validation, predicted MTSV1-7 and MCF 10F as members of the "T-est" class and MCF 10A as an HMEC-class member with a probability of 1. Again, HMEC 13 and MSSM 14 were properly assigned (Fig 1 and Additional file 2). Further to elucidate the classification of MCF10A, which is the most widely-used putatively normal breast cell line in array analysis, we examined the expression values of KT14, KT8/18 and KT19 (Fig. 2A and 2B) and CD104. The MCF10A values were similar to those of the HMECs.
Figure 1

PAM (Prediction Analysis of Microarrays). This is a statistical technique for class prediction from gene expression data using nearest shrunken centroids. This method identifies the subsets of genes that best characterize each class. a) Plot of the shrunkencentroids for the specified threshold with the ranked list of significant genes that are used for the predictions. b) Prediction and probabilities of the test cell lines (MTSV1-7, MCF10A, MCF10F, HMEC 13 and MSSM 14).

Figure 2

Expression of KT14, KT18 and KT19. a) Plot showsthe correlation of the adjusted intensity (AI) signal of the pair KT8/KT18 for each cell line. b) AI of KT14, KT18 and KT19 of each cell line and its relative percentage.

Comparison of non-immortalized normal and cancer breast cells

Of the 1176 genes in the array, 862 (73.3%) gave hybridization signals in at least three of the finite life-span cell lines used. Of these, 123 (14.3%) showed differential expression when HMEC and MSSM cells were analyzed using the AtlasImage software (ratio >2 and difference in adjusted intensity > 540), and 101 were deemed significant genes with SAM (q-value: 0.7298, median #FDR: 0.73718) (Additional file 6). Of these 123 from Atlas and 101 from SAM, 75 genes were differentially expressed with at least a 2-fold change according to SAM and AtlasImage, a minimum difference in intensity of 540 units and a p-value < 0.01 (Excel). These two programs use different algorithms to calculate the ratio (AtlasImage) and fold (SAM) values, so the outputs are not exactly the same. Fig. 3 shows the relative expressions of these 75 genes in all the cell lines analyzed by the arrays using the software Pomelo Tool (FDR and p-values for each gene in Additional file 7).
Figure 3

Relative expression of all the 29 cell linestested by Atlas arrays. Output of the Pomelo tool showing the log2 of the relative expression of the selected genes in each cell line to the average expression of all the cell lines. The 75 genes shown here were considered differentially expressed by SAM and by AtlasImage, with at least a 2-fold change for both programs, when we compared the HMEC and MSSM cell classes.

Some genes showed a broad range of expression values among cell lines belonging to the same group: one or two individual cell lines over-expressed the gene (>5 fold the average of the cell class), while others in the same class gave no signal or were barely above background. Since we were looking for genes that could enable us to differentiate among types of cell lines, and hence be useful as markers for each class, we inspected the values of these genes manually. The aim was to exclude from the 75 genes in Fig 3 any that gave extreme values in 2 or more cell lines within a class; "extreme values" were those that lay in the range of the other class, or biased the average of the class in question to generate the required 2-fold change. Twenty seven up-regulated and 21 down-regulated genes remained, in addition to the cytokeratins, which are not included in Tables 3 and 4 but shown in Fig 2. Their distribution according to GO (Gene Ontology) terms by FatiGO at level 3 of biological processes is shown. These changes in gene expression stress the importance in the malignant process of both a diminished capacity for cell-cell adhesion and the remodelling of the extracellular matrix (ECM). For example: genes involved in adhesion and downstream signalling (int α4/α6, P-cadherin (CDH3), γ catenin (JUP)) or inhibitors of remodelling (SPINT-2) were down-regulated, and ECM remodelling enzymes (MMP11, TIMP2 and the cascade TIMP1-SPARC-TGFBI) were up-regulated, as shown in Tables 3 and 4 (SVG file and p-values of GOMiner in Additional files 3 and 8 respectively).
Table 3

Gene expression of MSSM cells in relation to HMEC cells Up regulated Genes in MSSM cells

SAMt-testAtlas
NameScore(d)Foldp valueRatioDiffProtein/geneGene ontology

IGFBP49.787257.602.15E-0747.465296insulin-like growth factor binding protein 4cell communicationcellular physiological processmetabolismmorphogenesisregulation of cellular process
FES8.917237.101.13E-1031.761292C-fes proto-oncogenecell communicationcellular physiological processmetabolism
LAMA47.8262159.922.95E-12131.733399laminin alpha 4 subunitcell communicationcellular physiological processregulation of cellular process
FCGRT7.46586.221.43E-035.183425IgG receptor FC large subunit P51response to stimulus
FGFR17.32441925.867.01E-12Up2368N-sam; fibroblast growth factor receptor1cell communicationmetabolismmorphogenesis
MMP116.74594.794.30E-084.6729606matrix metalloproteinase 11; stromelysin 3metabolismmorphogenesis
TIMP16.31004.223.66E-074.1530855metalloproteinase inhibitor 1cellular physiological processmetabolismregulation of cellular process
TIMP26.31004.226.14E-045.641713tissue inhibitor of metalloproteinases 2cellular physiological process
PTPRS6.218511.575.03E-0512.56705protein tyrosine phosphatase sigmacell communicationmetabolism
TGFBI6.01303.991.71E-064.056759BIGH3cell communicationcellular physiological processregulation of cellular processresponse to stimulus
ITGB35.571171.886.47E-0957.561527integrin beta 3; CD61 antigencell communication
TCIRG15.349715.089.83E-0514.661257specific 116-kDa vacuolar proton pumpcellular physiological processregulation of cellular processresponse to stimulus
TNFRSF1A5.33922.972.29E-062.901875tumor necrosis factor receptor 1cell communicationcellular physiological processregulation of cellular process
MOX25.174642.043.61E-0539.85777OX2 membrane glycoproteinGen with GO but NOT at level 3 biological process
TP53I34.95206.538.84E-055.731225PIG3Gen with GO but NOT at level 3 biological process
BDNF4.890224.157.48E-0521.05762brain-derived neurotrophic factormorphogenesis
TYRO34.36613.239.88E-032.931425tyrosine-protein kinase receptor tyro3 precursor; rse; sky; dtkcell communicationmetabolism
CTRL4.216625.914.47E-0519.651063Chymotrypsin-like protease CTRL-1metabolism
SPARC4.12284.215.59E-044.302840secreted protein acidic and rich in cysteine; osteonectinmorphogenesis
GPR43.8917604.851.01E-05Up727probable G protein-coupled receptorcell communication
DAB23.86358.911.48E-058.361339mitogen-responsive phosphoprot. DOC2cellular physiological process
COL6A13.73204.573.96E-044.443145collagen 6 alpha 1 subunitcell communicationcellular physiological process
CDH23.7267910.061.16E-05Up1162N-cadherin; cadherin 2cell communication
CTGF3.719019.642.31E-0517.281091connective tissue growth factorcell communicationcellular physiological processmetabolismmorphogenesisregulation of cellular processresponse to stimulus
ITGA53.56463.492.47E-053.363815integrin alpha 5cell communicationregulation of cellular process
IGFBP53.491327.632.39E-0421.901045insulin-like growth factor binding protein 5cell communicationcellular physiological processmorphogenesis
TNFRSF10D3.27166.154.92E-045.251514decoy receptor 2cellular physiological process
Table 4

Gene expression of MSSM cells in relation to HMEC cells Down regulated Genes in MSSM cells.

SAMt-testAtlas
NameScore(d)Foldp valueRatioDiffProtein/geneGene ontology

SFN-18.5106-134.412.71E-08-76.89-1487514-3-3 protein sigma; stratifincell communicationcellular physiological processmetabolism
SPINT2-14.6598-37.304.10E-07-39.72-5653bikunin; hepatocyte growth factor activator inhibitor 2cellular physiological process
LAMB2-11.3177-12.538.94E-06-12.09-5510laminin beta 2 subunit precursorcell communication
ITGA7-11.0881-13.033.24E-05-12.06-12577integrin alpha 7B precursorcell communicationcellular physiological processmorphogenesis
CDH3-8.9105-12.113.62E-05-15.23-4283cadherin 3; P-cadherincell communicationresponse to stimulus
PLAU-7.9173-10.372.18E-05-22.29-2512urokinase-type plasminogen activatorcell communicationcellular physiological processmetabolismresponse to stimulus
ITGB4-7.9148-22.092.86E-05-20.11-16722integrin beta 4; CD104 antigencell communication
TPBG-7.6578-5.422.34E-04-4.84-19225T4 oncofetal antigen precursorcell communicationcellular physiological process
RPS6KA1-7.5622-3.375.64E-08-3.21-3450ribosomal protein S6 kinase alpha1cell communicationmetabolism
EDG4-7.2225-4.696.71E-04-4.45-1208G protein-coupled receptorcell communication
IRF6-6.7854-120.194.26E-11137.08-1905interferon regulatory factor 6metabolism
LITAF-6.7240-3.731.49E-03-3.53-2311PIG7cell communicationmetabolismregulation of cellular process
SEMA3F-6.4547-3.551.90E-04-2.68-1510semaphorinGen with GO but NOT at level 3 biological process
NOTCH1-6.0498-2.957.96E-04-2.86-1883neurogenic locus notch protein homolog 1metabolismresponse to stimulus
ITGA6-5.93321052.636.90E-07Down-1273integrin alpha 6 precursorcell communicationcellular physiological process
DSP-5.90151190.488.57E-07Down-1463desmoplakin I & IImorphogenesis
CDA-5.4364-21.034.64E-07-18.79-3523cytidine deaminasemetabolism
JUP-5.3936-6.779.66E-07-6.47-6165junction plakoglobin; desmoplakin IIIcell communication
JAG2-5.2949-4.221.20E-03-4.30-1841jagged2cell communicationcellular physiological processmorphogenesisregulation of cellular processresponse to stimulus
SERPINB1-4.8852-6.111.38E-04-5.46-1960leukocyte elastase inhibitorGen with GO but NOT at level 3 biological process
BTG2-4.8636-24.343.84E-05-11.74-752btg protein precursor; NGF- inducible anti-proliferative protein PC3cellular physiological processmetabolismregulation of cellular processresponse to stimulus

Genes ordered according to their SAM scores and distribution of the genes according the GO terms using FatiGo

Comparison of immortal and non-immortal breast cancer cells

Fifty genes showed significant >2-fold (according to SAM software) and >2.5-fold (according to Atlas) differences between the MSSMs and the established breast cancer cell lines. Twenty-five were up-regulated and 25 down-regulated in the established breast cancer cell lines compared to the finite life-span cultures. Most of the differentially expressed genes can be classified under the GO terms "metabolism" and "cell communication". In particular: most of the down-regulated genes seem to be related to remodelling of the extracellular matrix, cell adhesion and receptor-linked signal transduction, while the up-regulated genes are related to general signal transduction pathways and cell proliferation (Tables 5 and 6).
Table 5

Gene expression of T-est cells in relation to MSSM cells 25 Significant Up-regulated Genes in T-est cells

SAMAtlas
NameScore(d)FoldRatioDiffProtein/geneGene ontology

SPINT29.364651.35065.4377668bikunin; hepatocyte growth factor activator inhibitor 2cellular physiological process
PSME17.78243.0343.6303419interferon-gamma IEF SSP 5111response to stimulus
SFN6.591737.39425.868395414-3-3 protein sigma; stratifincell communicationcellular physiological processmetabolism
JUP6.28874.7795.8254405junction plakoglobin; desmoplakin IIIcell communication
TFAP2C6.017689.861108.1331607transcription factor erf-1; AP2 gamma transc factorcell communicationmetabolism
SLC25A55.69142.0282.5124761ADP/ATP carrier proteincellular physiological process
GSTZ15.568311.70413.4131986glutathione transferase zeta 1metabolism
TBRG45.42622.3622.8593099cell cycle progression 2 protein (CPR2)Gen with GO but NOT at level 3 biological process
EDG45.34597.1698.1972044G protein-coupled receptor EDG4cell communication
RPS6KA15.08822.0302.5281933ribosomal S6 kinase 1cell communicationmetabolism
CTSD4.98963.4304.1206462cathepsin D precursormetabolism
HDGF4.82362.1122.5902185hepatoma-derived growth factorcell communicationcellular physiological process
PCTK14.59112.2022.6482444serine/threonine-protein kinase PCTAIRE 1cellular physiological processmetabolism
LITAF4.48753.3183.9232166PIG7cell communicationmetabolismregulation of cellular process
PTPN64.428915.93912.2181391protein-tyrosine phosphatase1Ccell communicationcellular physiological processmetabolism
RFC24.32352.4692.8002036replication factor C 40-kDa subunit (RFC40)cellular physiological processmetabolism
RARG4.31422.6403.0201085retinoic acid receptor gamma 1metabolism
ADRBK14.23313.9634.1211211G protein-coupled receptor kinase 2 (GRK2)cell communicationmetabolism
CKS1B4.21663.3704.3013341cyclin-dependent kinase regulatory subunit 1Gen with GO but NOT at level 3 biological process
RPS214.18562.2102.6742531ribosomal protein S21metabolism
AURKB4.11294.0525.1681980aurora- & IPL1-like protein kinase1; ARK2cellular physiological processmetabolism
PPP2R43.97813.4814.0682433phosphotyrosyl phosphatase activatormetabolism
FASN3.67312.9533.6154590fatty acid synthasemetabolism
GDF153.60085.0325.87810268macrophage inhibitory cytokine 1 (MIC1)Gen with GO but NOT at level 3 biological process
MAPK133.59035.3746.2281652stress-activated protein kinase 4 (SAPK4)cell communicationcellular physiological processmetabolismresponse to stimulus

Genes ordered according to their SAM scores and distribution of the genes according the GO terms using FatiGo

Table 6

Gene expression of T-est cells in relation to MSSM cells 25 Significant Down-regulated Genes in T-est cells

SAMAtlas
NameScore(d)FoldRatioDiff.Protein/geneGene ontology

FES-7.9514-1063.830Down-1081C-fes proto-oncogenecell communicationcellular physiological processmetabolism
VIM-7.0022-5.070-3.698-9614vimentinGen with GO but NOT at level 3biological process
TGFBI-6.6491-22.619-17.180-6844BIGH3cell communicationcellular physiological processregulation of cellular processresponse to stimulus
BMP1-6.5296-12.975-9.085-1609bone morphogenetic prot. 1+ procollagen C-proteinasemetabolism
MLH1-6.4271-9.043-6.664-1552mutL protein homolog; DNA mismatch repair proteincellular physiological processmetabolismresponse to stimulus
LAMA4-6.3501-24.913-19.264-2630laminin alpha 4 subunit precursorcell communication
MMP11-6.2022-6.609-5.019-24432matrix metalloproteinase 11; stromelysin 3metabolism
TIMP1-6.1061-6.355-4.793-26048metalloproteinase inhibitor 1cellular physiological processmetabolismregulation of cellular process
PLXNA3-5.9530-3.489-2.845-2334transmembrane protein sex precursorGen with GO but NOT at level 3biological process
TIMP2-5.1069-6.041-4.092-1274tissue inhibitor of metalloproteinases 2cellular physiological process
ITGB8-5.0533-12.188-8.712-2414integrin beta 8cell communication
ITGB3-4.8610-1282.051Down-1259integrin beta 3cell communication
SPARC-4.7998-2857.143Down-2997secreted protein acidic and rich in cysteine precursor; osteonectinphysiological process
FN1-4.5334-50.633-34.784-10608fibronectin precursorcell communicationcellular physiological processmetabolismresponse to stimulus
MMP9-4.5288-12.126-7.859-1365matrix metalloproteinase 9metabolism
HSPG2-4.3027-1149.425Down-1164heparan sulfate proteoglycancell communication
MMP14-4.0581-5.566-3.840-2454matrix metalloproteinase 14; MT- MMP1metabolism
ITGA5-3.9413-11.151-7.838-3836integrin alpha 5cell communication
MT3-3.9295-7.365-5.866-9946metallothionein-IIIcellular physiological processmetabolismregulation of cellular processresponse to stimulus
DAB2-3.8058-1190.476Down-1232mitogen-responsive phosphoprotein DOC2cellular physiological process
SERPINB2-3.6797-1818.182Down-1773placental plasminogen activator inhibitor 2 (PAI-2); urokinase inhibitorcellular physiological processregulation of cellular process
IGFBP6-3.6326-12.402-8.758-1668insulin-like growth factor binding protein 6 precursorcell communicationcellular physiological processregulation of cellular process
COL6A1-3.6304-8.586-6.311-2767collagen 6 alpha 1 subunit (COL6A1)cell communicationcellular physiological process
IL6-3.4586-10.628-7.933-2586interleukin-6 precursorcell communicationcellular physiological processregulation of cellular processresponse to stimulus
LIF-3.4166-5.400-3.940-1085leukemia inhibitory factor precursorcell communicationcellular physiological processregulation of cellular processresponse to stimulus

Genes ordered according to their SAM scores and distribution of the genes according the GO terms using FatiGo

Real-time PCR

To validate the differences seen in the arrays, some genes were tested by real time PCR. We selected several genes of which the expression was significantly increased or decreased, according to SAM and Atlas, for both the MSSM/HMEC and T-est/MSSM comparisons: 14-3-3σ (also called SFN), SPINT2, FES, SPARC, BIGH3 (or TGFBI), TIMP1, TIMP2, MMP11 and DAB2. Also, we analyzed some genes that were deemed significant for only one of the comparisons: NOTCH1, PLAU, CDA and SERPINB2 (or PAI2), and a few genes that were non-significant but somehow related to some of the aforementioned genes: tPA, PAI1, uPAR, DCK. It has been reported that correlation between Q-PCR and array data is highly variable [11]. It depends, in part, on the sensitivity of arrays in detecting genes with low expression levels or saturation due to very high ones. The genes we have tested showed correlations that ranged from 0.563 to 0.959. For the comparison between the normal (HMEC) and the tumour (MSSM) finite life-span cultures, the Q-PCR results for the manually curated genes (as explained above) supported the findings of the arrays in 17/17 (100%) of cases. In the T-est/MSSM evaluation, agreement between the two techniques was found in 15/17 (88.2%) of the cases (Fig. 4 and data available upon request).
Figure 4

Relative expression of some genes validated byQ-PCR. a) Mean normalized expression (MNE) of SPARC. b) MNE of BIGH3 = TGBFI. c) MNE of PAI-1. d) MNE of NOTCH1 in HMECs and MSSMs cells. e) Relative expression, of MSSMs and established cell lines, toMCF10A. Note the difference between the two MCF7s. f) MNE of CYR61. g) MNE of DAB-2. For the complete list of validated genes, see Table S1.

One of the genes excluded from the HMEC/MSSM comparison was CYR61, which was tested by Q-PCR. Even though the overall up-regulation value for this gene was consistent (SAM: 3.53, Atlas: 3.48, Q-PCR: 3.29), its expression levels crossed over to the values of the other cell class in 4/17 cell lines, thus invalidating CYR61 as a reliable marker by itself.

Discussion

Established breast cancer cell lines have been widely used to study signal transduction pathways, test new pharmaceuticals and determine expression profiles that might predict the metastatic capabilities of primary tumours. In many cases MCF10A has been chosen as the "normal" control, even though this cell line has been reported to possess markers for both myoepithelial and luminal phenotypes [12]. One of the controversies about the use of HMECs as controls is the probable myoepithelial origin of these cells [13] based on expression of KT14 and CD104 (ITGB4) [14,15]. The expression levels of these genes in the MCF10 cells were equivalent to those in the HMECs. Furthermore, the prediction of PAM, based on the 58 genes that discriminate between the different classes of cells used in this study, was that MCF10A was similar to HMECs. Hence, MCF10A cells would have the same limitation as controls as the HMECs. Nevertheless, HMEC cells have the advantage of not being immortalized and pooling them combines the genetic backgrounds of a large cohort. Both classes of non-immortalized cell lines, HMEC and MSSM, are more homogeneous than the established ones (N-est and T-est), as seen in Table 2 and Additional file 1. This is probably due to the cumulative effects of the mutations accrued individually by the established cell lines during successive passages and immortalization. As seen in Fig 3, the expression of some genes in a particular class is clearly different from the others, e.g. up-regulation of FES, MMP11, DAB2 and down-regulation of SPINT2, SFN, JUP for the MSSM cells lines. Others are more distinctive of a "state", e.g. distribution of cytokeratins and down-regulation of certain integrins (ITGA6/ITGB4, ITGA7) in all the tumour-derived cell lines. Therefore, the expression levels of these genes can be seen as specific attributes of certain classes. Their combined expression defines a profile that can be used to construct a model similar to those built by SVM and PAM for predicting the classification of an unknown cell line accurately, as shown in fig 1B. We tested only two sub-lines of MCF-7, well-known to be highly variable [3-5], and a single sub-line of each of the other established cell lines. We recognize that other sub-lines might be different. When the MSSMs are compared with the HMECs, several genes display differential expression in a mode consistent with previous publications, where they have been shown to be significant for malignant progression or metastasis: down-regulation of the tumour suppressor and inhibitor of mitotic phase entry 14-3-3σ (SFN) [16,17], the serine protease inhibitor SPINT2 [18] and JUP (γ-catenin) [19]; and up-regulation of FES [20] and SPARC [21]. In addition, there is the "cadherin switching" (CDH3 = P-cadherin to CDH2 = N-cadherin) and its relationship to FGFR1 and MMP9 [22,23]. SPARC expression was detected in 17/17 human breast tumour biopsies and to a lesser extent in some established cell lines [21]. It has also been associated with malignant progression and invasive potential in breast cancer [24,25], and together with MMP11 in colorectal [26] and oesophageal cancer [27]. Its over-expression increases motility and invasion [28] and induces growth inhibition [29] in established breast cancer cell lines. In addition, it has been shown to induce expression of BIGH3 and PAI-1[30]. MSSM cells showed up-regulation of SPARC, BIGH3, PAI1 and MMP11, while the T-est cells showed down-regulation of SPARC, BIGH3 and PAI1 (MDA-MB231 is an exception for the latter gene) (Fig 4A-B-C). We also found genes of which the behaviour did not fully agree with previous descriptions, i.e. NOTCH1, CYR61 and DAB2. The many and varied functions of Notch signalling, achieved through activation or down-regulation, have been recently reviewed [31]. In the MSSM samples, NOTCH1 and its ligand JAG2 are down-regulated. In this case, this pathway is more likely to function as a tumour suppressor than an oncoprotein. This conclusion would be less clear if we had only compared established cell lines (mainly MCF10A with both MCF7s) (Fig 4D–E). CYR61 is a pro-angiogenic, secreted protein encoded by a growth factor-inducible immediate-early gene. It is over-expressed in some invasive established breast cancer cell lines and in 30–36% of primary tumours [32,33]. In this study we found only three cell lines with a truly high over-expression (MSSM6, MDA-MB231 and BT20) and five with a moderate over-expression compared to the mean expression in HMECs (between 2 and 4 fold). The same conclusion would have applied if we had considered MCF10A as a control (Fig 4F). DAB2 is considered a tumour suppressor since its expression is down-regulated in ovarian carcinomas and in some established breast cancer cell lines [34], and up-regulated during megakaryocyte differentiation [35]. Its continued expression in tumour cells led to growth inhibition or cell death [36] unless the cells were in contact with some type of basement membrane [37]. MSSM cells showed up-regulation of this gene (Fig 4G), perhaps because they grew as an attached cell line or because of their finite life-span phenotype in vitro. Epithelial-mesenchymal transition (EMT) is considered a mechanism for carcinoma progression and metastasis, and the expression of vimentin (VIM) is its main marker. This view has now been extended to include whole pathways and a more complex relationship with the microenvironment of the cell [38,39]. Among the other genes regulated during EMT [40], the MSSM cells showed up-regulation of COL6A1, SPARC, CDH2 and DAB2 and down-regulation of JUP and BTG2 (Tables 3 and 4).

Conclusion

Using arrays, we have studied 10 finite lifespan breast cancer cell lines freshly isolated from metastatic pleural or peritoneal fluids, 9 finite lifespan normal breast cell lines, 7 established breast cancer cell lines, and 3 established normal breast cell lines. We tested 1176 genes considered to be related to cancer. Within each cell class there was significant homogeneity of gene expression. Two clusters of genes distinguished the MSSMs from the HMECs. These 48 genes, which were differentially expressed by at least 2 fold, concerned cell-cell interactions and remodelling of the extracellular matrix. Fifty genes that were differentially expressed at least 2 fold between MSSMs and established breast cancer cell lines are generally considered to be involved in cell communication and metabolism. Established breast cancer cell lines have been used to model biochemical and pharmacological responses in human breast cancer; the differences from freshly isolated breast cancer lines imply they are not wholly satisfactory models.

Abbreviations

ER = oestrogen receptor, FCS = foetal calf serum, FDR = false discovery rate, KT = keratin, MGB = mammoglobin, SVM = support vector machines, VIM = vimentin.

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

MFC designed and performed all the experimental work and drafted the paper. JFH provided fundamental advice on the clinical aspects of the project and participated in the final writing of the paper. BGTP has directed the entire project and participated in the final writing of the paper.

Pre-publication history

The pre-publication history for this paper can be accessed here:

Additional File 1

Fig S1.doc: Pearson's correlation coefficient between cell line expression profiles Click here for file

Additional File 2

Fig S2.doc: Additional Plots and list of significant genes generated by PAM. Click here for file

Additional File 3

Fig S3: SVG (Adobe scalable vector graphics). Go Miner DAG (directed acyclical graph) view of the changed genes within the GO hierarchy. Click here for file

Additional File 4

Table S1:.doc Primer sequences and PCR-product details. Click here for file

Additional File 5

Table S2:.doc: Support Vector Machines: Validations and predictions. Click here for file

Additional File 6

Table S3:.doc: SAM output. Click here for file

Additional File 7

Table S3:.doc: False discovery rates and p-values of the Pomelo Tool applied to the comparison HMEC-MSSM. Click here for file

Additional File 8

Table S5:.xls: p-values of Go Miner. Click here for file
  39 in total

1.  Re-expression of E-cadherin, alpha-catenin and beta-catenin, but not of gamma-catenin, in metastatic tissue from breast cancer patients [seecomments].

Authors:  I K Bukholm; J M Nesland; A L Børresen-Dale
Journal:  J Pathol       Date:  2000-01       Impact factor: 7.996

Review 2.  Expression of bone matrix proteins in human breast cancer: potential roles in microcalcification formation and in the genesis of bone metastases.

Authors:  A Bellahcène; V Castronovo
Journal:  Bull Cancer       Date:  1997-01       Impact factor: 1.276

3.  Expression of osteonectin mRNA in human breast tumours is inversely correlated with oestrogen receptor content.

Authors:  J D Graham; R L Balleine; J S Milliken; A M Bilous; C L Clarke
Journal:  Eur J Cancer       Date:  1997-09       Impact factor: 9.162

4.  Disabled-2 inactivation is an early step in ovarian tumorigenicity.

Authors:  Z Fazili; W Sun; S Mittelstaedt; C Cohen; X X Xu
Journal:  Oncogene       Date:  1999-05-20       Impact factor: 9.867

5.  Overexpression of stromelysin-3, BM-40/SPARC, and MET genes in human esophageal carcinoma: implications for prognosis.

Authors:  H Porte; J P Triboulet; L Kotelevets; F Carrat; S Prévot; B Nordlinger; Y DiGioia; A Wurtz; P Comoglio; C Gespach; E Chastre
Journal:  Clin Cancer Res       Date:  1998-06       Impact factor: 12.531

Review 6.  Cadherin switch in tumor progression.

Authors:  Rachel B Hazan; Rui Qiao; Rinat Keren; Ines Badano; Kimita Suyama
Journal:  Ann N Y Acad Sci       Date:  2004-04       Impact factor: 5.691

7.  Neoplastic progression of human colorectal cancer is associated with overexpression of the stromelysin-3 and BM-40/SPARC genes.

Authors:  H Porte; E Chastre; S Prevot; B Nordlinger; S Empereur; P Basset; P Chambon; C Gespach
Journal:  Int J Cancer       Date:  1995-02-20       Impact factor: 7.396

8.  Expression of basal and luminal epithelium-specific keratins in normal, benign, and malignant breast tissue.

Authors:  S H Dairkee; L Puett; A J Hackett
Journal:  J Natl Cancer Inst       Date:  1988-07-06       Impact factor: 13.506

9.  Distribution of SPARC in normal and neoplastic human tissue.

Authors:  P L Porter; E H Sage; T F Lane; S E Funk; A M Gown
Journal:  J Histochem Cytochem       Date:  1995-08       Impact factor: 2.479

10.  Genetic variability in MCF-7 sublines: evidence of rapid genomic and RNA expression profile modifications.

Authors:  Mélanie Nugoli; Paul Chuchana; Julie Vendrell; Béatrice Orsetti; Lisa Ursule; Catherine Nguyen; Daniel Birnbaum; Emmanuel J P Douzery; Pascale Cohen; Charles Theillet
Journal:  BMC Cancer       Date:  2003-04-24       Impact factor: 4.430

View more
  6 in total

1.  GILP family: a stress-responsive group of plant proteins containing a LITAF motif.

Authors:  C Cabreira-Cagliari; D G S Fagundes; N C F Dias; B Bohn; M Margis-Pinheiro; M H Bodanese-Zanettini; Alexandro Cagliari
Journal:  Funct Integr Genomics       Date:  2017-10-06       Impact factor: 3.410

2.  Transcription profile of a human breast cancer cell line expressing MMTV-like sequences.

Authors:  Mariana Fernandez-Cobo; Stella M Melana; James F Holland; Beatriz G T Pogo
Journal:  Infect Agent Cancer       Date:  2006-12-15       Impact factor: 2.965

3.  Characterization of interaction of magnetic nanoparticles with breast cancer cells.

Authors:  Macarena Calero; Michele Chiappi; Ana Lazaro-Carrillo; María José Rodríguez; Francisco Javier Chichón; Kieran Crosbie-Staunton; Adriele Prina-Mello; Yuri Volkov; Angeles Villanueva; José L Carrascosa
Journal:  J Nanobiotechnology       Date:  2015-02-26       Impact factor: 10.435

4.  LITAF is a potential tumor suppressor in pancreatic cancer.

Authors:  Yuan Zhou; Jing Huang; Xi Yu; Xin Jiang; Yaoyao Shi; Yuanyuan Weng; Yue Kuai; Lizhen Lei; Guoping Ren; Xiaowen Feng; Guoping Zhong; Qingmeng Liu; Hongyang Pan; Xinxia Zhang; Ren Zhou; Caide Lu
Journal:  Oncotarget       Date:  2017-12-14

5.  De-regulation of common housekeeping genes in hepatocellular carcinoma.

Authors:  Samuel Waxman; Elisa Wurmbach
Journal:  BMC Genomics       Date:  2007-07-18       Impact factor: 3.969

6.  Long-term exposure of MCF-12A normal human breast epithelial cells to ethanol induces epithelial mesenchymal transition and oncogenic features.

Authors:  Robert Gelfand; Dolores Vernet; Kevin Bruhn; Jaydutt Vadgama; Nestor F Gonzalez-Cadavid
Journal:  Int J Oncol       Date:  2016-03-29       Impact factor: 5.650

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.