Literature DB >> 19401702

Analysis of differential gene expression in colorectal cancer and stroma using fluorescence-activated cell sorting purification.

M J Smith1, A C Culhane, M Donovan, J C Coffey, B D Barry, M A Kelly, D G Higgins, J H Wang, W O Kirwan, T G Cotter, H P Redmond.   

Abstract

Tumour stroma gene expression in biopsy specimens may obscure the expression of tumour parenchyma, hampering the predictive power of microarrays. We aimed to assess the utility of fluorescence-activated cell sorting (FACS) for generating cell populations for gene expression analysis and to compare the gene expression of FACS-purified tumour parenchyma to that of whole tumour biopsies. Single cell suspensions were generated from colorectal tumour biopsies and tumour parenchyma was separated using FACS. Fluorescence-activated cell sorting allowed reliable estimation and purification of cell populations, generating parenchymal purity above 90%. RNA from FACS-purified and corresponding whole tumour biopsies was hybridised to Affymetrix oligonucleotide microarrays. Whole tumour and parenchymal samples demonstrated differential gene expression, with 289 genes significantly overexpressed in the whole tumour, many of which were consistent with stromal gene expression (e.g., COL6A3, COL1A2, POSTN, TIMP2). Genes characteristic of colorectal carcinoma were overexpressed in the FACS-purified cells (e.g., HOX2D and RHOB). We found FACS to be a robust method for generating samples for gene expression analysis, allowing simultaneous assessment of parenchymal and stromal compartments. Gross stromal contamination may affect the interpretation of cancer gene expression microarray experiments, with implications for hypotheses generation and the stability of expression signatures used for predicting clinical outcomes.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19401702      PMCID: PMC2694425          DOI: 10.1038/sj.bjc.6604931

Source DB:  PubMed          Journal:  Br J Cancer        ISSN: 0007-0920            Impact factor:   7.640


Gene expression microarray data have huge potential for cancer research and treatment. Gene expression profiles have been used to classify tumours (Bittner ; Ramaswamy ; Selaru ; Shipp ), study the biology of tumour progression and metastasis (Birkenkamp-Demtroder ; Ramaswamy ), predict clinical outcomes (van de Vijver ; Huang ; Iizuka ), classify drug resistance (Hofmann ; Chang ) and identify novel drug targets (Marton ). As the technology matures, it is pushing towards mainstream clinical application (Gershon, 2004; Jarvis and Centola, 2005; Cardoso ). Surmountable challenges remain, such as standardisation and validation of the competing platforms and analysis techniques. However, the heterogeneity of clinical tumour samples remains a fundamental problem which must be addressed (Winegarden, 2003). All cell subpopulations in a sample contribute to the gene expression profile. Relatively homogenous cell populations yield optimal expression data, and increasing ratios of stromal cells may obscure the gene expression of parenchymal cancer cells (Emmert-Buck ; Ross ; Butte, 2002; Sugiyama ). Stromal gene expression may cause misinterpretation of data, with important subtle cancer gene changes being masked by contaminating RNA, and increasing the potential for attributing incorrect functional gene associations (Smith ). However, interactions between stromal cells and tumour parenchyma are increasingly recognised as an important factor in tumour biology and clinical outcome, and the delineation and retention of stromal gene expression has considerable value (Fukino ; Patocs ). Laser capture microdissection (LCM) allows the selection of specific cells from tissue and could potentially circumvent many of these problems (Emmert-Buck ). Sugiyama examined the expression profile of LCM dissected tissue and whole tissue. They demonstrated significant differences in expression profiles, finding that the overall difference in the gene expression profile was related to levels of stromal contamination. However, LCM is a costly, laborious and highly skilled procedure which yields small quantities of RNA, which renders clinical application impractical (Liu, 2007). It has also been demonstrated by Michel that the LCM process introduces a systematic bias into gene expression profiles. Another problem encountered in the estimation of stromal content histologically, and therefore by both LCM and macrodissection, is the ‘reference trap’. A two-dimensional microscopic view of a complex three-dimensional structure, such as a tumour, leads to irreversible qualitative and quantitative loss of information (Nyengaard, 1999). This means that fractions of cells can be grossly under- or overestimated if unbiased sampling methods (i.e. stereological methods) are not used. Fluorescence cytometry (FC) and fluorescence-activated cell sorting (FACS) allow simultaneous quantitation and multiparametric assessment of the phenotype of cells by staining with fluorochrome-conjugated antibodies (Afanasyeva ). These are generally used to examine or sort peripheral blood samples (Waguri ), to identify tumour cells in malignant effusions, to isolate clones and infrequently for the separation of specific cell populations from solid tissue (Afanasyeva ; Suzuki ). Advances in laser technology, speed of sorting and range of fluorochromes, make FACS a potentially useful method for identifying and purifying cell populations from solid tumours for analysis. Fluorescence-activated cell sorting overcomes many of the problems associated with LCM and macrodissection, by allowing systematic sampling of a large number of parenchymal tumour cells, allowing confirmation of the purity of targets and potentially, a better average of gene expression in cancer cells. We aimed to evaluate the feasibility of using FACS for producing homogenous cell populations for gene expression microarray analysis of colorectal tumour samples. Specifically, we wished to compare the differences in gene expression profiles elicited from whole tissue and sorted cells.

Materials and methods

Colorectal carcinoma tissue

Colorectal carcinoma (CRC) tissue samples were obtained, with informed consent, from patients undergoing curative bowel resection (Table 1). All patients were Irish Caucasians. None of the patients received preoperative chemotherapy or radiotherapy. Three primary cancers were used to compare the gene expression of sorted cells and whole tumour samples, after optimisation of our FACS methodology. Collection of tissue was approved by the Clinical Research Ethics Committee of the Cork Teaching Hospitals.
Table 1

Microarray sample details

CEL file name Patient Sample type TNM classification Stage Location Sex Age (years)
A053-04-E1E1FACST4N0M1IVCaecumFemale68
A053-03-E6E6FACST2N0MxIRectosigmoidMale55
A053-05-E12E12FACST3N0MxIIASigmoidMale35
A053-06-E1WE1Whole tissueT4N0M1IVCaecumFemale68
A053-07-E6WE6Whole tissueT2N0MxIRectosigmoidMale55
A053-08-E12WE12Whole tissueT3N0MxIIASigmoidMale35

Generation of a single cell suspension from colorectal cancer tissue

Tumour samples were washed in Dulbecco's modified Eagle's medium (DMEM; BioWhittaker, Wokingham, UK) and macroscopic necrotic tissue was excised with a scalpel and washed in DMEM. A portion of the biopsy was immediately snap frozen in liquid nitrogen and stored at −80°C until RNA extraction and approximately 1 g of tumour tissue was mechanically disaggregated, and subsequently enzymatically digested with bovine collagenase II, IV (Sigma-Aldrich, Dublin, Ireland) and DNAse I (Roche, Clarecastle, Ireland) at concentrations of 2 and 1 mg ml−1, at room temperature (22°C) for 45–60 min. This mix was then filtered through 70 mm pore mesh (Becton Dickinson, Oxford, UK). All suspensions were generated under standardised environmental conditions, being kept at 4°C, except for the enzyme digestion stage.

Flow cytometry

Dissociated cells in suspension were incubated on ice with mouse anti-human epithelial antigen (HEA) monoclonal antibody (mAb) conjugated with FITC (clone BER-EP4; Dako, Glostrup, Denmark), anti-CD14 mAb conjugated with phycoerythrin (PE) and anti-CD45 conjugated with PerCP mAb (both from BD Pharmingen, Erembodegem-Aalst, Belgium) or relevant isotype controls, for 45 min. The labelled cells were analysed and separated using FACS Vantage with CellQuest Pro software (Becton Dickinson). Establishment of the gates was based on the staining profiles of the negative controls, positive controls (SW-620 cells, labelled with HEA) and to eliminate low forward scatter signal events, eliminating debris, red cells and apoptotic cells. The mAb BER-EP4 binds to a partially formol-resistant epitope on the protein moiety of two 34- and 39 kDa glycopolypeptides on human epithelial cells. It does not bind to any non-epithelial cells (Latza ). Specifically, it does not bind to mesenchymal or lymphoid tissue. However, in large cell populations antibodies can bind in a non-specific manner. To control for this, we blocked antibodies with 1% fetal calf serum and used isotype control antibodies as negative controls. There is also a possibility of immune cells expressing the HEA antigen after ingestion of apoptotic cells. As immune cell infiltrate is a large component of stromal tissue, we decided to use antibodies to allow us to quantify and negatively select immune cells to avoid contamination in our sorted epithelial fraction. Phagocyte numbers have been found to increase from 1.5- to 2.5-fold in Duke's B and C tumours, respectively, and T cells by 1.4-fold in colorectal tumours (Allen and Hogg, 1985, 1987). We decided that a combination of antibodies binding to CD14, which is the LPS receptor and is expressed strongly on the surface of monocytes, weakly on the surface of granulocytes and by most tissue macrophages, and CD45, a tyrosine phosphatase a critical requirement for T- and B-cell antigen receptor-mediated activation, which is expressed, typically at high levels, on all haematopoietic cells (expression is at a higher density on lymphocytes, approximately 10% of surface area is CD45), would be the ideal combination. It has previously been demonstrated that using positive selection of HEA-expressing cells and negative selection of CD45- or CD14-expressing cells yields using immunomagnetic cell sorting lead to high yields of epithelial cells from cell solutions (Zigeuner ; Guo ).

Cell sorting and confirmation of cell phenotype

A one-step, three-colour, sorting approach was used. Our goal was to positively select colorectal parenchyma and negatively select for stromal cells. A diagram illustrating the method is shown in Figure 1. Sorting gates were set for positive selection of HEA+ CD14CD45− and negative selection of HEA− CD14+ CD45+ cells. Unstained cells, cells stained with isotype controls, were used for all samples, and SW-620 cells were used as a positive control for CRC cells. At least 7 million HEA+ cells were sorted, as below this level we found RNA quantity was variable. Cells were sorted into BD polypropylene flow tubes coated with 4% bovine serum albumin (Sigma-Aldrich). Purity of the sample was checked after sorting by reanalysing HEA+ CD14CD45− fraction, on the same machine, after a full cleaning protocol. Purity greater than 90% was deemed acceptable.
Figure 1

This figure demonstrates our method for separating tumour stroma and parenchymal cells using fluorescence-activated cell sorting. Briefly, debris is gated out, target populations identified and positively (parenchyma) or negatively selected (stroma). Dot plots represent 10 000 events, and show side scatter and forward scatter plots in Step 1 and fluorescence plots in Steps 2 and 3. Cells have been simultaneously stained with mouse anti-human anti-HEA, anti-CD14 and anti-CD45 monoclonal antibodies conjugated with FITC, PE and PerCP, respectively, which are resolved on FL-1, FL-2 and FL-3. Step 1 demonstrates our method of gating out debris and identification of the HEA positive population with scatter plots. Step 2 demonstrates the identification of CD45 (A) and CD14 (B) positive populations, followed by gating of HEA+CD14 population (C) and gating of HEA+ CD14-CD45-population (D), which consists of 43% of cells. Step 3 is the identification and selection of the R3 region (HEA+ CD14- CD45- population) for cell sorting, and the estimation of presorting parenchymal content, which is estimated at only 43% of cells in this sample. After cell sorting the flow cytometer is cleaned, and the post-sorting populations evaluated. Step 4 shows a histogram demonstrating the post-sorting populations of cells. The histogram plots the fluorescence of HEA+ cells on FL-1 (region M2) against counts on the y-axis. There is clear separation of the tumour parenchyma (M2 region) and the stromal (M1 region) cells. We estimate the populations have a purity of over 90%.

After sorting, cells were confirmed to be colorectal tumour cells by microscopic examination, by cytospinning on to Superfrost Plus microscope slides (BDH Laboratory Supplies, Poole, UK) followed by ethanol fixation and staining with Rapi-Diff (Cytocolor, Hinckley, OH, USA) or immunocytochemistry. Immunocytochemistry was performed using Dako MNF-116 anti-pan-cytokeratin antibody, using the standard EnVision kit protocol (Dako). We have also applied this method to successfully purify tumour parenchyma in CRC liver metastases, primary breast tumours, and with modification, to sort breast cancer bone marrow micrometastases.

RNA isolation

RNA was extracted from three corresponding whole tumour samples and FACS-purified tumour parenchyma using a modification of the Tri Reagent (Molecular Research Center, Cincinnati, OH, USA) protocol (Curtin and Cotter, 2004). RNA with an absorbance ratio A260/240 >1.8 and no evidence of RNA degradation by gel electrophoresis was accepted. We then checked the RNA quality using the Agilent Bioanalyzer (Agilent, Santa Clara, CA, USA) runs. We used RNA with a RIN (RNA integrity number) value ⩾8 (Schroeder ).

cRNA preparation

The labelling of the total RNA was performed according to the ‘Small Sample Labeling Protocol vII’ (Affymetrix, Santa Clara, CA, USA). Total RNA (100 ng) was used as starting material for the first round of cDNA preparation. The first and second strand cDNA synthesis was performed using the Superscript II system (Invitrogen, Dublin, Ireland) according to the manufacturer's instructions except using an oligo-dT primer containing a T7 RNA polymerase promoter site. The first round of in vitro transcription (IVT) was performed using the MEGAscript T7 kit (Ambion, Warrington, UK). The second round of cDNA preparation was done as first round except now random hexamers replaced the oligo-dT primer. Labelled cRNA was prepared using the BioArray High Yield RNA Transcript Labeling Kit (Enzo, Farmingdale, NY, USA). Biotin labelled CTP and UTP (Enzo) were used in the reaction together with unlabelled NTPs. During the labelling, the IVT product and also the fragmented IVT product were checked by gel electrophoresis. Following the IVT reaction, the unincorporated nucleotides were removed using RNeasy columns (Qiagen, Crawley, UK).

Oligonucleotide array hybridisation and scanning

Fragmented cRNA was loaded onto the GeneChip HU133 Plus 2.0 probe array cartridge (Affymetrix). The washing and staining procedure was performed in the Affymetrix Fluidics Station 450 (Affymetrix). The biotinylated cRNA was stained with a streptavidin–PE conjugate, and the probe arrays were scanned at 560 nm using a confocal laser-scanning microscope (Affymetrix Scanner 3000; Affymetrix). After hybridisation and scanning, we checked several quality parameters: scaling factor ⩽3-fold difference within a study; 3′/5′ ratio for probe sets for GAPDH ⩽3; present (P) calls in the same range for all samples in the study and RawQ below 100. All of our arrays passed all stages of the quality control. The readings from the quantitative scanning were analysed by the Affymetrix Gene Expression Analysis Software (Affymetrix).

Statistical analysis of gene expression data

Affymetrix GeneChip array data were normalised, pre-processed and analysed using R and Bioconductor statistical software (Gentleman ). Raw CEL file data from human whole genome Affymetrix U133Plus2 gene GeneChips of purified ‘of whole primary’ colon cancer samples (n=3) and tumour parenchyma samples purified by FACS (n=3) were imported into R. Initial exploratory data analysis performed using the overview function in the package made4 (Culhane ) suggested that the assumption of a constant sum across all microarray samples may not be valid for these data. Moreover, there were significantly more MAS 5.0 P calls in the whole samples than in the FACS-purified samples (paired t-test, P<0.05). Therefore, data were normalised using the Li and Wong's invariant set method using the ‘expresso’ function in the Affy package in Bioconductor (Li and Hung Wong, 2001; Gentleman ). Normalised data were log2 transformed and assessed initially using use two exploratory data analysis approaches: hierarchical cluster analysis (1-Pearson correlation coefficient distance with average linkage joining) and dimension reduction using correspondence analysis (COA; Eisen ; Fellenberg ). Figures were created using the made4 package in Bioconductor (Culhane ).

Detection of genes differentially expressed in purified tumour

Given the low number of replicates in this study, it is challenging to estimate of gene mean and variance; therefore, rank-based non-parametric methods may be more efficient in these data. It is reported that rank product performs comparably or outperforms t-statistic-based methods when replicates numbers are very low (less than five) (Breitling ; Jeffery ). Rank products analysis is a non-parametric statistic that detects genes that are consistently highly ranked in lists, that is genes that are consistently upregulated genes in a number of replicate experiments. Rank products analysis does not require a measure of gene-specific variance and is therefore particularly powerful when only a small number of replicates are available. Rank products analysis was performed using the Bioconductor package RankProd. False discovery rates were estimated using 100 permutations. To aid interpretation of these genes lists, we used DAVID to assess which Gene Ontology biological and functional categories were overrepresented in this list of genes (Dennis ). We used the highest stringency level, for other analyses we used an EASE of 0.01, and false discovery rate of 1000. Heatmap images of gene expression profiles were generated using the made4 package in Bioconductor. The Human Genome Organisation (HUGO) gene symbols for Affymetrix probe sets were retrieved using the annaffy Bioconductor package and the annotation library hu133plus2 (build Tuesday, 4 October 2005, 20:53:27).

Results

Patient demographics and FACS

Three matched patient biopsies were taken immediately after resection. No patients received pre-operative chemoradiotherapy. Metastases (m1) were observed intra-operatively in one patient. The other patients were free from metastases (mx). Histological examination of the E1 and E12 samples demonstrated moderately differentiated adenocarcinomas with strand-like infiltrative pattern of malignant glands through muscularis propria, interspersed by stroma. The E6 sample demonstrated a well-differentiated adenocarcinoma with closely packed glands invading into muscularis propria, with stroma between the glands. After generation of single cell suspensions from our experimental tumour biopsies, stromal content was estimated to range from 37 to 60%, and sorted to greater than 90% purity as described earlier (Figure 2). The parenchymal component of the tumours was estimated to range from 50 to 80% on histological assessment. The E6 sample demonstrated the biggest discrepancy in estimation of parenchymal content (FC estimate 37 vs 80% histological assessment), which may be explained by the fact that the sample contained a large muscularis propria component, which could account for the high proportion of non-staining FC events. Sorted cells were subsequently confirmed as tumour parenchyma by light microscopy assessment (Figure 3). We found the sorted cell population was homogenous and had the morphological appearances consistent with CRC cells after staining with Rapi-Diff and comparison to SW-620 colorectal cell line. The cells also stained positive for the cytokeratin MNF-116, confirming they were epithelial in nature. Using FC we found that the population of HEA+ cells fluoresced in the same region as cells stained with MNF-116.
Figure 2

Fluorescence-activated cell sorting of CRC cells from experimental samples. We generated single cell suspensions from three patient samples and sorted them to greater than 90% purity in each case. Dot plots show the pre- and post-sorting HEA+ populations. The table shows the estimates of tumour cell purity in the samples and the initial histological estimates of tumour parenchyma content. We sorted an average of 9 million cells per sample.

Figure 3

Cytological confirmation of cell-sorted tumour parenchyma. To confirm that our HEA+ CD14− CD45− cell population was tumour parenchyma, we examined three post-sorting populations after cytospinning and Rapi-Diff II (Diagnostic Developments) staining and compared them to SW-480 cells, a cell line derived from a primary colorectal tumour ( × 40 magnification shown). Then we confirmed the cells were epithelial by staining with MNF-116 pan-cytokeratin (Dako).

Differential gene expression

There were significantly more MAS 5.0 P calls in the whole samples than in the FACS-purified samples (paired t-test, P<0.05; Figure 4). Ordination was used to explore the data, and correspondence analysis was applied to the data using the made4 package in Bioconductor (Culhane ). Correspondence analysis is a useful dimension reduction method for observing the χ2 or associations between genes and samples. The dendrogram showed that the whole and the purified samples could be portioned into two distinct clusters (Figure 5A). These clusters were also observed on the most variant or first axis of a COA of these data (Fellenberg et al, 2001) (Figure 5B). Interestingly, the second most variant axis (F2, vertical) separated the metastatic (E1) and metastatic-free samples. Although we have few replicates in this study, it appeared that metastatic and metastatic-free tumour samples were more defined in the purified samples when compared to the whole samples. The discrimination between metastatic and metastatic-free samples accounted for more variance than difference between tumour stage.
Figure 4

Differential gene expression between whole tumour and cell-sorted colorectal cancer biopsies. This graph displays the genes called as presented by MAS 5.0 in each sample (P, present; A, absent; M, marginal). There was a significant difference in genes called as present in the whole-tissue samples (P=0.0407, Student's paired t-test). These genes are therefore expressed in non-parenchymal (or stromal) tissue.

Figure 5

Hierarchical clustering using Eisen's formula of a correlation similarity metric and average linkage clustering led to the cell-sorted samples and whole-tissue samples clustering together, showing they are most similar to each other (A). This is despite the fact that they are paired samples. Correspondence analysis also demonstrates that the sorted samples and whole-tissue samples cluster together (B). The first axis (horizontal) splits the whole and FACS-purified samples. The second axis (vertical) split E1 and E1W from E6, E12, E12W and E6W. Genes that separate the samples are shown with HUGO classification.

Expression of 289 genes were detected in whole-biopsy samples but not in purified samples (P<0.05). Of these, 50 differentially expressed genes were highly significant (P<0.01; Table 2). Expression of 103 genes were detected in purified samples, but significantly downregulated in whole samples (P<0.05), of which 33 of these were significant at P<0.01 (Table 3) which are displayed in the Heatmaps in. Heatmaps of the highly significant differentially expressed genes were generated, and displayed in Figure 6.
Table 2

Genes called significant in whole tumour (P<0.01, rank products)

Probe set ID Gene symbol Gene title RP/Rsum pfp
207961_x_at MYH11 Myosin, heavy chain 11, smooth muscle53.3310
218469_at GREM1 Gremlin 1, cysteine knot superfamily, homolog (Xenopus laevis)18.34440
205594_at ZNF652 Zinc-finger protein 652123.64050.0011
226663_at ANKRD10 Ankyrin repeat domain 10122.72710.0013
201438_at COL6A3 Collagen, type VI, α3113.38360.0014
212764_at ZEB1 Zinc finger E-box-binding homeobox 1139.71270.0015
224823_at MYLK Myosin light chain kinase108.55230.0017
228030_atTranscribed locus, strongly similar to NP_005768.1 RNA-binding motif protein 6 [Homo sapiens]138.6180.0017
212077_at CALD1 Caldesmon 1134.66390.0018
206199_at CEACAM7 Carcinoembryonic antigen-related cell adhesion molecule 7133.8680.002
212354_at SULF1 Sulfatase 193.76570.002
235028_atCDNA FLJ42313 fis, clone TRACH201942570.7020.0025
230269_atTranscribed locus145.63950.0029
241879_atTranscribed locus64.20240.0033
227260_at ANKRD10 Ankyrin repeat domain 10201.57460.0064
227061_atCDNA FLJ44429 fis, clone UTERU2015653209.82290.0066
202202_s_at LAMA4 Laminin, α4200.68960.0067
218468_s_at GREM1 Gremlin 1, cysteine knot superfamily, homolog (Xenopus laevis)194.72910.0067
238750_atTranscribed locus211.76120.0067
225269_s_at C2orf12 Chromosome 2 open-reading frame 12183.46780.0068
227140_atCDNA FLJ11041 fis, clone PLACE1004405197.63190.0068
203691_at PI3 Peptidase inhibitor 3, skin-derived (SKALP)200.11680.0069
208747_s_at C1S Complement component 1, s subcomponent194.13830.007
221748_s_at TNS1 Tensin 1185.74470.007
225681_at CTHRC1 Collagen triple helix repeat containing 1215.99280.0071
1557270_atCDNA FLJ36375 fis, clone THYMU2008226171.76430.0072
202404_s_at COL1A2 Collagen, type I, α2191.96360.0073
225786_at LOC284702 Hypothetical protein LOC284702162.58290.0073
225664_at COL12A1 Collagen, type XII, α1166.4780.0075
243134_atTranscribed locus219.38810.0075
210809_s_at POSTN Periostin, osteoblast-specific factor188.67810.0076
225275_at EDIL3 EGF-like repeats and discoidin I-like domains 3171.17170.0076
221729_at COL5A2 Collagen, type V, α2231.14450.0086
202948_at IL1R1 Interleukin 1 receptor, type I252.70680.0087
225107_at HNRNPA2B1 Heterogeneous nuclear ribonucleoprotein A2/B1243.10250.0087
226731_at PELO Pelota homolog (Drosophila)246.39120.0087
209656_s_at TMEM47 Transmembrane protein 47230.25680.0088
218353_at RGS5 Regulator of G-protein signaling 5254.3930.0088
224565_at TncRNA Trophoblast-derived noncoding RNA273.76870.0088
212067_s_at C1R Complement component 1, r subcomponent241.68540.0089
201852_x_at COL3A1 Collagen, type III, α1 (Ehlers-Danlos syndrome type IV, autosomal dominant)271.87570.009
208782_at FSTL1 Follistatin-like 1268.86270.009
212353_at SULF1 Sulfatase 1265.45040.0091
221778_at JHDM1D Jumonji C domain containing histone demethylase 1 homolog D (S. cerevisiae)229.7420.0091
224694_at ANTXR1 Anthrax toxin receptor 1268.35330.0091
225809_at DKFZP564O0823 DKFZP564O0823 protein261.48810.0091
215076_s_at COL3A1 Collagen, type III, α1 (Ehlers-Danlos syndrome type IV, autosomal dominant)240.33410.0092
1555878_at RPS24 Ribosomal protein S24260.42350.0093
201540_at FHL1 Four-and-a-half LIM domains 1258.03770.0093
231579_s_at TIMP2 TIMP metallopeptidase inhibitor 2265.30350.0093
Table 3

Genes called significant in FACS purified CRC cells (P<0.01, rank products)

Probe set ID Gene symbol Gene title RP/Rsum pfp
200664_s_at DNAJB1 DnaJ (Hsp40) homolog, subfamily B, member 168.07090
204018_x_at HBA1 Haemoglobin, α166.03740
209116_x_at HBB Haemoglobin, β10.55240
209458_x_at HBA1 Haemoglobin, α160.47850
211699_x_at HBA1 Haemoglobin, α167.09890
211745_x_at HBA1 Haemoglobin, α152.93710
214414_x_at HBA1 Haemoglobin, α161.13650
217232_x_at HBB Haemoglobin, β8.12130
217316_at OR7A10 Olfactory receptor, family 7, subfamily A, member 1059.44480
217414_x_at HBA1 Hemoglobin, α128.78390
225762_x_at LOC284801 Hypothetical protein LOC2848019.0590
225767_at42.74130
229667_s_at HOXB8 Homeobox B834.41760
204419_x_at HBG1 Haemoglobin, γA97.7980.0012
1565817_at IKZF1 IKAROS family zinc-finger 1 (Ikaros)93.47670.0013
211696_x_at HBB Haemoglobin, β92.89530.0013
209795_at CD69 CD69 molecule92.5880.0014
231100_at RRAD Ras-related associated with diabetes135.61850.0044
207574_s_at GADD45B Growth arrest and DNA-damage-inducible, β152.10690.0052
208252_s_at CHST3 Carbohydrate (chondroitin 6) sulfotransferase 3149.78730.0052
212099_at RHOB Ras homolog gene family, member B151.44720.0055
225377_at C9orf86 Chromosome 9 open-reading frame 86145.88610.0055
230935_at144.7220.0058
243001_at C18orf22 Chromosome 18 open-reading frame 22167.48740.006
202768_at FOSB FBJ murine osteosarcoma viral oncogene homolog B163.36930.0062
210042_s_at CTSZ Cathepsin Z174.62380.0062
1569428_at WIBG Within bgcn homolog (Drosophila)190.22990.0071
206834_at HBB Haemoglobin, β203.10120.0073
1556262_atCDNA clone IMAGE:4822139208.89250.0074
220369_at SMEK1 SMEK homolog 1, suppressor of mek1 (Dictyostelium)189.23570.0074
244804_at SQSTM1 Sequestosome 1210.70270.0075
235102_x_atTranscribed locus199.66530.0076
213515_x_at HBG1 Haemoglobin, γA223.8580.0088
237518_atTranscribed locus225.910.0091
Figure 6

Genes detected by rank products analysis (P<0.01). Heatmaps showing the differentially expressed genes detected in whole but not purified samples (A) and purified but not whole samples (B) using rank product analysis (P<0.01) using Z-score normalised values (row centred). Red tiles represent upregulated genes and blue represent downregulated genes.

Functional annotation

We used DAVID to classify gene function in the whole tumour sample (Table 4). Most functional classes are consistent with stromal as opposed to tumour cell function (e.g., proteinaceous extracellular matrix P=2.37 × 10−08, extracellular matrix P=2.49 × 10−08, collagen triple helix repeat P=7.12 × 10−07), although genes known to be expressed in tumour epithelium were identified (e.g., GREM1). Looking at individual genes, upregulation of connective tissue genes was prevalent (e.g., COL6A3, COL1A2, COL12A1, COL5A2, COL3A1, CTHRC1, SULF1), as were genes involved in extracellular matrix function (e.g., LAMA4, PI3, POSTN, TIMP2) and cell adhesion (e.g., TNS1). Genes involved in endothelial function (e.g., TIMP2) and specifically, colon cancer tumour endothelium, such as the anthrax toxin receptor (ANTXR1), were also upregulated (Liu ).
Table 4

Functional classification of whole tumour gene expression with DAVID (top 50)

Category Term Gene count Percentage P-value
GO_CCExtracellular region part1532.61%3.87E−10
GO_CCExtracellular region1736.966.31E−10
SP_PIRHydroxylation715.225.82E−09
GO_CCProteinaceous extracellular matrix1021.742.37E−08
SP_PIRExtracellular matrix919.572.49E−08
GO_CCExtracellular matrix1021.742.71E−08
SP_PIRSignal2247.833.88E−08
GO_CCExtracellular matrix part715.221.00E−07
UP_SEQSignal peptide2247.831.49E−07
SP_PIRTrimer510.873.61E−07
INTERPROCollagen triple helix repeat613.047.12E−07
SP_PIRTriple helix510.877.53E−07
SP_PIRHydroxylysine510.878.60E−07
SP_PIRSecreted1532.618.62E−07
GO_BPPhosphate transport613.049.47E−07
SP_PIRHydroxyproline510.871.10E−06
GO_CCCollagen510.871.22E−06
SP_PIRCollagen613.041.24E−06
SP_PIRDirect protein sequencing1839.132.48E−06
SP_PIRPyroglutamic acid510.873.81E−06
SP_PIRStructural protein613.046.91E−06
INTERPROCollagen helix repeat510.877.04E−06
GO_BPOrganic anion transport613.041.73E−05
GO_BPAnion transport613.043.99E−05
GO_MFExtracellular matrix structural constituent510.876.54E−05
KEGGECM-receptor interaction510.877.67E−05
UP_SEQShort sequence motif:Cell attachment site510.879.21E−05
PIR_SUPERFAMILYCollagen α1(I) chain36.529.79E−05
KEGGFocal adhesion613.041.39E−04
SP_PIRGlycoprotein1941.301.57E−04
GO_BPSystem development1328.261.62E−04
GO_MFStructural molecule activity919.571.68E−04
UP_SEQPropeptide:C-terminal propeptide36.521.83E−04
SP_PIREhlers-Danlos syndrome36.521.89E−04
GO_BPOrgan development1123.912.43E−04
GO_BPExtracellular matrix organization and biogenesis48.702.45E−04
GO_CCFibrillar collagen36.522.71E−04
INTERPROFibrillar collagen, C-terminal36.522.84E−04
KEGGCell communication510.874.06E−04
GO_BPMulticellular organismal development1430.435.90E−04
GO_BPMulticellular organismal process1736.966.33E−04
UP_SEQDisulphide bond1532.616.92E−04
SMARTCOLFI36.527.14E−04
UP_SEQGlycosylation site:N-linked (GlcNAc…)1839.138.15E−04
GO_BPAnatomical structure development1328.260.001069024
UP_SEQDomain:VWFC36.520.001162772
GO_BPExtracellular structure organization and biogenesis48.700.001291298
INTERPROvon Willebrand factor, type C36.520.003487936
GO_BPDevelopmental process1532.610.004413792
GO_MFComplement component C1s activity24.350.004676533
GO_CCExtracellular space613.040.005323916
UP_SEQDomain:VWFA 424.350.006011568
SMARTVWC36.520.007203275
SP_PIRPyrrolidone carboxylic acid36.520.008914529
SP_PIRSkin24.350.009320842
In contrast, the 31 highly significantly expressed genes in the FACS-purified cells do not display characteristics of stromal gene expression, and may be representative of tumour parenchyma gene expression. Genes involved in cell signalling, such as SQSTM1, which regulates activation of the nuclear factor-κB (NF-κB) signalling pathway, receptor internalisation, and protein turnover, and RRAD, a member of the Ras/GTPase superfamily, are also overexpressed (Moyers ; Seibenhener ). Genes known to be expressed in CRC were also significantly upregulated, such as HOX2D and RHOB, which mediate apoptosis in neoplastic cells, and are targets for novel antitumour agents, such as farnesyltransferase inhibitors (Vider ; Delarue ).

Comparison with Kwong et al expression signature

Kwong examined the expression signature derived from 60 tumours (normal mucosa, adenoma, tumour and liver metastases) and identified an expression profile that was able to differentiate between normal and neoplastic samples, but not individual tumour stages. They suggested that stromal genes may obscure the subtle molecular changes in tumours of differing pathological stage. Examination of their gene list, specifically the 34 upregulated genes in their signature, reveals the expression of extracellular matrix proteins such as, collagen, type I, α1 (COL1A1) and fibronectin. The authors believe this represented gene expression derived from infiltrating lymphocytes and other stroma. The gene list they identified shares similarities to gene expressed in our whole tumour sample (ANTXR1, COL12A1, COL5A2, CTHRC1, POSTN).

Discussion

Our study demonstrates that it is feasible to reproducibly separate and purify tumour parenchyma and other cell populations from a single cell suspension generated from a solid tumour using FACS. We also found that the gene expression profile elicited from the whole tumour was significantly different from that of the purified tumour parenchyma, and that source of this differential expression may be tumour stroma. When tumour parenchymal purity is necessary, FACS may be an alternative to LCM, in particular in tumours such as CRC, melanoma and other non-sclerotic tumours amenable to the generation of a single cell suspension. In our samples, we noted a large variance in estimated quantity of tumour parenchyma and stroma. Using FC we estimated that the parenchymal component of the tumours ranged from 37 to 60% and stroma from 40 to 63%. This was enriched to over 90% (range 90–96%) in each case with one sorting run per sample after calibration of the machine and settings for each sample. Verification of cell type is straightforward with standard staining techniques. We believe this level of stroma would grossly affect the gene expression profile taken from the biopsy and would be in keeping with the findings of Sugiyama . For publication we specifically used rank products as this has been shown to be reliable in small sample sizes in microarray experiments (Breitling ; Jeffery ). We found that the expression profile in the whole elicited also made biological sense. We are confident that there are real differences in the gene identified, which are related to the stromal gene expression, and that this has implications for clinical application of gene expression microarrays in CRC. The MammaPrint assay, which is a clinical application of the van‘t Veer 70-gene breast cancer expression profile, relies on a single fresh sample of tumour to predict prognosis (van ‘t Veer ; Cardoso ). The samples are examined, and a stromal content of <50% is deemed acceptable for the test. This would arguably eliminate two of our samples from analysis, which we were able to enrich to >90% purity. However, Wang , who derived an expression profile predicting recurrence of Duke's B colorectal carcinoma, included only samples that were enriched to over 85% purity. We believe that in CRC samples, the stroma will contaminate the sample causing problems with patient classification, but that can be overcome with parenchymal purification. The optimal tumour/stroma ratio for gene expression studies is yet to be determined and may vary depending on the tumour type. To ensure good quality expression data, we used several layers of quality control, starting with RNA gel electrophoresis and then checking RNA integrity with the Agilent Bioanalyzer (Agilent). Subsequently, after hybridisation and scanning, we checked several quality parameters such as scaling factor, 3′/5′ ratios, P calls and RawQ values. Quantitative RT–PCR was not employed as all of our arrays passed all stages of the quality control, and we do not believe that it will be used in clinical practice. This has been borne out by the current application of the MammaPrint assay and is a similar approach to other clinical microarray studies (Wang ; Glas ; Ach ; Cardoso ). We found a significant difference in the total number of probes called as present using the Affymetrix MAS5 signal algorithm. This shows that the whole tissue expressed a larger number of genes than sorted cells, demonstrating the wide range of genes expressed by non-tumour cells. We used Li and Wong's invariant set method for normalisation of the data sets (Li and Hung Wong, 2001). This uses a set of non-differentially expressed genes to normalise data that are identified by using an iterative procedure. Gene expression common to the sorted cells and whole tissue should be reasonably uniform. Any genes found to be expressed in the whole tissue can be presumed to be contained in stromal tissue. We then sought to examine how similar the gene expression profile of each sample was. Using hierarchical clustering, all three sorted samples clustered together, as did the whole-tissue samples clustered together. Correspondence analysis also showed a clear separation of sorted samples and whole-tissue samples. The E1 and E1W sample also separated from the other samples. This is not surprising as this sample was from a more advanced tumour than the others. The stromal component of the whole-tissue sample was the biggest determinate in differences and could easily separate all samples. This may explain why some studies show very similar GEP throughout tumour stages, and the similarity of some of the genes in our whole tumour sample to that of Kwong et al (Birkenkamp-Demtroder ; Kwong ). Qualitatively the expression of the whole tumour samples were consistent with tumour stroma, with genes highly specific for colorectal tumour stroma (e.g., ANTXR1), and DAVID analysis identifying highly significant functional groups involved in extracellular matrix function. Conversely, FACS-purified parenchyma expressed genes specifically associated with colorectal neoplasia, such as HOX2D and RHOB. The ability to examine the samples in parallel affords increased precision in analysis of tumour fraction gene expression and offers new opportunities to examine tumour–stroma interactions. Fluorescence-activated cell sorting parenchymal purification has several advantages over LCM. Disaggregation of a tumour sample generates a large random sample of tumour cells and may elicit a more representative and relevant gene expression profile than LCM, without the need for RNA amplification. Previous studies have assessed the differences between LCM acquired tissue, macrodissection and whole-tissue samples for microarray studies (Sugiyama ; Michel ; de Bruin ). Sugiyama suggests that if the stromal compartment is >30% LCM should be used, and showed marked differences in GEP in LCM-derived tissue compared with bulk biopsy. Similarly, Michel demonstrated that the LCM process introduces a bias into GEP profiles. Although they found that large expression changes were maintained, many genes changed with lower expression levels may be lost. This is problematic for several reasons – particularly as smaller changes in mRNA expression may have larger effects downstream than larger ones. Also it makes comparisons difficult between studies. Although LCM aims to overcome such problems, the very premise it is built on may introduce bias. A two-dimensional microscope view of a complex three-dimensional structure such as a tumour leads to irreversible qualitative and quantitative loss of information (Nyengaard, 1999). This means that fractions of cells can be grossly under- or overestimated if unbiased sampling methods such as stereological methods are not used (the ‘reference trap’). At worst it can lead to a gene expression profile of a tiny fraction of tumour being misinterpreted as expression of the whole tumour. Macrodissection is subject to similar compromises. Fluorescence-activated cell sorting overcomes many of these problems by allowing systematic sampling of cells, providing a large sample of cells, which allows confirmation of purity of targets and also a better average of gene expression in a tumour. In conclusion, FACS is effective in producing homogenous cell populations for gene expression microarray experiments in solid tumours and is viable alternative to macrodissection, LCM and whole tumour sampling in microarray experiments. Fluorescence-activated cell sorting overcomes many of the practical and theoretical problems associated with LCM. The gene expression profile of FACS-purified tumour parenchyma is significantly different to that of clinically resected tumour biopsies. Our analysis suggests that stromal gene expression is responsible for the differential expression and makes a significant contribution to the gene expression profile of whole tumour CRC biopsies. Therefore, one should consider a purification strategy when planning solid tumour gene expression microarray experiments. Although many of the sources of technical noise and variation in gene expression microarray technology have been overcome, there remain challenges, such as the approach to tumour heterogeneity, which need to be overcome before it is accepted into clinical practice.
  54 in total

Review 1.  Stereologic methods and their application in kidney research.

Authors:  J R Nyengaard
Journal:  J Am Soc Nephrol       Date:  1999-05       Impact factor: 10.121

2.  Drug target validation and identification of secondary drug target effects using DNA microarrays.

Authors:  M J Marton; J L DeRisi; H A Bennett; V R Iyer; M R Meyer; C J Roberts; R Stoughton; J Burchard; D Slade; H Dai; D E Bassett; L H Hartwell; P O Brown; S H Friend
Journal:  Nat Med       Date:  1998-11       Impact factor: 53.440

3.  Laser capture microdissection.

Authors:  M R Emmert-Buck; R F Bonner; P D Smith; R F Chuaqui; Z Zhuang; S R Goldstein; R A Weiss; L A Liotta
Journal:  Science       Date:  1996-11-08       Impact factor: 47.728

4.  Monocytes and other infiltrating cells in human colorectal tumours identified by monoclonal antibodies.

Authors:  C Allen; N Hogg
Journal:  Immunology       Date:  1985-06       Impact factor: 7.397

5.  Ber-EP4: new monoclonal antibody which distinguishes epithelia from mesothelial.

Authors:  U Latza; G Niedobitek; R Schwarting; H Nekarda; H Stein
Journal:  J Clin Pathol       Date:  1990-03       Impact factor: 3.411

6.  Combined use of positive and negative immunomagnetic isolation followed by real-time RT-PCR for detection of the circulating tumor cells in patients with colorectal cancers.

Authors:  Junming Guo; Bingxiu Xiao; Xinjun Zhang; Zhijin Jin; Jian Chen; Lijun Qin; Xiongying Mao; Guangyu Shen; Hui Chen; Zhong Liu
Journal:  J Mol Med (Berl)       Date:  2004-10-13       Impact factor: 4.599

7.  Human colorectal carcinogenesis is associated with deregulation of homeobox gene expression.

Authors:  B Z Vider; A Zimber; D Hirsch; D Estlein; E Chastre; S Prevot; C Gespach; A Yaniv; A Gazit
Journal:  Biochem Biophys Res Commun       Date:  1997-03-27       Impact factor: 3.575

8.  Rad and Rad-related GTPases interact with calmodulin and calmodulin-dependent protein kinase II.

Authors:  J S Moyers; P J Bilan; J Zhu; C R Kahn
Journal:  J Biol Chem       Date:  1997-05-02       Impact factor: 5.157

9.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

10.  Quantitative analysis of myocardial inflammation by flow cytometry in murine autoimmune myocarditis: correlation with cardiac function.

Authors:  Marina Afanasyeva; Dimitrios Georgakopoulos; Diego F Belardi; Amrish C Ramsundar; Jobert G Barin; David A Kass; Noel R Rose
Journal:  Am J Pathol       Date:  2004-03       Impact factor: 4.307

View more
  11 in total

1.  How Should Biobanks Prioritize and Diversify Biosample Collections? A 40-Year Scientific Publication Trend Analysis by the Type of Biosample.

Authors:  Jae-Eun Lee; Young-Youl Kim
Journal:  OMICS       Date:  2018-03-27

2.  Adipocyte-derived endotrophin promotes malignant tumor progression.

Authors:  Jiyoung Park; Philipp E Scherer
Journal:  J Clin Invest       Date:  2012-10-08       Impact factor: 14.808

3.  Estimation of the fraction of cancer cells in a tumor DNA sample using DNA methylation.

Authors:  Takamasa Takahashi; Yasunori Matsuda; Satoshi Yamashita; Naoko Hattori; Ryoji Kushima; Yi-Chia Lee; Hiroyasu Igaki; Yuji Tachimori; Masato Nagino; Toshikazu Ushijima
Journal:  PLoS One       Date:  2013-12-02       Impact factor: 3.240

4.  Identification of potential therapeutic targets for colorectal cancer by bioinformatics analysis.

Authors:  Ming Yan; Maomin Song; Rixing Bai; Shi Cheng; Wenmao Yan
Journal:  Oncol Lett       Date:  2016-10-31       Impact factor: 2.967

5.  Identifying miRNA and gene modules of colon cancer associated with pathological stage by weighted gene co-expression network analysis.

Authors:  Xian-Guo Zhou; Xiao-Liang Huang; Si-Yuan Liang; Shao-Mei Tang; Si-Kao Wu; Tong-Tong Huang; Zeng-Nan Mo; Qiu-Yan Wang
Journal:  Onco Targets Ther       Date:  2018-05-15       Impact factor: 4.147

Review 6.  The Biological Role of the Collagen Alpha-3 (VI) Chain and Its Cleaved C5 Domain Fragment Endotrophin in Cancer.

Authors:  Jingya Wang; Wensheng Pan
Journal:  Onco Targets Ther       Date:  2020-06-22       Impact factor: 4.147

7.  Single-cell sequencing analysis characterizes common and cell-lineage-specific mutations in a muscle-invasive bladder cancer.

Authors:  Yingrui Li; Xun Xu; Luting Song; Yong Hou; Zesong Li; Shirley Tsang; Fuqiang Li; Kate McGee Im; Kui Wu; Hanjie Wu; Xiaofei Ye; Guibo Li; Linlin Wang; Bo Zhang; Jie Liang; Wei Xie; Renhua Wu; Hui Jiang; Xiao Liu; Chang Yu; Hancheng Zheng; Min Jian; Liping Nie; Lei Wan; Min Shi; Xiaojuan Sun; Aifa Tang; Guangwu Guo; Yaoting Gui; Zhiming Cai; Jingxiang Li; Wen Wang; Zuhong Lu; Xiuqing Zhang; Lars Bolund; Karsten Kristiansen; Jian Wang; Huanming Yang; Michael Dean; Jun Wang
Journal:  Gigascience       Date:  2012-08-14       Impact factor: 6.524

8.  Characterization of adult α- and β-globin elevated by hydrogen peroxide in cervical cancer cells that play a cytoprotective role against oxidative insults.

Authors:  Xiaolei Li; Zhiqiang Wu; Yao Wang; Qian Mei; Xiaobing Fu; Weidong Han
Journal:  PLoS One       Date:  2013-01-17       Impact factor: 3.240

9.  Stroma derived COL6A3 is a potential prognosis marker of colorectal carcinoma revealed by quantitative proteomics.

Authors:  Jie Qiao; Cai-Yun Fang; Sun-Xia Chen; Xiao-Qing Wang; Shu-Jian Cui; Xiao-Hui Liu; Ying-Hua Jiang; Jie Wang; Yang Zhang; Peng-Yuan Yang; Feng Liu
Journal:  Oncotarget       Date:  2015-10-06

10.  Isolation and gene expression profiling of intestinal epithelial cells: crypt isolation by calcium chelation from in vivo samples.

Authors:  Aine Balfe; Grainne Lennon; Aonghus Lavelle; Neil G Docherty; J Calvin Coffey; Kieran Sheahan; Desmond C Winter; P Ronan O'Connell
Journal:  Clin Exp Gastroenterol       Date:  2018-01-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.