Literature DB >> 15318932

Transcriptomic changes in human breast cancer progression as determined by serial analysis of gene expression.

Martin C Abba1, Jeffrey A Drake, Kathleen A Hawkins, Yuhui Hu, Hongxia Sun, Cintia Notcovich, Sally Gaddis, Aysegul Sahin, Keith Baggerly, C Marcelo Aldaz.   

Abstract

INTRODUCTION: Genomic and transcriptomic alterations affecting key cellular processes such us cell proliferation, differentiation and genomic stability are considered crucial for the development and progression of cancer. Most invasive breast carcinomas are known to derive from precursor in situ lesions. It is proposed that major global expression abnormalities occur in the transition from normal to premalignant stages and further progression to invasive stages. Serial analysis of gene expression (SAGE) was employed to generate a comprehensive global gene expression profile of the major changes occurring during breast cancer malignant evolution.
METHODS: In the present study we combined various normal and tumor SAGE libraries available in the public domain with sets of breast cancer SAGE libraries recently generated and sequenced in our laboratory. A recently developed modified t test was used to detect the genes differentially expressed.
RESULTS: We accumulated a total of approximately 1.7 million breast tissue-specific SAGE tags and monitored the behavior of more than 25,157 genes during early breast carcinogenesis. We detected 52 transcripts commonly deregulated across the board when comparing normal tissue with ductal carcinoma in situ, and 149 transcripts when comparing ductal carcinoma in situ with invasive ductal carcinoma (P < 0.01).
CONCLUSION: A major novelty of our study was the use of a statistical method that correctly accounts for the intra-SAGE and inter-SAGE library sources of variation. The most useful result of applying this modified t statistics beta binomial test is the identification of genes and gene families commonly deregulated across samples within each specific stage in the transition from normal to preinvasive and invasive stages of breast cancer development. Most of the gene expression abnormalities detected at the in situ stage were related to specific genes in charge of regulating the proper homeostasis between cell death and cell proliferation. The comparison of in situ lesions with fully invasive lesions, a much more heterogeneous group, clearly identified as the most importantly deregulated group of transcripts those encoding for various families of proteins in charge of extracellular matrix remodeling, invasion and cell motility functions.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15318932      PMCID: PMC549167          DOI: 10.1186/bcr899

Source DB:  PubMed          Journal:  Breast Cancer Res        ISSN: 1465-5411            Impact factor:   6.466


Introduction

Invasive ductal breast carcinoma (IDC) is the most common malignancy of the breast, accounting for ~80% of all invasive breast tumors [1]. Although an issue of much controversy over the years, there is now general agreement and overwhelming histopathological and genetic evidence indicating that most invasive breast carcinomas appear to develop gradually from defined precursor lesions [2]. However, it also became clear that progression toward more aggressive phenotypes is not obligatory [3]. It is further evident that many genetic abnormalities underlying tumor progression are probably phenotypically silent. Numerous molecular genetic changes have been reported as relevant in human breast carcinogenesis, such as anomalies affecting cell proliferation, apoptosis and invasion [4]. Preinvasive breast lesions such as high-grade ductal carcinoma in situ (DCIS) are known to have acquired a myriad of genomic and transcriptomic changes, but as their name implies they are not yet invasive. The development of the ability to invade surrounding tissues is perhaps the most critical event in cancer progression. Among proposed invasion-related genes with reported altered expression in tumor cells are cell adhesion molecules, proteases and cytoskeletal molecules that may influence motility [5]. Identifying the key and most common gene expression abnormalities involved in the transition steps from preinvasion to a fully invasive phenotype is an extremely important topic of research and the main objective of the present report. Studies on this area may provide clues to better diagnose premalignant lesions at high risk of progression and may aid in achieving a better understanding of critical early molecular mechanisms involved in breast cancer evolution. Serial analysis of gene expression (SAGE) is a comprehensive profiling method that allows for global, unbiased and quantitative characterization of transcriptomes [6,7]. SAGE provides a statistical description of the mRNA population present in a cell without prior selection of the genes to be studied, and this constitutes a major advantage. In this sense, only open systems can identify expressed genes that have not yet been cloned or partially sequenced. A second major advantage is that the information generated is digital in format, and can be directly compared with data generated from any other laboratory or with data available in public databases such as the Cancer Genome Anatomy Project . To perform a comparative SAGE analysis of normal, preinvasive and invasive lesions, we used a modified t test that we have recently developed [8]. This method has the advantage of taking into account both the intra-sample and inter-sample variability, identifying 'common patterns' of gene changes systematically occurring across samples. Most of the tests developed for measuring differential expression in SAGE data focus on capturing the first type of variation correctly, but tend to neglect the second type [9,10]. The aim of the present study was to provide a statistically robust global gene expression analysis on the progression of breast cancer using the described statistical approach comparing breast normal and tumor SAGE libraries obtained from public databases combined with additional SAGE libraries recently generated in our laboratory.

Materials and methods

SAGE libraries

To perform the comparative analysis of different stages of breast cancer progression, we combined SAGE libraries available in public databases with breast cancer libraries generated and sequenced at our own laboratory. To this end, 12 SAGE libraries of breast tissues (four normal breast tissues, six DCIS tissues and two IDC tissues) were downloaded from the Cancer Genome Anatomy Project – SAGE Genie database (libraries generated at the Polyak Laboratory, Dana-Farber Cancer Institute, Boston, MA, USA). We used 11 additional breast cancer SAGE libraries generated by ourselves, at an approximate resolution of 100,000 SAGE tags per library. All IDC SAGE libraries used in this study were from lymph node-negative, estrogen receptor-positive and progesterone receptor-positive tumor samples, with a tumor size classification of T1 or T2 (i.e. T1–T2 N0 M0). Table 1 summarizes all the SAGE libraries used in this comparative analysis.
Table 1

Breast-specific serial analysis of gene expression (SAGE) libraries

HistologyLibrary nameTag countUnique tags
Normal breast tissue
 Normal 1SAGE breast normal AP Br Na37,41915,886
 Normal 2SAGE breast normal epithelium AP 1a49,02118,276
 Normal 3SAGE breast normal organoid Ba58,32619,602
 Normal 4SAGE breast normal organoid B2a59,48120,391
Ductal carcinoma in situ
 DCIS 1SAGE breast carcinoma MD DCISa42,17414,237
 DCIS 2SAGE breast carcinoma AP DCIS 3a57,92431,142
 DCIS 3SAGE breast carcinoma B DCIS 4a60,69920,224
 DCIS 4SAGE breast carcinoma B DCIS 5a43,11815,935
 DCIS 5SAGE breast carcinoma epithelium AP DCIS 6a73,40930,256
 DCIS 6SAGE breast carcinoma B BWHT18a50,87919,182
 DCIS 7MDACC 22Tb102,53333,305
Invasive ductal carcinoma
 IDC 1MDACC 09Tb91,64737,863
 IDC 2MDACC 14Tb100,25526,422
 IDC 3MDACC 15Tb90,19827,653
 IDC 4MDACC 17Tb100,38629,300
 IDC 5MDACC 18Tb101,54329,936
 IDC 6MDACC 19Tb100,33428,498
 IDC 7MDACC 20Tb100,04728,903
 IDC 8MDACC 21Tb103,82531,412
 IDC 9MDACC 24Tb99,54630,363
 IDC 10MDACC 25Tb100,50130,778
 IDC 11SAGE breast carcinoma B IDC 3a68,93722,732
 IDC 12SAGE breast carcinoma B IDC 5a60,47620,457
Total23 breast libraries1,752,678

a Libraries available in public databases. b Libraries generated in our laboratory.

SAGE methodology

For the SAGE libraries generated in our laboratory we followed standard methods. Briefly, total RNA was extracted from snap-frozen tissues using TRIzol (Invitrogen, San Francisco, CA, USA). SAGE library construction was performed with the I-SAGE kit (Invitrogen) according to the manufacturer's protocol and introducing only minor modifications. The anchoring enzyme was NlaIII and the tagging enzyme used was BsmFI. Concatemerized ditags were cloned into pZERO-1 and sequenced with an ABI 3700 DNA Analyzer (Applied Biosystems, Foster City, CA, USA).

SAGE data processing

SAGE tags were extracted from sequencing files using the SAGE2000 software version 4.0 (a kind gift from Dr K. Kinzler, John Hopkins School of Medicine, Baltimore, MD, USA). Tag abundances for all libraries were normalized to a total of 100,000 tags (at which level a tag present 10 times has an abundance of 0.01%). Tag to gene assignments as well as additional annotations using public databases (e.g. Gene Ontology, Locus Link, Unigene cluster) were performed, using web-based SAGE library tools developed by ourselves . In our comparison we used only tags with only one reliable assigned gene.

Statistical analysis of SAGE libraries

To compare the 23 SAGE libraries, we utilized a modified t test recently developed by us [8]. This analysis allowed us to identify SAGE tags with significantly different expression levels (P < 0.01) between normal tissue and DCIS and between DCIS and IDC. Tags with total counts of less than three in all libraries were filtered out before the analysis. In order to enable visualization and illustration of our analyses, we utilized the TIGR MultiExperiment Viewer (MeV 2.2) software (The Institute for Genomic Research, Rockville, MD, USA). This tool was employed for normalization and average clustering of the SAGE data. The aim of the heat maps presented is simply to organize and illustrate the data by graphical means. Briefly, the normalization included logarithmic transformation followed by median centering by samples and genes. We used standard average hierarchical clustering techniques to classify and illustrate further the differences found by the modified t test, showing the clusters of differentially coexpressed genes between the normal tissue, DCIS and IDC groups.

Results and discussion

Generation and analysis of SAGE libraries

The primary goal of our study was to identify the most commonly occurring transcriptome changes in the transition from normal breast epithelium to DCIS and invasive carcinoma. To this end, SAGE data obtained from 11 breast cancer libraries generated in our laboratory (1,090,815 tags) were combined and compared with data available in the public domain (661,863 tags), thus generating a dataset of almost 1.7 million breast cancer and normal specific tags, representing approximately 25,157 transcripts from a total of 23 libraries (Table 1). Our statistically stringent analysis revealed 52 transcripts commonly deregulated across the board when comparing normal tissue with DCIS (Fig. 1), and 149 transcripts when comparing DCIS with IDC (P < 0.01) (Fig. 2) (see additional data files 1 and 2 for additional information with statistical cutoff at P < 0.05). Selected genes based on relative abundance, highly statistical differences and high fold changes between compared groups are sorted and represented in Tables 2 and 3.
Figure 1

Hierarchical clustering of the most commonly different expressed genes between normal breast tissue and ductal carcinoma in situ (DCIS) groups (P < 0.01). Color scale at bottom of picture is used to represent expression level: low expression is represented by green, and high expression is represented by red.

Figure 2

Hierarchical clustering of the most commonly differentially expressed genes between ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC) groups (P < 0.01). Color scale at bottom of picture is used to represent expression level: low expression is represented by green, and high expression is represented by red.

Table 2

Most frequent differentially expressed genes between normal breast epithelium and ductal carcinoma in situ (DCIS)

 TagGeneDescriptionLocus linkFold changeP value
DCIS overexpressed genes
 GTATTTAACTPKD1-likePolycystic kidney disease 1-like7993213.70.0100
 CGGACTCACTSTARD10START domain containing 101080911.20.0086
 GTGTTGGGGGEPS8L2EPS8-like 2647879.60.0099
 TTTCTGGAGGKIAA0545KIAA0545 protein230948.60.0100
 GATAAATTAAFLJ14153Hypothetical protein647478.50.0055
 GAGAAATATCNP220Nuclear protein273328.00.0088
 CCCTCTTTGGLOC118487mRNA similar to RIKEN cDNA 11100010191184877.40.0037
 CTGGGACTGALSM4U6 small nuclear RNA associated (S. cerevisiae)258046.40.0055
 CTGGGCCAGCVAMP5Vesicle-associated membrane protein 5107916.40.0068
 GCCCTTTCTCMRC2Mannose receptor, C type 299025.80.0015
 TCTTGATTTAA2MAlpha-2-macroglobulin25.80.0083
 TAGTTTGTGGMSH2MutS homolog 2, colon cancer44365.60.0091
 TCAGTGAACTHPS4Hermansky–Pudlak syndrome 4897815.60.0100
 GTTTATTCTTFOXA1Forkhead box A131695.30.0044
 GCCGCTGCCAPPP1R13BProtein phosphatase 1233684.30.0054
 TAAAGTGTCTPIGSPhosphatidylinositol glycan, class S940053.90.0100
DCIS underexpressed genes
 GGGACGAGTGTM4SF1Transmembrane 4 superfamily member 14071-442.60.0083
 TAACAGCCAGNFKBIANuclear factor kappa light polypeptide gene4792-158.62.4 × 10-6
 CAACTAATTCCLUClusterin1191-63.90.0036
 GCCTTAACAAPBEFPre-B-cell colony-enhancing factor10135-44.30.0020
 GGGTTTTTATNSEP1Nuclease sensitive element binding protein 14904-36.20.0001
 GACACGAACARASD1RAS, dexamethasone-induce 151655-31.40.0095
 AAGATTGGTGCD9CD9 antigen (p24)928-29.60.0003
 ACCAAATTAATNFRSF10BTumor necrosis factor receptor superfamily8795-29.40.0003
 CTGGGCCTGALITAFLipopolysaccharide-induced tumor necrosis factor9516-28.90.0076
 CTGCCATAACSBDSShwachman–Bodian–Diamond syndrome51119-24.20.0005
 CACAGGCAAABZW1Basic leucine zipper and W2 domains 19689-22.10.0056
 GTTCCCTGGCFAUFinkel–Biskis–Reilly murine sarcoma virus2197-22.10.0028
 GTCTGCACCTDKFZp547C1Hypothetical protein254851-21.90.0087
 TACGTTGCAGGC20Translocation factor sui1 homolog10289-21.80.0079
 TGTAAAGATTCCNL1Cyclin L157018-21.20.0008
 TGTTAAGTTCCRY1Cryptochrome 1 (photolyase-like)1407-18.70.0091
 GAAATAAAGTFLJ21657Hypothetical protein64417-18.50.0032
 ATGGGCTTGASQRDLSulfide quinone reductase-like (yeast)58472-17.30.0061
 TCAAGAAATTPSME3Proteasome activator subunit 310197-15.70.0028
 CCGTGGTCGTFBLFibrillarin2091-15.30.0100
 TGGAACAGGATGIFTransforming growth factor beta-induced factor (TALE family homeobox)7050-12.20.0030
 AATGCTGGCADNAJB6DnaJ homolog, subfamily B, member 610049-11.70.0065
 AATGAGCAACGBP2Guanylate binding protein 2, interferon-inducible2634-11.10.0082
 GACCTATCTCKIAA0992Paladin23022-10.90.0092
 AACTCTTGAAEIF3S3Eukaryotic translation initiation factor 38667-10.60.0082
 GGGATTTTGTPMAIP1Phorbol-12-myristate-13-acetate-induced protein 15366-10.60.0091
 AAAGCAAAAAPTPN4Protein tyrosine phosphatase, non-receptor type 45775-10.40.0040
 ACTGACTATCNEU1Sialidase 1 (lysosomal sialidase)4758-10.30.0095
 TTCCAGTTCAPDE4BPhosphodiesterase 4B, camp-specific5142-9.90.0087
 GAATGATTTCORF1-FL49Putative nuclear protein84418-9.50.0070
 GACTCGCTCCHSJ001348cDNA for differentially expressed CO16 gene54742-9.50.0072
 TGGTTACAAANDEL1Nude nuclear distribution gene E homolog like 181565-8.90.0100
 AGTATGAGGATNFAIP3Tumor necrosis factor, alpha-induced protein 37128-8.10.0083
 CAGTTTAAAACRSP6Cofactor required for Sp1 transcriptional activation9440-7.70.0100
Table 3

Most frequent differentially expressed genes between ductal carcinoma in situ and invasive ductal carcinoma (IDC)

 TagGeneDescriptionLocus linkFold changeP value
IDC overexpressed genes
 TGGAAATGACCOL1A1Collagen type I, alpha 11277315.40.0054
 ATGTGAAGAGSPARCSecreted protein, cysteine-rich (osteonectin)6678286.80.0003
 TTTGGTTTTCCOL1A2Collagen type I, alpha 21278210.90.0084
 TTGCTGACTTCOL6A1Collagen type VI, alpha 1129173.90.0023
 TTATGTTTAALUMLumican406056.70.0011
 TTGGAGATCTNDUFA4NADH dehydrogenase (ubiquinone)469756.40.0065
 CCACAGGGGACOL3A1Collagen type III, alpha 1128149.40.0056
 ATCTTGTTACFN1Fibronectin 1233544.30.0031
 TTGTAATCGTOAZ1Ornithine decarboxylase antizyme 1494638.60.0038
 TGTAATCAATHNRPA1Heterogeneous nuclear ribonucleoprotein A1317838.20.0039
 GGAAGCTAAGOSF-2Osteoblast specific factor 2 (fasciclin I-like)1063136.30.0005
 ACCTGTATCCIFITM3Interferon induced transmembrane protein 31041034.30.0021
 GGAAATGTCAMMP2Matrix metalloproteinase 2431329.10.0008
 TGCACTTCAASPARCL1SPARC-like 1 (mast9, hevin)840421.70.0050
 GGAACTTTTASULF2Sulfatase 25595919.80.0017
 CTGTTAGTGTMDH1Malate dehydrogenase 1419018.50.0026
 TATGAATGCTCSPG2Chondroitin sulfate proteoglycan 2 (versican)146218.20.0017
 TCCAAATCGAVIMVimentin743117.70.0014
 TGTAGTTTGASKP1AS-phase kinase-associated protein 1A650016.90.0013
 TAATAAACAGASAH1n-Acylsphimgosine amidohydrolase42716.80.0085
 GCCTCCTCCCEIF3kEukaryotic translation initiation factor 32733516.80.0098
 GAAACAAGATPGK1Phosphoglycerate kinase 1523015.80.0006
 TGCTTTGGGATTC11tetratricopeptide repeat domain 115102415.80.0018
 GAAATCAAAASIGLEC5Sialic acid binding Ig-like lectin 5877814.70.0098
 ATGTAGTAGTHNRPDHeterogeneous nuclear ribonucleoprotein D318414.70.0025
 GACCACCTTTMFAP2Microfibrillar-associated protein 2423714.40.0000
 ACTTATTATGDCNDecorin163413.90.0007
 TCTCTACCCAAPLP2Amyloid beta (A4) precursor-like protein 233413.70.0076
 TGCAATATGCFBN1Fibrillin 1220013.50.0037
 ATTTCTTCAASFRP2Secreted frizzled-related protein 2642313.40.0030
 ATAAAAAGAACTSKCathepsin K (pycnodysostosis)151313.00.0003
 GTACATTGTAMGC15737Hypothetical protein8501212.60.0011
 TGATGTTTGADAZAP2DAZ associated protein 2980212.50.0013
 TCCGTGGTTGBASP1Membrane attached signal protein 11040912.10.0055
 ACTGCTTTACDKFZp564I1922Adlican2587812.00.0086
 TCTGCAATGATINP1Trasnforming growth factor beta-inducible nuclear protein 11041212.00.0033
 GTTTCTTCCCSELHSelenoprotein H288063611.80.0037
 AATATGCTTTATP6V1E1ATPase52911.60.0009
 TTATGGATCTSPON2Spondin 2, extracellular matrix protein1041711.30.0003
 AAAATAAAGAAPEX1Nuclease, multifunctional DNA repair enzyme32811.30.0099
 TGTGTGTTTGHIF0H1 histone family300511.20.0029
 TATGTTTCAGPTPN12Protein tyrosine phosphatase578211.10.0003
 ACCAAAGCCCMGC9651Hypothetical protein11493210.60.0075
 CAAGGATCTANICE-3NICE-3 protein2591210.50.0057
 GACGTCTTAAPSMA4Proteasome subunit, alpha type568510.30.0038
 CAGATAACATTOMM20Translocase980410.10.0064
 AACTCTTGAAEIF3S3Eukaryotic translation initiation factor 3866710.00.0047
 TTCTTGGTGTTRPS1Trichorhinophalangeal syndrome I72279.90.0042
 TGCCTTAGTADNAJC1DNAJ homolog642159.80.0040
 AGACAAGCTGSFRS5Splicing factor64309.40.0015
 ACAAGAATTGSYPLSynaptophysin-like protein68569.30.0027
 TACATCCGAAMTPNMyotrophin1363199.30.0013
As expected, we detected various ribosomal genes among the most abundant transcripts in all the breast SAGE libraries, and these genes were highly upregulated in the invasive carcinomas. This agrees with the previous global expression profiles and with the comparisons of cancers and the corresponding normal tissues in general [7,11,12]. To simplify illustration of the data, ribosomal genes are not included in the figures and tables.

Global comparison of normal tissues and DCIS

Among the 52 transcripts detected as differentially expressed in DCIS (P < 0.01), 36 were downregulated transcripts and 16 were upregulated transcripts in these lesions when compared with normal breast epithelial cells and mammary epithelial organoids (Fig. 1 and Table 2). We defined and classified the 52 genes differentially expressed into the nine functional categories [13] shown in Fig. 3a. Interestingly, we found that 38% of these transcripts are related to the cell cycle (15%), signal transduction (8%) and apoptosis (15%).
Figure 3

Classification in functional categories of affected transcripts. (a) Differentially expressed between normal breast tissue and ductal carcinoma in situ (DCIS) (P < 0.01). (b) Transcripts differentially expressed between DCIS and invasive ductal carcinoma (IDC) (P < 0.01).

As expected, our analysis of DCIS versus normal breast epithelium revealed numerous similarities with SAGE data reported previously [12,14], but more importantly it also provided novel information. The expression of numerous genes was significantly downregulated in DCIS, including: transmembrane 4 super family member 1 (TM4SF1), nuclear factor kappa light polypeptide (NFKB1A), pre-B-cell (PBEF), RAS dexamethasone-induced (RASD1), tumor necrosis factor receptor superfamily member 10b (TNFRSF10B), and tumor necrosis factor α-induced protein (TNFAIP). All these transcripts were also observed downregulated in previous reports [12,14] (Table 2). On the contrary, our analysis revealed additional clusters of genes significantly downregulated in the DCIS group that were not previously reported by others: clusterin/apolipoprotein J (CLU), nuclease sensitive element binding protein 1 (NSEP1), lipopolysaccharide-induced TNF factor (LITAF/PIG7), basic leucine zipper/W2 domains 1 (BZW1), and cyclin L1 (CCNL1) (Table 2). Clusterin was one of the most dramatically downregulated genes (-63.9-fold; P = 0.0036) in DCIS libraries. This gene encodes a heterodimeric, highly conserved, secreted glycoprotein. Alterations in Clusterin expression and/or protein maturation are linked to changes in tissue growth or regression, which may be related to specific proapoptotic or antiapoptotic protein isoforms [15]. Clusterin was reported as overexpressed during tissue and cell involution, and was downregulated in esophageal squamous cell carcinoma and prostate carcinoma, suggesting that this expression alteration could be a general phenomenon during tumor progression [16,17]. On the contrary, and in contrast to these and our observations, Redondo and colleagues reported increased Clusterin expression in breast carcinoma samples [18]. The reason for this discrepancy is unclear at this point. The role of Clusterin in cell survival, cell death and neoplastic transformation remains controversial [15]. Another commonly observed downregulated gene in DCIS libraries was NSEP-1 (-36.2-fold; P = 0.0001). Also known as YB1, NSEP-1 is a member of the highly conserved Y-box family of proteins, which regulate the transcription of several genes associated with cell death including both fas, a cell death-associated receptor, and the tumor suppressor gene p53 [19]. The decrease in expression of NSEP-1 transcripts could play an important role in the early stages of breast carcinogenesis in order to overcome cell proliferation controls. Interestingly, and as previously observed, we also detected significant downregulation of various cytokines and chemokines: interleukin enhancer binding factor 2 (ILF2), interleukin 13 receptor alpha 1 (IL13RA1), leukemia inhibitory factor (LIF), cardiotrophin-like cytokine (CLC), chemokine C–C ligand 2 (CCL2), and chemokine C–X–C ligand 1 (CXCL1). All these cytokines and chemokines are highly expressed in normal mammary epithelium and are dramatically downregulated in the DCIS samples. These differentially expressed genes were detected within a range of 0.02

20]. Some of these molecules (e.g. CXCL1, LIF) appear to play important roles in the normal periodic cycles of growth and involution of the mammary gland following pregnancy and lactation. They may thus be part of the physiologic mechanisms associated with the massive apoptosis observed during involution [21,22]. Unfortunately we understand very little of the relevance of their intriguing de facto silencing in expression, both in in situ as well as in invasive breast cancer lesions. Interestingly, we also detected a series of transcripts commonly overexpressed in the DCIS samples: polycyctic kidney disease 1-like (PKD1-like), START domain containing 10 (STARD10), EPS8-like2 (EPS8L2), and KIAA0545 protein (Fig. 1d). One of these genes, EPS8-like2, encodes a protein that is related to epidermal growth factor receptor pathway substrate 8 (EPS8), and was shown to be essential in Ras/PI3K to Rac signaling [23]. PKD1-like encodes a member of the polycystin protein family. Members of this protein family may function in cell development and morphology, and may modulate intracellular calcium homoeostasis and other signal transduction pathways [24,25]. Although the PKD1 gene has been associated with cancer mechanisms, this homologous family member has not been implicated in carcinogenesis processes to the best of our knowledge. KIAA0545, also known as signal-induced proliferation-associated 1 like 3 (SIPA1L3), is a member of the Sipa1 family and encodes a protein bearing a domain highly homologous to the catalytic region of human Rap1 GTPase-activating protein (Rap1GAP). Sipal1 is involved in the regulation of the Ras-mediated signal transduction pathway for cell proliferation and cell cycle progression [26]. These genes could be involved in signaling pathways that lead to cell proliferation, but their potential role in malignant transformation remains unknown.

Differentially expressed genes associated with NF-κB and tumor necrosis factor pathways

One of the transcripts observed to be most differentially expressed when comparing normal tissue with DCIS was NFKBIA (better known as IκBα), demonstrating a 150-fold higher expression (P < 0.0001) in normal mammary epithelial cells (Table 2 and Fig. 1b). NFKBIA is a member of IκB family genes that play a critical role in regulating the activity of the NF-κB transcription factor [27,28]. NF-κB plays a major role in diverse biological processes such as cell proliferation, differentiation, apoptosis and metastasis [29,30]. NF-κB is also required to prevent cell death induced by tumor necrosis factor (TNF) [31]. Interestingly, and perhaps pointing to connected pathways and related outcomes, we also detected a strong decrease in the expression levels of TNFRSF10 (29-fold; P < 0.0003), LITAF/PIG7 (29-fold; P < 0.0003) and TNFAIP3 (eightfold; P < 0.0083) transcripts in the DCIS group. The protein encoded by TNFRSF10, also known as TRAIL/APO2, is a member of the TNF-receptor superfamily and contains an intracellular death domain. This receptor can be activated by TNF-related apoptosis inducing ligand and its role is to transduce apoptosis signals [32,33]. LITAF/PIG7, a transcription factor, termed lipopolysaccharide-induced TNF-alpha factor, also found downregulated, was reported to regulate TNF-alpha gene expression playing a major role in TNF-alpha activation [34]. This gene, also known as P53-induced gene 7 (PIG7), has been shown to be induced by p53 when apoptosis is triggered, and therefore could also play a role in programmed cell death [35]. The concerted decline of these transcripts early in breast tumor progression appears conducive to a virtual silencing of apoptosis induction pathways and a consequential net increase in cell proliferation. In other words, the homeostasis of proliferation cell death normally operating in the breast epithelium is altered and inclined towards a net gain in cell numbers via multiple signaling pathways.

Global comparison of in situ and invasive carcinomas

We found 149 transcripts differentially expressed between DCIS and IDC at P < 0.01. All of these genes were found overexpressed commonly at the invasive stage (Fig. 2). Table 2 summarizes the 52 most commonly overexpressed genes in invasive carcinoma lesions. We defined and classified the 149 genes differentially expressed in 10 functional categories [13] as shown in Fig. 3b. Interestingly, we found that 37% of these upregulated transcripts are related to the cell cycle (12%), extracellular matrix or secreted proteins (13%), cell adhesion and motility (6%), and signal transduction (6%). We were also able to detect 31 underexpressed genes in invasive carcinomas when compared with DCIS, but only when the stringency of the statistical comparison was dropped to within the 95% confidence interval (i.e. P < 0.05), reflecting a lower level of consistency in these gene expression changes when comparing DCIS with IDC (Fig. 4). Examples of these genes include: transmembrane 4 superfamily member 1 (TM4SF1) (-26.7-fold; P = 0.04), tumor necrosis factor receptor-associated factor 4 (TRAF4) (-10.7-fold; P = 0.04), PPAR binding protein (PPARBP) (-8.2-fold; P = 0.04), aldo-keto reductase family 1 (AKR1A1) (-6.7-fold; P = 0.03), hypothetical protein dJ465N24.2.1 (-6.4-fold; P = 0.028), microtubule-associated protein 1 (MAP1LC3A) (-3.7-fold; P = 0.02) and retinoblastoma binding protein 6 (RBBP6) (-2.6-fold; P = 0.04).
Figure 4

Hierarchical clustering of downregulated genes in invasive ductal carcinoma (IDC) (P < 0.05). Color scale at bottom of picture is used to represent expression level: low expression is represented by green, and high expression is represented by red. DCIS, ductal carcinoma in situ.

The first of these transcripts, TM4SF1, was also the most dramatically downregulated gene in DCIS when compared with normal breast libraries (-442.6-fold; P = 0.0083). The transmembrane proteins TM4SF1, also known as the tetraspanin superfamily, are implicated in diverse signal transduction events that play a role in the regulation of cell development, cell proliferation, differentiation and motility [36]. The tetraspanins are associated with adhesion receptors of the integrin family and regulate integrin-dependent cell migration [36]. In the present study, the loss in gene expression of TM4SF1, from normal breast tissue to invasive carcinomas, appears to be a common event in the progression of breast carcinomas. In addition, downregulated levels of the TRAF4 transcript could cooperate in the evolution from DCIS to invasive carcinomas. TRAF4 is a proapoptotic gene member of the TRAF family of adaptor proteins that mediate cellular signaling by binding to various members of the tumor necrosis family receptor superfamily and interleukin-1/Toll-like receptor superfamily [37]. Interestingly, a recent study showed that overexpression of TRAF4 can induce apoptosis, playing a role in p53-mediated proapoptotic signaling in response to cellular stress [38].

Differentially expressed genes related with extracellular matrix remodeling and invasion processes

During their metastatic conversion, epithelial carcinoma cells acquire the ability to invade the surrounding tissues and later disseminate to secondary organs mostly via lymphatic vessels. The metastatic process is not just a function of acquisition of novel migratory and invasive properties by the epithelial tumor cells; the surrounding stroma also plays a critical role in this process [2]. Dramatic changes take place in order to remodel the extracellular matrix environment in response to the infiltrating cancer cells (desmoplastic reaction) [39-41]. In this sense, we identified high expression levels of several transcripts that could be a reflection of the host stromal response, such as collagen 1α1, collagen 1α2, collagen 3α1, collagen 6α1, fibronectin I, fibrilli, microfibrillar-associated protein 2, and Spondin 2. It is known that the proteolytic degradation of the extracellular matrix is more than the simple removal of a physical barrier to invasion; such processes and the increased expression of the involved genes are known to also significantly influence mechanisms controlling cell proliferation [42]. Matrix metalloproteinases are zinc-dependent endopeptidases involved in matrix degradation and tissue remodeling [43]. These endopeptidases are capable of degrading both the extracellular matrix and basement membrane, physical barriers that play important roles in preventing against expanding growth and migration of cancer cells [44]. It is therefore widely accepted that overexpression of matrix metalloproteinases is associated with cancer-cell invasion and metastasis. A member of the matrix metalloproteinase family (MMP-2) was highly expressed (29.1-fold; P = 0.0008) in IDC libraries in comparison with in situ carcinomas. MMP-2 has been shown overexpressed in various human tumors, including breast cancer [45,46]. To no surprise and as observed in other studies, we also detected significant increases in SPARC (286-fold; P = 0.0003) and a new related gene SPARC-like1 (21.7-fold; P = 0.005) among the groups of genes upregulated in invasive lesions. The SPARC gene encodes for a secreted protein acid rich in cysteines also known as osteonectin [47]. This protein is involved in a variety of diverse biological processes including tissue remodeling, cell adhesion, proliferation, differentiation, matrix synthesis/turnover, angiogenesis and tumor cell migration and invasion [47]. Overexpression of the SPARC gene has been reported associated with melanoma and metastatic carcinomas of the breast, and increased SPARC expression has been observed in conjunction with increased c-Jun and Fra-1 expression in a panel of invasive breast cancer cell lines [48]. Human SPARC-like1, also known as mast9 or hevin, is a member of the SPARC protein family. Interestingly, previous reports indicated downregulation of SPARC-like1 in prostate and colon carcinomas [49,50]. Contrary to these observations, we observed consistent high expression of this transcript across all IDC libraries. Functional assays suggest that SPARC-like1 may serve as an antagonist to cell adhesion, playing a key role in the inhibition of attachment, and may facilitate spreading of endothelial cells on fibronectin substrates [51]. Taken together, these expression profiles suggest that MMP-2, SPARC and SPARC-like1 are probably critical mediators of extracellular matrix remodeling and are all important in facilitating breast cancer invasion and progression. Other genes commonly expressed at high levels in invasive carcinomas and of much lower expression in DCIS and normal breast tissues include lumican (LUM/LDC) (56.7-fold; P = 0.0011), versican (CSPG2) (18.2-fold; P = 0.0017), vimentin (VIM) (17.7-fold; P = 0.0014), decorin (DCN/PG2) (13.9-fold; P = 0.0007) and adlican (DKFZp564I1922) (12-fold; P = 0.0086). Lumican and decorin are members of the small leucine-rich proteoglycan family of proteins [40]. Several studies have demonstrated that small leucine-rich proteoglycan proteins can modulate cellular behavior, including cell migration and proliferation during tumor growth. Furthermore, the high expression level of lumican was associated with high tumor grade and was expressed specifically in breast cancer tissues, but not in normal breast tissues, suggesting that lumican is differentially expressed during breast tumor progression [40,52]. These findings suggest that lumican may play an important role in breast cancer growth. Recent studies have suggested that expression of increased amounts of versican, a chondroitin sulphate proteoglycan, in neoplastic tissues may play a role in promoting tumor cell proliferation and migration [53]. Abnormal versican deposition has been observed in a number of tumor types, including breast cancer [54]. Furthermore, it has been suggested that the versican-rich extracellular matrices exert an anti-adhesive effect on cells, thus facilitating tumor-cell migration and invasion [55]. Vimentin is a type III intermediate filament normally expressed in cells of mesenchymal origin [56]. However, numerous studies have now demonstrated that vimentin can also be expressed in epithelial cells involved in physiological or pathological processes requiring epithelial cell migration [57]. Vimentin has indeed been described in migratory epithelial cells involved in embryological and organogenesis processes and tumor invasion [58]. Also, vimentin antisense transfection in vimentin-expressing breast cell lines was shown to reduce their in vitro invasiveness or migration, strongly emphasizing a functional contribution of vimentin to epithelial cell invasion/migration [59].

Conclusions

Using comprehensive gene expression profiling by means of SAGE combined with a recently developed statistical approach, we identified the most consistent and statistically significant changes occurring in breast cancer progression detected by this methodology. A comparison of the genes identified in our DCIS and IDC analysis with previous observations [11,12,14,41] revealed expected similarities. More importantly, several genes were identified in our analysis that were not previously reported or detected in other SAGE studies. This suggests that the comparative analysis we performed of normal breast tissue, DCIS and invasive carcinomas by means of the modified t test appears statistically rigorous and applicable to SAGE studies in which multiple libraries are compared. In the present study we observed that deregulation of genes involved in the control of cell proliferation, apoptosis and mammary gland development are frequently altered at the in situ stage (Fig. 5). Meanwhile, alterations in the expression of genes related to the cell cycle and extracellular matrix remodeling (proteinases, collagenases, cysteine proteinases), and several transcripts related to cell adhesion and motility, were abundantly deregulated at the invasive carcinoma stage (Fig. 5). Additional analysis and validation of the identified genes will be required to determine the clinical value, and to determine whether they may constitute novel targets for translational research.
Figure 5

Schematic model portraying some of the most significant transcriptomic changes observed in breast cancer progression. DCIS, ductal carcinoma in situ; IDC, invasive ductal carcinoma.

Competing interests

None declared.

Abbreviations

DCIS = ductal carcinoma in situ; IDC = invasive ductal carcinoma; NF = nuclear factor; SAGE = serial analysis of gene expression; TNF = tumor necrosis factor.

Additional File 1

Complete list of differentially expressed genes between normal breast epithelium and ductal carcinoma in situ (p < 0.05). See Click here for file

Additional File 2

Complete list of differentially expressed genes between ductal carcinoma in situ and invasive ductal carcinoma (p < 0.05). See Click here for file
  57 in total

Review 1.  Control of apoptosis by Rel/NF-kappaB transcription factors.

Authors:  M Barkett; T D Gilmore
Journal:  Oncogene       Date:  1999-11-22       Impact factor: 9.867

2.  Overexpression of clusterin in human breast carcinoma.

Authors:  M Redondo; E Villar; J Torres-Muñoz; T Tellez; M Morell; C K Petito
Journal:  Am J Pathol       Date:  2000-08       Impact factor: 4.307

3.  Oncostatin M and leukemia inhibitory factor regulate the growth of normal human breast epithelial cells.

Authors:  S L Grant; A M Douglas; G A Goss; C G Begley
Journal:  Growth Factors       Date:  2001       Impact factor: 2.511

4.  In silico comparison of the transcriptome derived from purified normal breast cells and breast tumor cell lines reveals candidate upregulated genes in breast tumor cells.

Authors:  Maarten R Leerkes; Otavia L Caballero; Alan Mackay; Humberto Torloni; Michael J O'Hare; Andrew J G Simpson; Sandro J de Souza
Journal:  Genomics       Date:  2002-02       Impact factor: 5.736

5.  SPARC regulates the expression of collagen type I and transforming growth factor-beta1 in mesangial cells.

Authors:  A Francki; A D Bradshaw; J A Bassuk; C C Howe; W G Couser; E H Sage
Journal:  J Biol Chem       Date:  1999-11-05       Impact factor: 5.157

6.  Expression of MMP-2 and MMP-9, their inhibitors, and the activator MT1-MMP in primary breast carcinomas.

Authors:  J L Jones; P Glynn; R A Walker
Journal:  J Pathol       Date:  1999-10       Impact factor: 7.996

7.  Cancer statistics, 2000.

Authors:  R T Greenlee; T Murray; S Bolden; P A Wingo
Journal:  CA Cancer J Clin       Date:  2000 Jan-Feb       Impact factor: 508.702

8.  The PKD1 gene product, "polycystin-1," is a tyrosine-phosphorylated protein that colocalizes with alpha2beta1-integrin in focal clusters in adherent renal epithelia.

Authors:  P D Wilson; L Geng; X Li; C R Burrow
Journal:  Lab Invest       Date:  1999-10       Impact factor: 5.662

9.  Combining serial analysis of gene expression and array technologies to identify genes differentially expressed in breast cancer.

Authors:  M Nacht; A T Ferguson; W Zhang; J M Petroziello; B P Cook; Y H Gao; S Maguire; D Riley; G Coppola; G M Landes; S L Madden; S Sukumar
Journal:  Cancer Res       Date:  1999-11-01       Impact factor: 12.701

10.  Vimentin contributes to human mammary epithelial cell migration.

Authors:  C Gilles; M Polette; J M Zahm; J M Tournier; L Volders; J M Foidart; P Birembaut
Journal:  J Cell Sci       Date:  1999-12       Impact factor: 5.285

View more
  54 in total

Review 1.  Regulation of major histocompatibility complex class I molecule expression on cancer cells by amyloid precursor-like protein 2.

Authors:  Haley L Peters; Amit Tuli; Mahak Sharma; Naava Naslavsky; Steve Caplan; Richard G MacDonald; Joyce C Solheim
Journal:  Immunol Res       Date:  2011-10       Impact factor: 2.829

2.  DNAJB6 induces degradation of beta-catenin and causes partial reversal of mesenchymal phenotype.

Authors:  Aparna Mitra; Mitchell E Menezes; Lalita A Shevde; Rajeev S Samant
Journal:  J Biol Chem       Date:  2010-06-03       Impact factor: 5.157

3.  Simultaneous Proteomic Discovery and Targeted Monitoring using Liquid Chromatography, Ion Mobility Spectrometry, and Mass Spectrometry.

Authors:  Kristin E Burnum-Johnson; Song Nie; Cameron P Casey; Matthew E Monroe; Daniel J Orton; Yehia M Ibrahim; Marina A Gritsenko; Therese R W Clauss; Anil K Shukla; Ronald J Moore; Samuel O Purvine; Tujin Shi; Weijun Qian; Tao Liu; Erin S Baker; Richard D Smith
Journal:  Mol Cell Proteomics       Date:  2016-09-25       Impact factor: 5.911

Review 4.  Mechanistic pathways and biological roles for receptor-independent activators of G-protein signaling.

Authors:  Joe B Blumer; Alan V Smrcka; Stephen M Lanier
Journal:  Pharmacol Ther       Date:  2006-11-28       Impact factor: 12.310

5.  Amyloid precursor-like protein 2 association with HLA class I molecules.

Authors:  Amit Tuli; Mahak Sharma; Xiaojian Wang; Laura C Simone; Haley L Capek; Steven Cate; William H Hildebrand; Naava Naslavsky; Steve Caplan; Joyce C Solheim
Journal:  Cancer Immunol Immunother       Date:  2009-01-31       Impact factor: 6.968

Review 6.  Next-generation sequencing: a powerful tool for the discovery of molecular markers in breast ductal carcinoma in situ.

Authors:  Hitchintan Kaur; Shihong Mao; Seema Shah; David H Gorski; Stephen A Krawetz; Bonnie F Sloane; Raymond R Mattingly
Journal:  Expert Rev Mol Diagn       Date:  2013-03       Impact factor: 5.225

7.  Role of heparan sulfatases in ovarian and breast cancer.

Authors:  Ashwani Khurana; Daniah Beleford; Xiaoping He; Jeremy Chien; Viji Shridhar
Journal:  Am J Cancer Res       Date:  2013-01-18       Impact factor: 6.166

8.  Dysregulated expression of Fau and MELK is associated with poor prognosis in breast cancer.

Authors:  Mark R Pickard; Andrew R Green; Ian O Ellis; Carlos Caldas; Vanessa L Hedge; Mirna Mourtada-Maarabouni; Gwyn T Williams
Journal:  Breast Cancer Res       Date:  2009-08-11       Impact factor: 6.466

9.  Nuclear detection of Y-box protein-1 (YB-1) closely associates with progesterone receptor negativity and is a strong adverse survival factor in human breast cancer.

Authors:  Edgar Dahl; Abdelaziz En-Nia; Frank Wiesmann; Renate Krings; Sonja Djudjaj; Elisabeth Breuer; Thomas Fuchs; Peter J Wild; Arndt Hartmann; Sandra E Dunn; Peter R Mertens
Journal:  BMC Cancer       Date:  2009-11-24       Impact factor: 4.430

10.  Comparison of breast cancer to healthy control tissue discovers novel markers with potential for prognosis and early detection.

Authors:  Michèl Schummer; Ann Green; J David Beatty; Beth Y Karlan; Scott Karlan; Jenny Gross; Sean Thornton; Martin McIntosh; Nicole Urban
Journal:  PLoS One       Date:  2010-02-09       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.