Literature DB >> 29333087

Antioxydation And Cell Migration Genes Are Identified as Potential Therapeutic Targets in Basal-Like and BRCA1 Mutated Breast Cancer Cell Lines.

Maud Privat1,2, Justine Rudewicz2, Nicolas Sonnier1,2,3, Christelle Tamisier2, Flora Ponelle-Chachuat1,2, Yves-Jean Bignon1,2,3.   

Abstract

Basal-like breast cancers are among the most aggressive cancers and effective targeted therapies are still missing. In order to identify new therapeutic targets, we performed Methyl-Seq and RNA-Seq of 10 breast cancer cell lines with different phenotypes. We confirmed that breast cancer subtypes cluster the RNA-Seq data but not the Methyl-Seq data. Basal-like tumor hypermethylated phenotype was not confirmed in our study but RNA-Seq analysis allowed to identify 77 genes significantly overexpressed in basal-like breast cancer cell lines. Among them, 48 were overexpressed in triple negative breast cancers of TCGA data. Some molecular functions were overrepresented in this candidate gene list. Genes involved in antioxydation, such as SOD1, MGST3 and PRDX or cadherin-binding genes, such as PFN1, ITGB1 and ANXA1, could thus be considered as basal like breast cancer biomarkers. We then sought if these genes were linked to BRCA1, since this gene is often inactivated in basal-like breast cancers. Nine genes were identified overexpressed in both basal-like breast cancer cells and BRCA1 mutated cells. Amongst them, at least 3 genes code for proteins implicated in epithelial cell migration and epithelial to mesenchymal transition (VIM, ITGB1 and RhoA). Our study provided several potential therapeutic targets for triple negative and BRCA1 mutated breast cancers. It seems that migration and mesenchymal properties acquisition of basal-like breast cancer cells is a key functional pathway in these tumors with a high metastatic potential.

Entities:  

Keywords:  BRCA1; Basal-like breast cancer; RNA-Seq; antioxydation; cell migration

Mesh:

Substances:

Year:  2018        PMID: 29333087      PMCID: PMC5765739          DOI: 10.7150/ijms.20508

Source DB:  PubMed          Journal:  Int J Med Sci        ISSN: 1449-1907            Impact factor:   3.738


Introduction

Basal like breast cancers (BLBC) represent between 10 and 20% of breast cancers. They are associated with an aggressive phenotype, high histological grade, poor clinical behavior, and high rates of relapse 1. This cancer subgroup is characterized by lack of estrogen receptor (ER), progesterone receptor (PR), and HER2 amplification (TNBC: triple-negative breast cancers) with expression of basal cytokeratins 5/6, 14, 17, epidermal growth factor receptor (EGFR), and/or c-KIT. Currently, BLBC lack any specific targeted therapy, due to the fact that they do not express ER or HER2 and thus are typically refractory to endocrine therapy and to trastuzumab, a humanized monoclonal antibody that targets HER2. The identification of new markers and therapeutic targets is thus necessary for this bad prognosis cancer type. First, BRCA1-associated BC are mostly BLBC 2 and sporadic BLBC (occurring in women without germline BRCA1 mutations) often show dysfunction of the BRCA1 pathway. The characteristics of hereditary BRCA1-associated BC found in sporadic BLBC cancers have thus been termed «BRCA-ness» with potential clinical implications 3. As BRCA1 pathway may be deficient in BLBC, these tumors may respond to specific therapeutic regimens, such as inhibitors of the poly (ADP-ribose) polymerase (PARP) enzyme 4. Cells deficient in BRCA1 have indeed a defect in the repair of DNA double strand breaks which could make them particularly sensitive to the chemotherapy drugs that generate such breaks, such as inhibitors of PARP enzyme. However, not all BLBC are associated with BRCA1 inactivation. Then, EGFR could represent a therapeutic target as it is often overexpressed in BLBC. Recently, a phase II clinical trial showed good results (57% of pathological complete response) of panitumumab combined with an anthracycline/taxane-based chemotherapy in operable triple-negative breast cancer 5. Nevertheless, this study highlighted biological signatures correlated with treatment response. Heterogeneity of triple negative breast cancers requires subtyping in order to better identify molecular-based therapy. In 2006 already, Neve et al. separated BLBC cell lines in two subgroup (basal A and basal B) with different invasive properties 6. Lehmann et al. then identified 6 triple-negative breast cancer subtypes including 2 basal-like (BL1 and BL2), an immunomodulatory (IM), a mesenchymal (M), a mesenchymal stem-like (MSL) and a luminal androgen receptor (LAR) subtype 7. All these subclassification of triple negative breast cancers were identified by studying transcriptomic profiles. Epigenetic modifications in breast cells could also allow identifying characteristics of these breast cancers. Roll et al. reported a hypermethylator phenotype in BLBC, characterized by methylation-dependent silencing of CEACAM6, CDH1, CST6, ESR1, GNA11, MUC1, MYB, SCNN1A, and TFF3 genes that are involved in a wide range of neoplastic processes relating to tumors with poor prognosis 8. Our results of Methyl-Seq did not confirm this hypermethylator phenotype but we could not identify hypermethylated BLBC specific genes. On the other hand the RNA-Seq data allowed us to identify antioxidation and cell migration as specifically activated pathways in basal-like breast cancer cells.

Materials and methods

Biological material

The main characteristics of the cell lines used are presented in table 1. MDA-MB-231 and HCC1937 human breast cancer cell lines were purchased from the American Type Culture Collection (Rockville, MD, USA) and were grown in RPMI medium supplemented with 10% foetal calf serum, 2 mM L-glutamine and 20 μg/ml gentamicin. SUM149 and SUM1315 human breast cancer cell lines were obtained from Asterand (Hertfordshire, UK) and grown in Ham's F12 medium according to the manufacturer's instructions. SUM1315MO2 cells were transfected with a pLXSN plasmid containing the full-length BRCA1 cDNA using Fugene 6 transfection reagent (Roche Molecular Biochemicals). Control cells were transfected with the pLXSN empty vector. After selection in 721.5 µM G418 (Sigma Aldrich), clones were tested for BRCA1 expression by Western blotting 9. All cell lines were grown at 37˚C in a humidified atmosphere containing 5% CO2. All our cell lines are stored and managed by the CJP Biological Resources Center (BB-0033-00075).
Table 1

Main characteristics of the cell lines.

Cell lineSite of originPathologyMolecular type (6)Triple negative subtype (7)BRCA1 statusTP53 status
MCF10ANormal breastFibrocysticBasal B-Wild typeWild Type
MCF7Pleural effusionAdenocarcinomaLuminal-Wild typeWild Type
T47DPleural effusionAdenocarcinomaLuminal-Wild typeMissense mutation
MDA231Pleural effusionAdenocarcinomaBasal BMSLWild typeMissense mutation
MDA436Pleural effusionAdenocarcinomaBasal BMSL5382insCNonsense mutation
HCC1937Primary tumorInfiltrating ductal carcinomaBasal ABL15396 + 1G>ANonsense mutation
SUM149Primary tumorInflammatory breast carcinomaBasal BBL22288delTMissense mutation
SUM1315Skin metastasisInfiltarting ductal carcinomaBasal B-185delAGMissense mutation
SUM1315-LXSN (SL)Skin metastasisInfiltarting ductal carcinomaBasal B-185delAGMissense mutation
SUM1315-BRCA1 (SB)Skin metastasisInfiltarting ductal carcinomaBasal B-185delAG + sauvageMissense mutation

Cell immunohistochemistry

Cells were fixed in Preservcyt solution (Thinprep) and cytoblocks were prepared with Shandon Cytoblock kit (Thermo Scientific). Hormone receptors (ER and PR), HER2, EGFR and cytokeratin status were studied as already described 5. The immunostainings were scored semi-quantitatively by an expert pathologist under a light upright microscope.

Nucleic acid extraction

DNA extraction was performed using the QIAamp DNA mini kit (Qiagen) for cell lines and the QIAamp DNA micro kit (Qiagen) for tumors. RNA extraction was performed using RNeasy mini kit (Qiagen). The quantity and quality of the nucleic acids obtained were measured spectrophotometrically at 260nm and 280nm. RNA were also checked on 2100 Bioanalyzer (Agilent Technologies).

RNA sequencing and data processing

First, mRNA were purified using Oligotex mRNA mini kit (Qiagen). cDNA libraries were then generated following the GS-FLX Titanium cDNA Rapid Library Preparation Method Manual (Roche). Finally, emPCR amplification and 454 sequencing were performed according to the manufacturer's protocol (emPCR Amplification Manual- Lib-L LV and Sequencing Method Manual-GS FLX Titanium Series, Roche). RNA-Seq data are available in the ArrayExpress database (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-5465. Sequence reads were aligned on the human genome (hg19) with GS Reference Mapper software (Roche) and mapped on the human exome using a home-made software named AGSA. Data was then normalized by calculating the 'reads per kilo base per million mapped reads' (RPKM) for each gene. When the RPKM value was below the threshold of 0.3, then it was considered as background noise and replaced by zero.

Validation of gene regulation by q-RT-PCR

Total RNAs were extracted from cell lines using RNeasy mini kit (Qiagen) according to the manufacturer's protocol. Quality of RNAs was checked using the 2100 BioAnalyzer (Agilent Technologies). Five microgram RNA was then reverse-transcribed using First-strand cDNA synthesis kit (GE Healthcare). Multiplex quantitative RT-PCR was performed using a 7900HT Fast Real-Time PCR System (Applied Biosystems). Predesigned and validated gene-specific probe-based Taq-Man Gene Expression Assays were used and relative gene expression was determined using the comparative threshold cycle method. Ribosomal 18S was chosen as the endogenous control gene.

Methyl-DNA sequencing and data processing

First, DNA was fragmented by nebulization during 1min30sec at 2.1 bar of nitrogen pressure. After DNA purification using Qiaquick PCR purification kit (Qiagen), methylated DNA was captured using MethylCap kit (Diagenode) following the supplier recommendations. Libraries were generated using GS FLX Titanium Rapid Library Preparation Kit (Roche). Finally, emPCR amplification and 454 sequencing were performed according to the manufacturer's protocol (emPCR Amplification Manual- Lib-L LV and Sequencing Method Manual-GS FLX Titanium Series, Roche). Methyl-Seq data are available in the ArrayExpress database (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-5468. Sequence reads were aligned on the human genome (hg19) with GS Reference Mapper software (Roche) and mapped on the human proximal promotors using a home-made software named AGSA. Data was then normalized by calculating the 'reads per million mapped reads' (RPM) for each gene. When the RPM value was below the threshold of 0.3, then it was considered as background noise and replaced by zero.

TCGA data analysis

Both clinical and RNA sequencing data (Illumina HiSeq RNAseq Version 2 data) of invasive breast cancers were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 449 patients with information on ER, PR and HER2 status were selected to compare the expression profiles of genes in the respective tumors. 71 cases were found to have a negative ER, PR end HER2 phenotype (i.e., triple-negative), whereas 371 cases were positive for at least one of these receptors.

Statistical analysis

Statistical analysis of data was performed using R software. Significant differences between cell line groups were sought by Wilcoxon test. Statistical overrepresentation test were performed using PANTHER classification system 10. For TCGA data, Student's t-test was used to assess statistical differences in mean normalized expression between triple negative and non triple negative groups. A p-value ≤ 0.05 was considered statistically significant.

Results

Data normalization

Our transcriptome data consisted of 16,008 genes before and 13,168 genes after RPKM normalization. Methylome data consisted of 6,140 genes before and 6,109 genes after RPM normalization. Eliminated genes are those whose expression or methylation, for each cell line, did not exceed the background noise. For the transcriptome, the average standard of each cell line data was much higher than the median and the third quartile revealing a very high concentration of data around zero and a very large number of reads for some genes as observed in the box plots (Fig 1 and Table 2). Regarding the methylome, median and third quartile were zero while the average was between 0.8 and 1.8 readings per gene revealing again a very high concentration of data around zero and a very large number of reads for some genes. In addition, for the transcriptome and methylome, all cell lines displayed a strong deviation above the average of the number of reads per gene. This revealed a very high dispersion of data.
Fig 1

Data standardization. The distribution of transcriptome and methylome data are represented in boxplot. For most of the cell lines, the distribution has many values ​​close to zero and a minority of extreme values.

Table 2

Summary of transcriptome and methylome data.

MCF10AMCF7T47DMDA231MDA436HCC1937SUM149SUM1315SLSB
Transcriptome (RPKM)Mean3.082.112.122.872.072.992.582.241.912.01
Median0.550.530.580.570.530.600.580.540.480.49
3rd quartile1.561.471.581.761.451.821.631.461.371.43
Maximum1213.981033.06645.80774.17982.73975.89965.45517.51453.00664.35
Standard deviation21.1211.919.98014.6512.3114.5515.5611.658.339.43
Methylome (RPM)Mean0.950.890.951.531.340.880.830.750.920.76
Median0.000.000.000.000.000.000.000.000.000.00
3rd quartile0.000.000.000.000.000.000.000.000.000.00
Maximum17.5637.7157.8742.3370.1227.3039.1420.6952.6042.95
Standard deviation1.772.082.703.063.901.992.331.612.332.18

Means, medians, 3rd quartile, maximum values and standard deviations of the normalized data RPKM (transcriptome) and RPM (methylome) are presented for each cell line.

From RPKM or RPM data, log normalization was performed to dilate the low values and strengthen high values. Standardization was also performed to obtain a standardized normal distribution. These two normalizations could be coupled to give reduced centered log normalized data.

Non supervised analysis

First we investigated how breast cancer cell lines clustered by RNA-Seq and Methyl-Seq. For RNA-Seq, a hierarchical clustering on the RPKM normalized RNA-Seq data was generated from Euclidean distances according to Ward's method (Fig 2). This hierarchical clustering was performed for the base matrix (Fig 2A), the log normalized data (Fig 2B), the standardized data (Fig 2C) and the log-standardized data (Fig 2D). This showed that whatever the normalization, the SUM1315 lines, SB and SL are always classified together, like the two luminal T47D and MCF7 cell lines. As already observed with RNA microarrays, transcriptomic analyse by RNA-Seq thus allowed to separate luminal and basal-like breast cancer cells.
Fig 2

Ascending hierarchical classification of breast cancer cell line RNA-Seq data. Classification of cell lines according to the Euclidean distances of gene expression for the basic matrix (A), log normalized (B), centered reduced (C) and log normalized and centered reduced (D).

In contrast, the benign MCF10A line appears to be different from the tumor lines only for the base matrix (Fig 2A). This is in agreement with the fact that this cell line was classified as basal-like in several studies 6,11. For Metyl-seq, a hierarchical clustering on the RPM normalized Methyl-Seq data was generated from Euclidean distances according to Ward's method (Fig 3). This hierarchical clustering was performed for the base matrix (Fig 3A), the log normalized data (Fig 3B), the standardized data (Fig 3C) and the log-standardized data (Fig 3D). Metyl-seq data were neither massively influenced by breast cancer subtype nor by BRCA1 mutation. It seems like MDA231 and MDA436 present a hypermetylated phenotype. These two cell lines have thus a higher mean of methylation (1.535 and 1.343 versus 0.7 to 0.95 for all the other cell lines).
Fig 3

Ascending hierarchical classification of breast cancer cell line Methyl-Seq data. Classification of cell lines according to the Euclidean distances of gene methylation for the basic matrix (A), log normalized (B), centered reduced (C) and log normalized and centered reduced (D).

Search for genes most significantly regulated

To determine whether the data are normally distributed, a Kolmogorov-Smirnov test was performed by taking as reference the normal distribution on basic matrix (data), normalized log (data log), centered reduced (data C & R) and centered reduced normalized log (log data C & R). For transcriptome and methylome data, the p-value was less than 2.10-16 for the four matrices. Thus, the p-value is very significantly lower the first degree risk α, set at 0.01. It is recognized that the data does not follow a normal distribution. Wilcoxon non-parametric statistical test was thus performed for each gene to select genes significantly regulated between two groups of cell lines.

Subtype: 1205 genes with p<0.05

First of all, we compared the luminal to the basal cell lines. 1205 genes were found significantly different for expression in luminal versus basal-like cell lines (p<0.05). Among them, we found well-known basal-specific genes, such as EGFR, VIM, CAV1 and CAV2 1. Conversely, ESR1, coding for the estrogen receptor alpha, is only expressed in the two luminal cell lines. Luminal keratins (KRT8, KRT18 and KRT19) are also significantly more expressed in luminal cell lines. For the genes used for clinical classification of basal-like breast cancers, we could compare our RNA-Seq data to immunohistochemistry results (Table 3). A very good correlation was observed between mRNA expression analysed by RNA-Seq and protein expression studied by immunocytochemistry.
Table 3

Comparison of RNA-Seq and immunohistochemistry data.

GeneProteinMCF10AMCF7T47DMDA231MDA436HCC1937SUM149SUM1315SLSB
RNA-SeqESR1ERalpha01.050.880000000
IHCER-+ (90%)+ (50%)-------
RNA-SeqPGRPR000.450000000
IHCPR-+ (40%)+ (80%)-------
RNA-SeqERBB2HER20.401.310.900.4600.430.36000
IHCHER2----------
RNA-SeqKRT5CK5148.0000011.76.3000
KRT6ACK6102.900000.88.2000
KRT6BCK66.8000005.1000
KRT6CCK63.700000.32.4000
IHCCK5/6+ (90%)----+ (25%)+ (25%)---
RNA-SeqKRT14CK1476.1000.305.50.4000
IHCCK14+ (80%)----+ (20%)+ (<1%)---
RNA-SeqEGFREGFR0.90000.880.820.501.220.440.510.62
IHCEGFR+ (100%)-+ (40%)+ (100%)+ (90%)+ (100%)+ (100%)+ (90%)+ (60%)+ (80%)

RNA-Seq results are presented as RPKM values. Results of immunohistochemistry are presented as negative (-) or positive (+), specifying the percentage of labeled cells.

IHC: immunohistochemistry; ER: estrogen receptor; PR: progesteron receptor; CK: cytokeratin.

With the objective of identifying new therapeutic targets for basal-like breast cancers, we selected genes that were highly expressed (>10 RPKM) and significantly up-regulated in basal cell lines. This reduced the list to 77 candidate genes (Table 4). Thanks to the expression data extracted from the TCGA project "Breast Invasive Carcinoma", we confirmed significant overexpression for 48 genes of this list in triple negative breast tumors compared to non triple-negative breast tumors. Among them, some have already been described as basal-like markers, such as Annexin A1 12,13 and Vimentin 14,15.
Table 4

List of the 77 genes highly expressed and significantly up-regulated in basal-like breast cancer cell lines.

MCF7T47DMCF10AMDA231HCC1937MDA436SUM149SUM1315SLSB
MIR198014,1993,95774,17975,89982,74814,74517,5282,5270,38
UQCRHL #58,1677,51430,89182,15184,14222,29352,09412,39223,07181,27
TMSB10 #25,429,40150,63508,74111,4885,40215,62244,01142,27192,18
PFN1 #77,6279,56152,67276,20206,92233,86257,91111,56137,20122,80
S100A6 #22,9819,45416,84181,7687,12197,18213,62134,5377,2878,92
MIR63824,2825,5356,63163,59310,9527,4926,7837,2945,6943,01
S100A2 *#00396,331,836,931,0358,9513,3912,774,03
LGALS1 #4,338,9794,7331,8440,8366,7488,9276,6445,5635,35
MRPL51 #9,486,1033,8840,2583,6153,4779,5953,9043,9033,53
GADD45G IP1 #20,2113,1056,2546,8146,2752,1354,9861,4249,4144,08
VIM *#0,5004,1666,680,7745,395,5592,39111,7957,37
GBA3 #23,0113,3338,6063,1141,7626,4747,5044,6742,6543,55
SF3B5 #21,5711,7360,9126,8630,7565,2822,7344,0540,8647,65
C12orf5715,123,3619,9044,0539,1116,0870,0962,2734,2425,15
SOD112,5018,3728,3732,5732,4543,4732,2840,6032,8841,36
SEP154,654,2018,8868,5088,4312,5322,0023,3815,9220,07
RHOA *18,0815,9431,0223,7134,3431,1130,4939,1742,3326,96
ANXA1 #00,7428,2548,4868,2932,6723,7021,0515,5919,83
CTSL12,721,468,665,4511,9742,067,1997,4835,7548,36
SEC61G #6,179,3131,3115,7425,8625,4028,0955,5518,6422,24
PRKCDBP #0012,95110,2858,211,4816,035,365,9711,87
GPX10045,8721,0756,0026,4232,336,8213,1510,05
CAV10019,909,0611,4012,188,7062,7335,8544,37
MIR548G7,94017,2054,0517,8323,5826,8420,8619,9319,37
IGFBP3 #000,523,0635,625,030,427,2371,3272,54
MRPL18 #11,3210,2820,1425,4131,2422,6722,0425,2623,3221,50
TOMM5 #5,418,7122,6737,9032,0423,6410,3524,7313,8817,39
NOL7 #5,338,4510,2527,7822,538,9411,4242,4328,9029,18
NCL #12,2112,2118,5620,2012,6040,1020,9724,8020,6520,23
LOC100134713 #1,050,4045,2632,0438,6313,575,7818,7712,099,76
MIR2210022,9018,1216,7120,6522,8331,3512,8830,41
DRAP14,315,408,4850,7453,099,5612,6010,8310,2410,17
PRNP #3,604,9426,8231,7839,996,2519,8215,226,4115,54
SNX3 *#7,774,9420,6925,2640,0923,5710,6217,0813,869,34
GADD45A *0,781,598,9131,5932,6217,732,6720,1927,4512,92
TM4SF1 #0,5103,2737,4425,036,819,9726,0318,6726,40
MRPL34 #12,2011,1929,6419,4519,3118,2615,3118,1013,4918,97
MANF *#3,784,368,9452,2630,048,2013,0013,6916,198,36
PRDX1 #7,426,6215,438,6220,4516,2421,7621,9719,5026,71
MLF2 #9,787,3213,4113,1526,3620,4723,3716,4419,1916,94
C17orf89 #4,005,1913,2132,8251,747,7412,948,258,0511,30
PARK74,739,4617,0928,6429,0311,8123,9912,1810,7912,07
NDUFA13,533,6227,6031,0016,1715,7017,7322,275,957,62
WBP5 #01,769,7242,2611,1831,024,5920,0015,799,17
MRPS339,149,8334,5911,7517,3917,4316,1913,8115,7910,16
EEF1E1 #4,185,2915,9841,8524,6812,1220,587,805,378,01
EBNA1BP2 #3,207,189,5823,1018,6810,839,7117,3226,0016,38
TGFBI #0014,000,361,690,987,1828,4525,4553,07
TIMMDC14,624,317,569,9645,5314,1712,6716,416,678,83
SSBP1 #4,594,4422,0317,0217,6412,737,9217,3012,9011,92
NDUFA64,886,1422,3013,7220,1315,8515,9116,977,087,03
DEGS13,712,4015,7017,4714,809,6224,4713,437,9014,16
POMP4,664,5314,3115,1618,076,2714,0118,7114,0015,72
FAM96B #2,862,497,9820,2737,203,5815,769,4514,007,22
CNIH42,382,7025,5424,5215,685,4225,055,205,074,73
TUBB6*#3,223,1411,6031,306,524,956,5414,0220,8814,37
MGST3 #2,933,5928,316,6113,6312,6210,6914,6912,3110,71
MRPL505,924,3515,897,4819,599,2113,5218,7313,8410,01
EMP31,180,712,5323,7512,148,895,8920,7317,7714,68
ITGB1 *1,713,113,2412,4213,205,086,8127,4321,6312,56
CAPG #0,442,7216,1224,8014,308,3810,0012,289,436,45
ARGLU15,764,2212,5414,8323,0411,0613,229,796,3410,58
NOP16 #7,043,5517,4711,5811,9311,6411,6717,289,5210,16
HIST1H2BC001,290,733,721,724,5943,4621,8122,94
UQCR115,032,0543,527,936,626,198,9910,776,978,51
VKORC1L13,514,516,7320,5735,7510,507,376,234,626,97
MRPS15 #3,554,5315,288,9016,478,1311,8611,2412,7712,19
UBA52 #4,683,1718,728,218,3311,3614,4717,478,438,56
MRPS6 #1,553,2510,226,784,268,2116,3826,159,3812,48
TRAPPC3 #5,104,3811,1613,8611,947,539,1914,1413,2112,60
SSU72 #7,687,1712,5213,8015,577,808,4511,3310,3111,90
RBX1 #1,511,998,7622,4422,067,599,5910,662,053,22
COTL1 #2,132,154,195,4520,0613,866,979,8010,0213,43
DDT #3,884,6329,745,756,535,779,579,917,708,61
RPS12 #4,113,5917,477,048,509,259,5214,968,637,79
NOP56 #3,234,497,5220,8719,587,558,794,965,147,90
PRPS1L1*1,961,964,2619,409,9716,524,434,9714,806,06

RNA-Seq results are presented as RPKM values.

* : genes that are also significantly overexpressed in BRCA1 mutated (SL) compared to BRCA1 restored (SB) cell lines.

: genes that are significantly overexpressed in triple negative breast cancers in the TCGA RNAseq data.

This 77 basal-like specific genes list was also submitted to Panther statistical overrepresentation test. Two molecular functions were given as overrepresented in this list (Fig 4A): antioxydant activity (5 genes, p=0.0318), and cadherin involved in cell-cell adhesions (6 genes, p=0.0231). For these 11 genes, TCGA data were compared for triple negative and non triple negative breast cancers (Fig 4B). Significant overexpression in triple negative breast cancers was found for SOD1, MGST3 and PRDX1 antioxydation genes and for PFN1, ITGB1, ARGLU1 and ANXA1 cadherin binding genes.
Fig 4

Expression of some genes involved in antioxydation and in cadherin binding. A. Gene expression found in our RNA-Seq study are presented as RPKM values. For all these genes, significant overexpression was observed in triple negative cell lines. B. TCGA data are presented as RPKM values. **: p <0.01; ***: p<0.001

Influence of BRCA1: specific study SL / SB: 979 genes with p<0.05

We studied our BRCA1 transfected cell lines: SL and SB come from the BRCA1 mutated SUM1315 cell line that was stably transfected with empty LXSN plasmid (SL: SUM1315-LXSN) or with a BRCA1 coding plasmid (SB: SUM1315-BRCA1). As already demonstrated 9, we could check that SB cell line expressed around 5 times more BRCA1 transcripts than SL cell line. Comparing these 2 cell line RNA sequencing, we found 979 genes significantly differently expressed. Among them, 304 genes were expressed at least twice more in SL comparing SB (Table 5). This gene list was submitted to Panther statistical overrepresentation test. The molecular functions the most overrepresented included semaphorin receptor activity (5 genes, p=0.0071), growth factor binding (12 genes, p=0.00327) and cell adhesion molecule binding (21 genes, p=0.0177).
Table 5

List of the 304 genes significantly up-regulated in BRCA1 mutated cell line (SL) compared to BRCA1 wild-type cell line (SB).

SLSBSLSB
PPP1R13B0.4800.000PROCR6.6881.963
F2RL12.6200.000UPP11.9050.561
PAX80.8300.000TADA33.0760.906
FBXL160.9100.000FURIN3.9871.184
F2R0.7500.000ZNF3371.6040.479
UBE2D10.6600.000TDP11.1030.331
UBASH3B0.9000.000SAC3D12.5000.750
SMYD40.5500.000ZBED42.8130.854
DBNDD21.5400.000RFWD33.3861.030
S1PR12.7500.000ACTR81.4200.442
PLXNB10.2400.000ZNF4671.3330.417
CDC42EP34.0000.000ATG71.0530.331
DLX51.2000.000RAB11A7.8002.500
PLAGL22.6200.000LEO12.5050.806
TMEM1640.3600.000PTPN231.1410.369
SLC25A200.6100.000CPT1A1.7140.560
MRE11A0.2600.000TRIP111.2210.399
FLRT22.0000.000CRTC30.6590.217
LOC1014482020.9800.000ENO22.6690.882
NR1I30.9100.000RPLP114.8334.938
SOX91.7500.000ZNF3460.8750.292
PSTK0.6800.000FAM72D1.0000.333
HOXB91.1200.000GSE11.1250.377
SPP11.0000.000EFTUD11.5480.522
NOV1.1400.000PTPN11.5180.513
FAM173A0.9700.000TCF71.4750.500
DACT11.0600.000CSTB21.7507.417
SQRDL1.0400.000PRSS2310.2503.500
PRSS120.4400.000SEC22C1.5350.526
SERPINE21.0200.000SNAPC12.5080.863
CIB20.9700.000TGFB21.2800.442
NHLRC20.3100.000ZDHHC91.1460.396
LMBRD10.5700.000PPP1R13L1.2240.424
SORT10.7100.000PINX11.6700.583
AKAP121.6500.000TOR1A4.8501.700
RGS41.8800.000FEM1B6.7502.375
CACNG71.1400.000DDAH11.0580.375
ANKRD501.4800.000SORBS33.4361.222
THBS10.9800.000TGFB13.3451.190
KRTAP2-33.2500.000LASP11.9700.702
FGF57.7500.000C11orf713.5001.250
PCCA0.2880.000PALM2-AKAP21.0690.383
LAMB30.3180.000VAT14.0421.450
RAD51D1.0580.000MBOAT20.9720.350
MMD0.3000.000PDZD81.6210.588
EVA1A2.0000.000GPC12.7120.994
PLA2G161.1250.000ZFC3H10.9190.337
GTDC22.7500.000GTF2A25.2501.938
IQCC0.8380.000ZNF4102.0910.772
TCAIM0.5460.000TMEM126B3.3751.250
DCLRE1C0.8540.000TMEM52.6750.992
NUP2100.3940.000TRAPPC6B1.2250.458
SLC38A60.4820.000ZFR1.4730.552
EEF1A21.5290.000MDM21.3750.517
PODXL6.7770.167GOLPH32.5420.958
SIX13.0000.125NINJ11.8750.708
RAD180.8790.050TNNT12.7571.048
BAK11.4580.083USP41.2220.464
ATP5O1.1670.083TRIP131.3260.504
SPOCK11.0790.083LRIG10.5390.205
FXYD51.2170.100ALDH5A12.4690.948
ZNF121.5000.125CSTF2T3.2501.250
SLC19A21.7000.150CHCHD31.2630.488
HOXC101.3750.125SIN3A3.3341.293
TAGLN3.1670.292FIBP2.1210.826
TSR21.3330.125TSC22D41.7080.667
GEMIN20.5210.050GINS12.7041.063
FNDC11.7340.167CD912.7195.000
TAF30.6460.063HYAL22.0000.792
FAN10.8310.083GNA133.9381.563
ADCK11.4170.143FLNB1.5260.606
TMOD32.2950.250ABHD60.9420.375
RYBP1.7080.188EZR-AS16.2502.500
CTSH0.5670.063NBEAL20.6460.258
RAB27A0.7500.083KIF71.0770.431
FAM174A1.1250.125YME1L15.1812.073
ZBTB470.7420.083NAP1L12.6511.063
ALOX12B1.1040.125SRP542.8491.149
ITPKC0.5210.063SEL1L1.3850.559
CLP11.0000.125SEC23A3.2821.325
GCSH1.2920.167EAPP2.0670.838
ZDHHC31.7460.229TTC81.0290.418
MST40.7500.100TAF1D1.9630.800
FAM102A1.3690.188IKBIP5.3132.167
MORC40.8760.121IDH22.6021.064
HSBP12.9580.417NUDT50.9580.400
STC110.6251.500RALY6.7502.821
ATE11.5980.226VIM118.61149.583
ZNF6721.7500.250PHF70.8930.375
UFD1L0.5750.083SVIL1.0010.421
CTNNBIP15.0830.750MET2.5611.084
PMEPA10.8330.125TGFBR22.7981.185
RAB8B0.8830.133SMARCC11.7140.733
TDRKH0.9600.146ROCK10.8750.375
TK21.0920.167NFATC31.3020.558
FHL10.7920.125RRP1B3.0121.303
PTPN21.4670.244NUDT216.1072.643
RAB132.6460.458SEC134.4441.929
HDAC90.7130.125LARP67.6253.313
ATP1A30.2250.042ANP32B2.0360.887
ZNF5290.8750.167TPX28.5783.813
OCIAD22.8330.542KIF5B5.3632.392
RAP1GAP21.3890.266DHCR242.3611.054
RAB5A3.1040.604ZC3H142.0230.903
TTLL50.7360.144NPC27.9503.550
S100A25.6251.125UNC45A4.3061.924
DEDD3.0630.625NHLRC30.8380.375
SIMC10.7000.146TRPC4AP3.2231.450
RPSA4.7711.000ATP6V0A20.7020.317
COMMD72.1850.467APEX22.1960.992
SH3KBP11.6940.371FBXO344.4172.000
SPTLC21.4130.313TERF2IP8.0833.667
TMEM2513.3750.750RINT11.5240.692
SRPX2.3770.533SUPV3L11.8020.821
EIF1AD1.4790.333UBP12.6181.196
ASNA15.2261.192DARS21.5330.708
THAP102.3750.542HSD17B111.7300.800
PPP3CC0.7700.179MEX3B8.3753.875
ZKSCAN11.0630.250KIFC31.5310.710
PLXDC11.9080.458SMC21.9020.883
TSPAN311.3420.325DDX182.0710.962
PTPRF3.3530.813ZBTB51.7920.833
TMED104.8751.188POLG2.5311.177
UXT1.1880.292SNUPN3.2231.500
DYNLRB14.2921.063MFGE81.5270.714
WSB21.9630.492CNOT101.5950.746
UBXN2A0.7290.183ATG4B0.8510.400
VTI1B2.2920.583SLC30A61.2500.590
ITPR32.7430.707UBR71.3660.650
RAB39B3.3750.875RAB1A5.2502.500
SS180.8640.225REEP42.1411.021
SLC2A12.1840.579PGM31.8660.890
CTGF14.8503.950STYX1.0040.479
PRKACA3.1030.825PKM52.11424.886
PRPS1L17.5002.000AGPAT22.2671.083
PIP4K2A1.0030.271QRICH13.2501.558
ITPK11.9820.536CENPN2.0080.964
ELK12.1250.575BRD31.1190.539
PWWP2A1.3750.375SLC38A11.9730.955
RAC14.4581.217RARS2.7771.344
SMC50.8350.228MYBL21.4820.718
RAB183.2790.905PPM1G7.4313.600
BAHD11.0750.300RPL27A3.5081.700
ZBTB16.2081.750HDGFRP35.2502.550
EPC10.9900.281AQR0.5960.292
VPS180.9500.271GADD45A15.3137.500
CHMP2B1.8330.525QARS2.1971.089
RANBP101.0120.292NRP11.4890.738
KIF3B2.3130.667KIF234.4442.208
PUSL11.2920.375HK13.4441.713
ADAM191.5260.444SNX39.0004.500
GORAB1.7080.500TRAK12.0011.001

RNA-Seq results are presented as RPKM values.

We found 9 genes that were overexpressed both in basal-like cell lines compared to luminal cell lines and in BRCA1 mutated (SL) compared to BRCA1 restored (SB) cell lines (Fig 5A). We performed q-RT-PCR experiments in order to validate these overexpressed genes. For 7 of these genes, we could validate higher expression in basal-like cell lines (Figure 5B) and in BRCA1 mutated cell line (Figure 5C). These 7 genes can be considered as basal-like biomarkers and potential therapeutic targets, particularly in BRCA1-mutated cancers. Above these genes, four are linked to cytoskeleton and could thus be implicated in epithelial cell migration: VIM, ITGB1, RHOA and TUBB6.
Fig 5

Expression of genes overexpressed in triple negative and in BRCA1-mutated cell lines. A. Gene expressions found in our RNA-Seq study are presented as RPKM values. For all these genes, significant overexpression was observed both in triple negative versus luminal cell lines and in BRCA1 mutated (SL) versus BRCA1 (SB) restored cell lines. B. Gene overexpression in basal-like breast cancer cell lines was confirmed by q-RT-PCR. *: p <0.05 C. Gene overexpression in BRCA1 mutated (SL) compared to BRCA1 restored (SB) cell lines was confirmed by q-RT-PCR. *: p <0.05

Discussion

Our study tested RNA-Seq and Methyl-Seq as sensitive methods to categorize breast cancer tumor cells. We found that Methyl-Seq is not an appropriate method to differentiate breast cancer subtypes. The weak quantity of data generated with our technology is a limit that could probably be overcome with the latest generations of sequencers. Nevertheless it seems that the subtypes of breast cancers do not broadly influence genome methylation. Among the genes published by Roll et al.8, only ESR1 was found to be methylated in the MDA-MB-231 and MCF10A cell lines. We also observed a specific profile of the MDA-MB-231 and MDA-MB-436 cell lines, which appear to be globally hypermethylated. Deeper study will be needed to understand why these cell lines are hypermethylated. By contrast, RNA-Seq was proven as a reliable method for breast cancer classification. We showed that different subtypes could be separated by global gene clustering. Moreover transcript expression analyzed by RNA-Seq was shown to correlate with protein expression evaluated by immunohistochemistry. With the steady decrease in sequencing prices, RNA-Seq could become the standard method for the molecular characterization of breast tumors. This could make it possible to propose a personalized treatment according to the therapeutic targets overexpressed in the tumor. In our study we chose to focus on basal-like breast tumors. We thus identified some potentially new targets in this breast cancer subtype. PANTHER analysis 10 revealed that genes involved in cadherin binding were overrepresented in the basal-like overexpressed genes. For most of them, we could confirm overexpression in triple negative breast cancers thanks to the Cancer Genome Atlas data. Among them PFN1 codes for profilin1, a regulator of actin polymerization but it is known to be downregulated in breast cancers 16. In contrast, our study confirms overexpression of two genes involved in breast cancer progression, the ANXA1 and ITGB1 genes. ANXA1 was already shown to be associated with triple negative breast cancers 13,17. A recent study already identified ITGB1, that codes for integrin beta 1, as a potential prognosis biomarker in triple negative breast cancers 18. Furthermore, genes involved in cell oxidation regulation were also found to be overexpressed in basal-like breast cancers. GPX1 codes for glutathione peroxidase 1 that protects cells against oxidative stress. A polymorphism of this gene has been associated to breast cancer risk 19. The PARK7 gene (also known as DJ-1) is involved in neuron protection against oxidative stress and cell death. It was recently shown to interact with HER3 receptor 20. For these two genes, our results showed at least a 3 fold overexpression in basal-like breast cancer cell lines but in TCGA triple negative breast tumors this overexpression was not confirmed. For SOD1, PRDX1 and MGST3, triple negative cell overexpression was observed both in our cell lines and in TCGA tumors. MGST3 is another gene belonging to antioxidant system shown to be overexpressed in some melanomas 21. Superoxyde dismutase 1 is an enzyme coded by the gene SOD1. It has already been suggested as a potential anti-cancer drug 22,23. At last, PRDX1 has a controversial role in oxidization-reduction balance and its expression seems to be up-regulated in breast cancer tissues 24. Oxydative stress could thus be targeted in basal-like breast cancers. This therapeutic strategy has already been proposed in other types of cancer 25,26. BRCA1 is a double strand break repair gene known to be frequently inactivated in basal-like breast cancers. In our results, BRCA1 mutation does not appear to influence massively the transcriptome of breast cancer cells. Nevertheless, we looked for genes overexpressed both in basal-like and in BRCA1 mutated breast cancer cells. ITGB1 is one of these genes, BRCA1 could thus be proposed as a regulator of integrin beta 1. It could be particularly interesting to inhibit ITGB1 in BRCA1 mutated basal-like breast cancers. VIM gene codes for vimentin protein, a mesenchymal marker, and we also found it overexpressed in BLBC cell lines and in BRCA1 mutated cell line. It is also an interesting biomarker of triple negative and BRCA1 mutated breast cancers. MicroRNA-138, which targets vimentin, has thus been proposed as a therapeutic agent for breast cancer 28. RhoA and TUBB6 are also overexpressed in BRCA1 mutated and basal-like breast cancers. TUBB6 gene codes for a tubulin protein, the major constituent of microtubule cytoskeleton. RhoA is a small GTPase involved in actin cytoskeleton organization and it thus regulates cell shape and motility 27. Physical reorganization of the cytoskeleton appears to be important in BRCA1 mutated breast cancers. This ability to remodel the cellular form could explain the high metastatic capacities of these cancers. Targeting the proteins involved in this function could thus be an effective therapeutic strategy for basal type breast cancers.
  26 in total

1.  Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies.

Authors:  Brian D Lehmann; Joshua A Bauer; Xi Chen; Melinda E Sanders; A Bapsi Chakravarthy; Yu Shyr; Jennifer A Pietenpol
Journal:  J Clin Invest       Date:  2011-07       Impact factor: 14.808

Review 2.  Expanding roles of superoxide dismutases in cell regulation and cancer.

Authors:  Meixia Che; Ren Wang; Xiaoxing Li; Hui-Yun Wang; X F Steven Zheng
Journal:  Drug Discov Today       Date:  2015-10-19       Impact factor: 7.851

3.  GPX1 Pro198Leu polymorphism and breast cancer risk: a meta-analysis.

Authors:  Jia Hu; Guo-Wu Zhou; Ning Wang; Ya-Jie Wang
Journal:  Breast Cancer Res Treat       Date:  2010-03-21       Impact factor: 4.872

4.  Profilin1 regulates invadopodium maturation in human breast cancer cells.

Authors:  A Valenzuela-Iglesias; V P Sharma; B T Beaty; Z Ding; L E Gutierrez-Millan; P Roy; J S Condeelis; J J Bravo-Cordero
Journal:  Eur J Cell Biol       Date:  2014-12-31       Impact factor: 4.492

5.  The Expression Patterns of ER, PR, HER2, CK5/6, EGFR, Ki-67 and AR by Immunohistochemical Analysis in Breast Cancer Cell Lines.

Authors:  Kristina Subik; Jin-Feng Lee; Laurie Baxter; Tamera Strzepek; Dawn Costello; Patti Crowley; Lianping Xing; Mien-Chie Hung; Thomas Bonfiglio; David G Hicks; Ping Tang
Journal:  Breast Cancer (Auckl)       Date:  2010-05-20

6.  Breast cancer cell response to genistein is conditioned by BRCA1 mutations.

Authors:  Maud Privat; Corinne Aubel; Stéphanie Arnould; Yves Communal; Marc Ferrara; Yves-Jean Bignon
Journal:  Biochem Biophys Res Commun       Date:  2009-01-04       Impact factor: 3.575

7.  Annexin A1 Preferentially Predicts Poor Prognosis of Basal-Like Breast Cancer Patients by Activating mTOR-S6 Signaling.

Authors:  Anjana Bhardwaj; Nivetha Ganesan; Kazunoshin Tachibana; Kimal Rajapakshe; Constance T Albarracin; Preethi H Gunaratne; Cristian Coarfa; Isabelle Bedrosian
Journal:  PLoS One       Date:  2015-05-22       Impact factor: 3.240

8.  Annexin A1 expression in a pooled breast cancer series: association with tumor subtypes and prognosis.

Authors:  Marcelo Sobral-Leite; Jelle Wesseling; Vincent T H B M Smit; Heli Nevanlinna; Martine H van Miltenburg; Joyce Sanders; Ingrid Hofland; Fiona M Blows; Penny Coulson; Gazinska Patrycja; Jan H M Schellens; Rainer Fagerholm; Päivi Heikkilä; Kristiina Aittomäki; Carl Blomqvist; Elena Provenzano; Hamid Raza Ali; Jonine Figueroa; Mark Sherman; Jolanta Lissowska; Arto Mannermaa; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Kelly-Anne Phillips; Fergus J Couch; Janet E Olson; Celine Vachon; Daniel Visscher; Hermann Brenner; Katja Butterbach; Volker Arndt; Bernd Holleczek; Maartje J Hooning; Antoinette Hollestelle; John W M Martens; Carolien H M van Deurzen; Bob van de Water; Annegien Broeks; Jenny Chang-Claude; Georgia Chenevix-Trench; Douglas F Easton; Paul D P Pharoah; Montserrat García-Closas; Marjo de Graauw; Marjanka K Schmidt
Journal:  BMC Med       Date:  2015-07-02       Impact factor: 8.775

9.  Triple-negative breast cancer: BRCAness and concordance of clinical features with BRCA1-mutation carriers.

Authors:  E H Lips; L Mulder; A Oonk; L E van der Kolk; F B L Hogervorst; A L T Imholz; J Wesseling; S Rodenhuis; P M Nederlof
Journal:  Br J Cancer       Date:  2013-04-04       Impact factor: 7.640

10.  Silencing vimentin expression decreases pulmonary metastases in a pre-diabetic mouse model of mammary tumor progression.

Authors:  Z Zelenko; E J Gallagher; A Tobin-Hess; V Belardi; R Rostoker; J Blank; Y Dina; D LeRoith
Journal:  Oncogene       Date:  2016-08-29       Impact factor: 9.867

View more
  3 in total

1.  A high expression ratio of RhoA/RhoB is associated with the migratory and invasive properties of basal-like Breast Tumors.

Authors:  Maud Privat; Amélie Cavard; Yanis Zekri; Flora Ponelle-Chachuat; Ioana Molnar; Nicolas Sonnier; Yves-Jean Bignon
Journal:  Int J Med Sci       Date:  2020-10-01       Impact factor: 3.738

2.  AKT1 Transcriptomic Landscape in Breast Cancer Cells.

Authors:  Bijesh George; Bin Gui; Rajeswari Raguraman; Aswathy Mary Paul; Harikrishna Nakshatri; Madhavan Radhakrishna Pillai; Rakesh Kumar
Journal:  Cells       Date:  2022-07-25       Impact factor: 7.666

3.  A flexible model-free prediction-based framework for feature ranking.

Authors:  Jingyi Jessica Li; Yiling Elaine Chen; Xin Tong
Journal:  J Mach Learn Res       Date:  2021-05       Impact factor: 5.177

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.