| Literature DB >> 29093438 |
Moustafa Abdalla1,2,3,4, Danh Tran-Thanh2, Juan Moreno2, Vladimir Iakovlev2, Ranju Nair1, Nisha Kanwar1,2, Mohamed Abdalla2, Jennifer P Y Lee2, Jennifer Yin Yee Kwan2, Thomas R Cawthorn1,2, Keisha Warren1, Nona Arneson1, Dong-Yu Wang1, Natalie S Fox5,6, Bruce J Youngson2,7, Naomi A Miller2,7, Alexandra M Easson8, David McCready8, Wey L Leong8, Paul C Boutros5,6,9, Susan J Done10,11,12,13.
Abstract
Almost all genomic studies of breast cancer have focused on well-established tumours because it is technically challenging to study the earliest mutational events occurring in human breast epithelial cells. To address this we created a unique dataset of epithelial samples ductoscopically obtained from ducts leading to breast carcinomas and matched samples from ducts on the opposite side of the nipple. Here, we demonstrate that perturbations in mRNA abundance, with increasing proximity to tumour, cannot be explained by copy number aberrations. Rather, we find a possibility of field cancerization surrounding the primary tumour by constructing a classifier that evaluates where epithelial samples were obtained relative to a tumour (cross-validated micro-averaged AUC = 0.74). We implement a spectral co-clustering algorithm to define biclusters. Relating to over-represented bicluster pathways, we further validate two genes with tissue microarrays and in vitro experiments. We highlight evidence suggesting that bicluster perturbation occurs early in tumour development.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29093438 PMCID: PMC5665998 DOI: 10.1038/s41467-017-01357-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1The first principal component of gene expression directly corresponds to proximity to tumour. a Schematic of the relative locations of all epithelial sample extractions, with the nipple and tumour highlighted. Samples D1 and D2 were obtained along the duct, approaching the tumour. Sample O1 is an epithelial sample from a breast duct on the other side of the nipple. b Biplot of the first two principal components of the expression matrix, limited to top 10% most varied genes. This proximity-based separation between epithelial samples is observed with the biplot including all genes, but is clearest here. c Biplot for the corresponding aCGH data; with no proximity-to-tumour separation in either of the first two principal components. PC, principal component
Fig. 2Expression-based classifier can identify where the epithelial samples were obtained from relative to the tumour. a Five-fold cross validation, depicting the receiver-operating curve (ROC) response of using each patient as a test; mean micro-average AUC = 0.74. b Original unclustered data set: columns depicted modules of genes identified through a hierarchical-clustering based gene agglomeration approach and rows corresponding to each epithelial sample. c Spectral co-clustering reveals biclusters of samples and modules that are both downregulated (modules 4, 6, 12, 13, 15, 18 and 19) and upregulated (modules 1, 2, 3, 5, 8, 10 and 16) in tumours and adjacent-to-tumour epithelium. All genes in these modules were selected a priori as univariately informative of the proximity-to-tumour label (top 30% of all genes; see Text). Clustering was done to further reveal structure within these univariately informative genes. Modules that are downregulated are negatively correlated with proximity to tumour (Table 1), and similarly, modules that are upregulated are positively correlated with proximity to tumour (Table 1). The sampling clustering has a Rand index of 0.21 with the distance-based grouping of the samples; the bottom three sample clusters are largely composed of tumour (T) and adjacent-to-tumour epithelial samples (D2). In contrast, the top cluster is composed largely of distant and contralateral duct epithelial samples (D1 and O1, respectively)
Tabulated summary of Pearson correlation coefficients between the modules identified with spectral co-clustering and proximity to tumour, to three significant digits
| Modules | Pearson’s |
| Bonferroni adjusted | Significant |
|---|---|---|---|---|
| 0 | −0.355 | 0.054 | 1.075 | |
| 1 | 0.597 | 0.000 | 0.009 | ** |
| 2 | 0.612 | 0.000 | 0.006 | ** |
| 3 | 0.672 | 0.000 | 0.001 | ** |
| 4 | −0.412 | 0.023 | 0.460 | |
| 5 | 0.542 | 0.002 | 0.037 | ** |
| 6 | −0.654 | 0.000 | 0.002 | ** |
| 7 | −0.228 | 0.226 | 4.519 | |
| 8 | 0.460 | 0.010 | 0.204 | |
| 9 | 0.157 | 0.406 | 8.118 | |
| 10 | 0.556 | 0.001 | 0.027 | ** |
| 11 | 0.101 | 0.595 | 11.908 | |
| 12 | −0.478 | 0.007 | 0.145 | |
| 13 | −0.608 | 0.000 | 0.007 | ** |
| 14 | 0.173 | 0.359 | 7.181 | |
| 15 | −0.383 | 0.036 | 0.723 | |
| 16 | 0.632 | 0.000 | 0.003 | ** |
| 17 | −0.176 | 0.351 | 7.016 | |
| 18 | −0.672 | 0.000 | 0.001 | ** |
| 19 | −0.511 | 0.004 | 0.074 |
Significant modules are denoted with **. Full list of genes and modules is available in Supplementary Data 3. All genes in these modules were univariately informative of the proximity-to-tumour label
Fig. 3MSI2 overexpression activates the Wnt cascade. a MSI2 overexpression results in a significant 1.5-fold linear increase in TOPFLASH Luciferase activity, relative to Flag control (paired Student’s t test; p < 0.05; please note the log scale). b MSI2 overexpression mediates β-catenin localisation to the nucleus, with no change in the total intracellular β-catenin levels. Left panel: representative immunofluorescence image of control MCF7 (nucleus is highlighted with blue DAPI staining; β-catenin is stained green; and MSI2 is red). β-catenin is largely localised to the membrane and the cytosol. Right panel: representative immunofluorescence image of MCF7 MSI2 overexpression clones, with β-catenin localisation to the nucleus
Fig. 4MSI overexpression increases migration and invasion of MCF7 and MDA-MB-231 MCF7-expressing GFP and MSI2-GFP were counted by Vi-cell-XR, and equally plated on transwells with and without Matrigel. Migration a and invasion b qualities were assessed by counting cells 48 h after initial plating. MSI2 overexpression causes an increase in migration and invasion (Welch’s t test; p < 0.05). Box plots of all replicates within each experiment (n = 3) are depicted. MDA-MB-231-expressing GFP and MSI2-GFP were plated on transwells with and without Matrigel. 48 h after initial plating, the transwells were counted for migration c and invasion d. MSI2 increased both the migrative capabilities and invasive tendencies of MDA-MB-231 (Welch’s t test; p < 0.05). As with the MCF7, box plots of all replicates within each experiment (n = 4) are depicted. Migration e and invasion f were also assayed in MCF7 shRNA control and shRNA MSI2, respectively. The knockdown clones presented the opposite effect, with a significant decrease in the migration and invasion ability of MCF7 (Welch’s t test; p < 0.05). Proliferation assay results are summarised in Supplementary Fig. 3