| Literature DB >> 21479216 |
Victor Trevino1, Mahlet G Tadesse, Marina Vannucci, Fatima Al-Shahrour, Philipp Antczak, Sarah Durant, Andreas Bikfalvi, Joaquin Dopazo, Moray J Campbell, Francesco Falciani.
Abstract
Statistical modelling, in combination with genome-wide expression profiling techniques, has demonstrated that the molecular state of the tumour is sufficient to infer its pathological state. These studies have been extremely important in diagnostics and have contributed to improving our understanding of tumour biology. However, their importance in in-depth understanding of cancer patho-physiology may be limited since they do not explicitly take into consideration the fundamental role of the tissue microenvironment in specifying tumour physiology. Because of the importance of normal cells in shaping the tissue microenvironment we formulate the hypothesis that molecular components of the profile of normal epithelial cells adjacent the tumour are predictive of tumour physiology. We addressed this hypothesis by developing statistical models that link gene expression profiles representing the molecular state of adjacent normal epithelial cells to tumour features in prostate cancer. Furthermore, network analysis showed that predictive genes are linked to the activity of important secreted factors, which have the potential to influence tumor biology, such as IL1, IGF1, PDGF BB, AGT, and TGFβ.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21479216 PMCID: PMC3068146 DOI: 10.1371/journal.pone.0016492
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Accuracy, size, and gene content of representative models developed from normal and tumour data.
|
|
|
|
|
|
|
| WDR18, ZNF146, MBD3, UBE4A, | 86.1 (7) | RPS2, CCL13, VCP, | 78.0(9) |
|
| LUZP1, SORL1, | 97.4 (7) |
| 92.0(10) |
|
| BZRPL1, | 89.7 (8) | HBA1/2, PABPC1/3, | 92.5(9) |
|
| TMSB4X/L3, SLC6A7, AA524802, ABCC10, INHBB,
SULT2B1, PHYHIP, SLC1A5, | 91.6 (12) | VIM, R42599, ARF1, RBM3, EIF4G2,
| 90.0(14) |
|
|
| 89.2 (13) |
| 97.4(8) |
|
|
| 96.5 (6) | AA420602, H19, | 100.0(9) |
|
| FOLR1, APOD, | 93.8 (14) |
| 97.1(8) |
|
| MOCOS | 94.4 (7) |
| 97.1(8) |
Accuracy (Acc) are expressed in percentage and model size are shown in brackets. Marked genes in bold and asterisk appear in both methods (GA-MLHD and BVS). Dataset is indicated. CP+N – Capsular Penetration class from Normal data, CP+T – Capsular Penetration Tumour, GS+N – Gleason Score Normal, GS+T – Gleason Score Tumour.
Figure 1Multivariate Models for Capsular Penetration using Normal data.
The figure shows the heat maps representing the expression profile of genes selected by the GA and BVS models in both Lapointe and Singh datasets from the normal tissue data. Each quadrant in the figure represents a combination of a modelling approach and a specific dataset. Genes present in GA-MLHD and BVS for the same dataset are highlighted in red. Accuracy is reported below each heatmap. GeneBank accession number and gene symbol are shown on the left side of the heatmap. Brighter green or red colours in heatmaps represent lower or higher relative expression respectively. t-test p-value is shown for comparison with the differential expression criteria commonly used in univariate variable selection approaches.
Figure 2Principal component representation for Capsular Penetration using Normal Data.
The figure shows the result of a PCA representing sample separation on the basis of the expression in normal tissue of genes selected by the modelling procedures. Each quadrant in the figure represents a combination of a modelling approach and a specific dataset. Each quadrant contains a 2D plot representing the separation of capsular penetration negative (black close circles) and positive (red close circles) samples (plots B, D, F and H) and a bar chart (plots A, C, E and G) representing the PC loadings (x axis) for each gene component (y axis). Note that PC loadings represent the contribution of every gene to class separation. Dashed lines delimitated genes with larger contribution that are discussed in the manuscript. Genes present in GA-MLHD and BVS for the same dataset are highlighted in red.
Figure 3Accuracy and Tissue specificity of representative models.
The predictive accuracy of the models developed using normal tissue (panel A, filled circles) is comparable to those models developed using tumour tissue (panel B, filled diamonds). When models developed using normal tissue are trained and tested using data from tumour tissue, the prediction power is decreased considerably (empty circles). Likewise, tumour models trained and tested with data from normal tissue are also non predictive (empty diamonds).
Figure 4Functional networks representing known interaction between genes expressed in normal tissue and selected in the models predictive of capsular penetration.
The figure represents the four most significant networks selected by the IPA software. Genes represented by blue shapes are present in the collection of models collected by the GA-MLHD procedure. Genes represented with red shapes represent genes in the collection of models but also included in the representative most predictive models. Genes in the networks are arranged by cellular localization (extracellular, membrane, cytoplasm and nucleus). Note that the IPA software search for statistically significant sub-networks of a given maximum size to simplify their visualization. Nevertheless, in this case these are linked as indicated by red dashed arrows connecting specific network components.
Figure 5Analysis of LCM cell populations representative of prostate cancer progression.
The figure represents the results of the analysis performed on the dataset developed by Tomlins et al. [11]. Different cell populations are labelled as follows. Normal cells (norm), normal cells adjacent the tumour (adj), benign prostate hyperplasia (MPH), low grade prostate carcinoma (L-PCA), high-grade prostate carcinoma (H-PCA) and metastatic cells (meta). Panel A shows a two-dimensional cluster analysis performed on the genes differentially expressed (p<0.01) across the seven LCM purified normal and tumour epithelial cell populations. Panel B represents the expression level (y axis) of genes differentially expressed between norm, adjacent and BPH (represented on the y axis). Levels of individual genes across all stages are presented in panels C-F and in .