| Literature DB >> 31704995 |
Ståle Nygård1,2, Ole Christian Lingjærde1,3,4, Carlos Caldas5, Eivind Hovig1,6, Anne-Lise Børresen-Dale3, Åslaug Helland3,7,8, Vilde D Haakensen9,10.
Abstract
Gene expression profiling of tumours is an important source of information for cancer patient stratification. Detecting subtle alterations of gene expression remains a challenge, however. Here, we propose a novel tool for high-sensitivity detection of differential pathway activity in tumours. For a pathway defined by a collection of genes, the samples are projected onto a low-dimensional manifold in the subspace spanned by those genes. For each sample, a score is next found by calculating the distance between each projected sample and the projection of a subgroup of reference samples. Depending on the aim of the analysis and the available data, the reference samples may represent e.g. normal tissue or tumour samples with a particular genotype or phenotype. The proposed tool, PathTracer, is demonstrated on gene expression data from 1952 invasive breast cancer samples, 10 DCIS, 9 benign samples and 144 tumour adjacent normal breast tissue samples. PathTracer scores are shown to predict survival, clinical subtypes, cellular proliferation and genomic instability. Furthermore, predictions are shown to outperform those obtained with other comparable methods.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31704995 PMCID: PMC6841931 DOI: 10.1038/s41598-019-52529-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Schematic overview of the PathTracer method. (b) Principal curve based on first two principal components. (c) Samples are projected onto the principal curve. Euclidean distance between the sample point p and the reference point c, i.e the length of the dotted line, defines the deregulation score for sample j.
Figure 2(a) Heatmap of pathways with AUC > 0.98 in separating normal (n = 144) and tumour samples (n = 1971). Blue and red colours indicate low and high PathTracer score, respectively. (b) Heatmap of pathways with AUC > 0.7 in separating tumour samples with (n = 387) and without (n = 1008) TP53-mutation. Blue and red colours indicate low and high PathTracer score, respectively. (c) Kaplan Meier curves of breast cancer specific surivival for the Metabric dataset for the six clusters identified in (a). (d) Kaplan Meier curves of breast cancer specific survival for the Metabric dataset with tumour samples only, for the five clusters identified in (b).
Figure 3(a) Box plots of the Pearson correlation between deregulation scores (PTS/PDS) and Genome Instability Index (GII) (left) and proliferation index (right) using the Metabric dataset with n = 1971 tumour samples and n = 144 normal samples and 1288 human Reactome pathways. (b) Projection of samples onto the principal curve for the Cell cycle pathway using the Metabric dataset. The samples are coloured according to PAM50 subtype. The median PahtTracer score for each subtype is given by the length of the dotted lines. (c) Comparison of Pathifier (PDS) and PahtTracer (PTS) deregulation scores for each of the PAM50 subtypes using the Metabric dataset. (d) Projection of samples onto the principal curve for the DNA repair pathway using the Metabric dataset. The samples are coloured according to PAM50 subtype. The median PathTracer score for each subtype is given by the length of the dotted lines. (e) Comparison of Pathifier (PDS) and PahtTracer (PTS) deregulation scores for each of the PAM50 subtypes using the Metabric dataset.
Proportion of significant pathways in Cox regression of overall survival.
| Threshold | PDS | PTS | Ratio |
|---|---|---|---|
| 0.05 | 0.537 | 0.680 | 1.186 |
| 0.04 | 0.514 | 0.650 | 1.265 |
| 0.03 | 0.481 | 0.615 | 1.279 |
| 0.02 | 0.443 | 0.559 | 1.262 |
| 0.01 | 0.375 | 0.469 | 1.251 |
| 0.005 | 0.313 | 0.386 | 1.233 |
| 0.001 | 0.198 | 0.237 | 1.196 |
| 0.0001 | 0.078 | 0.119 | 1.525 |
| 0.00001 | 0.013 | 0.051 | 3.920 |
The first column in the table is the threshold used to determine if a Cox-regression P-value is significant, the second column is the proportion of pathways significant with Pathway Deregulation Score (PDS) as covariate, the third column is the proportion of pathways significant with PathTracer Score (PTS) as covariate, and the fourth column is the ratio between the third and the second column. In the figure the threshold values (first column) are plotted against the proportion of significant pathways PDS (black colour) and PTS (red colour).