| Literature DB >> 26416693 |
Preethi Krishnan1, Sunita Ghosh2,3, Bo Wang4, Dongping Li5, Ashok Narasimhan6, Richard Berendt7,8, Kathryn Graham9, John R Mackey10,11, Olga Kovalchuk12, Sambasivarao Damaraju13,14.
Abstract
BACKGROUND: Prognostication of Breast Cancer (BC) relies largely on traditional clinical factors and biomarkers such as hormone or growth factor receptors. Due to their suboptimal specificities, it is challenging to accurately identify the subset of patients who are likely to undergo recurrence and there remains a major need for markers of higher utility to guide therapeutic decisions. MicroRNAs (miRNAs) are small non-coding RNAs that function as post-transcriptional regulators of gene expression and have shown promise as potential prognostic markers in several cancer types including BC.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26416693 PMCID: PMC4587870 DOI: 10.1186/s12864-015-1899-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Demographics of the samples chosen for the study
| Characteristics | Discovery cohort | External validation cohort |
|---|---|---|
| ( | ( | |
| Median age at diagnosis in years (range) | 50 (24 – 79) | 54.5 (35 – 90) |
| Median follow up time from diagnosis in days (range) | 2927.5 (170 – 6125) | 1881.5 (174 – 3807) |
| Molecular subtypes | ||
| Luminal A | 62 | 51 |
| Luminal B | 2 | 0 |
| Luminal B HER2 | 10 | 18 |
| Triple Negative | 30 | 15 |
| Menopausal status | ||
| Pre | 37 | 24 |
| Post | 75 | 46 |
| Peri | 11 | 3 |
| Unknown | 1 | 11 |
| Family history of Breast Cancer | ||
| Yes | 40 | N/A |
| No | 58 | N/A |
| Unknown | 6 | N/A |
|
| ||
| I | 8 | 25 |
| II | 79 | 47 |
| III | 16 | 12 |
| IV | 1 | 0 |
| Overall Grade | ||
| Low | 36 | N/A |
| High | 67 | N/A |
| Unknown | 1 | N/A |
|
| ||
| Alive | 58 | 57 |
| Dead | 46 | 27 |
| Relapse Status | ||
| Relapse | 61 | N/A |
| No relapse | 43 | N/A |
| Treatment type | ||
| Adjuvant | 79 | 84 |
| Neoadjuvant | 25 | 0 |
N/A = Not available
Fig. 1Overall Workflow of the study. In literature, there are two widely used approaches to identify RNAs with prognostic significance – Case–control approach and Case only. In the former approach, only differentially expressed (DE) RNAs are considered for survival analysis whereas in the latter approach, all of the profiled RNAs are considered for survival analysis which therefore aids in identifying prognostic RNAs which would have otherwise not been identified in case–control. While either of the two approaches has been adopted in literature, both the approaches have been followed in this study. FF = Fresh Frozen; FFPE = Formalin Fixed Paraffin Embedded; Normalization* = Reads per kilobase per million (RPKM); FDR = False Discovery Rate; OS = Overall Survival; RFS = Recurrence Free Survival
Fig. 2Unsupervised hierarchical clustering (HC) using differentially expressed miRNAs. Unsupervised hierarchical clustering of 80 differentially expressed miRNAs was performed using Euclidean as distance measure and Average linkage method for linkage analysis. HC shows normal and tumor tissues as distinct clusters. 48 miRNAs were up-regulated in tumor and 32 miRNAs were down-regulated in tumor relative to normal tissues. Rows represent miRNAs and columns represent samples
miRNAs significant for overall survival (Discovery cohort)
| A. Case–control approach | ||
| miRNA ID | Univariate Cox p-value | Permuted p-value |
| hsa-miR-15a-5p | 0.02 | 0.03 |
| hsa-miR-660-5p | 0.03 | 0.04 |
| hsa-miR-574-3p | 0.08 | 0.07 |
| hsa-miR-27a-3p | 0.06 | 0.07 |
| B. Case-only approach | ||
| miRNA ID | Univariate Cox p-value | Permuted p-value |
| hsa-miR-210-3p | 0.01 | 0.02 |
| hsa-miR-15a-5p | 0.02 | 0.03 |
| hsa-miR-660-5p | 0.03 | 0.04 |
| hsa-miR-146b-5p | 0.04 | 0.05 |
| hsa-miR-374a-3p | 0.04 | 0.05 |
| hsa-miR-374a-5p | 0.04 | 0.06 |
| hsa-miR-27a-3p | 0.06 | 0.07 |
| hsa-miR-574-3p | 0.08 | 0.07 |
| hsa-miR-221-3p | 0.07 | 0.08 |
| hsa-miR-196a-5p | 0.07 | 0.09 |
| hsa-miR-425-5p | 0.05 | 0.10 |
A: 80 miRNAs were differentially expressed with Fold change > 2.0 and at a FDR cut off <0.05. All 80 miRNAs were subjected to Univariate Cox proportional hazards regression and permutation test (n = 10,000) for Overall Survival (OS). Four miRNAs were significant for OS and were used to construct a risk score. Univariate Cox p-value is the unpermuted p-value for Univariate Cox model. B: All the miRNAs (n = 147) retained after filtering (minimum 10 read counts in at least 90 % samples) in cases were considered for further analysis. 11 miRNAs were significant for OS with permuted p-value ≤ 0.1 and were considered for constructing a risk score. Univariate Cox p-value is the unpermuted p-value for Univariate Cox model
miRNAs significant for recurrence free survival (Discovery cohort)
| A. Case–control approach | ||
| miRNA ID | Univariate Cox p-value | Permuted p-value |
| hsa-miR-193b-3p | 0.09 | 0.09 |
| hsa-miR-15a-5p | 0.08 | 0.10 |
| B. Case-only approach | ||
| miRNA ID | Univariate Cox p-value | Permuted p-value |
| hsa-miR-210-3p | 0.01 | 0.02 |
| hsa-miR-425-5p | 0.05 | 0.08 |
| hsa-miR-193b-3p | 0.09 | 0.09 |
| hsa-miR-15a-5p | 0.08 | 0.10 |
A: 80 miRNAs were differentially expressed with Fold change > 2.0 and FDR cut off 0.05. All 80 miRNAs were subjected to Univariate Cox proportional hazards regression and permutation test (n = 10,000) for Recurrence Free Survival (RFS). Two miRNAs were significant for RFS with permutation p value ≤ 0.1 and were used for constructing risk score. Univariate Cox p-value is the unpermuted p-value for Univariate Cox model. B: All the miRNAs (n = 147) retained after filtering (minimum 10 read counts in at least 90 % samples) in cases were considered for further analysis. Four miRNAs were significant for RFS with permuted p-value ≤ 0.1 and were considered for constructing a risk score. Univariate Cox p-value is the unpermuted p-value for Univariate Cox model
Univariate and Multivariate results for overall survival (Discovery cohort)
| A. Case–control approach | ||||
| Parameter | Univariate analysis | Multivariate analysis | ||
| HR (95 % CI) | p-value | HR (95 % CI) | p-value | |
| Risk score | 2.44 (1.28 – 4.68) | 0.01 | 2.71 (1.38 – 5.35) | 0.004 |
| Tumor stage | 0.42 (0.22 – 0.81) | 0.01 | 0.36 (0.18 – 0.74) | 0.01 |
| Tumor grade | 1.93 (0.99 – 3.75) | 0.05 | ||
| Age at diagnosis | 1.05 (1.02 – 1.09) | 0.003 | 1.04 (1.01 – 1.07) | 0.02 |
| TNBC status | 0.88 (0.43 – 1.77) | 0.71 | ||
| B. Case-only approach | ||||
| Parameter | Univariate analysis | Multivariate analysis | ||
| HR (95 % CI) | p-value | HR (95 % CI) | p-value | |
| Risk score | 2.48 (1.34 – 4.61) | 0.004 | 2.76 (1.47 – 5.19) | 0.002 |
| Tumor stage | 0.42 (0.22 – 0.81) | 0.01 | 0.37 (0.19 – 0.72) | 0.004 |
| Tumor grade | 1.93 (0.995 – 3.75) | 0.05 | ||
| Age at diagnosis | 1.05 (1.02 – 1.09) | 0.003 | ||
| TNBC status | 0.88 (0.43 – 1.77) | 0.71 | ||
A and B: The four and 11 miRNAs from Table 1A and B respectively were used to construct risk scores. Receiver Operating Characteristics Curve was used to dichotomize cases into low and high-risk groups. Univariate Cox proportional hazards regression model was run for risk score and for other clinical parameters. In the multivariate analysis, risk score was significant with p < 0.05 after adjusting for confounders.
HR Hazard Ratio; CI Confidence Interval; TNBC Triple Negative Breast Cancer
Fig. 3Kaplan-Meier plots for Overall Survival (Discovery cohort). Kaplan-Meier plots were used to estimate OS in Case–control approach (a) and Case-only approach (b). Log rank test was performed to assess differences in survival between the two risk groups. Patients belonging to the high-risk group had shorter OS in both (a) and (b)
Univariate and multivariate results for recurrence free survival (Discovery cohort)
| A. Case–control approach | ||||
| Parameter | Univariate analysis | Multivariate analysis | ||
| HR (95 % CI) | p-value | HR (95 % CI) | p-value | |
| Risk score | 1.95 (1.16 – 3.29) | 0.01 | 2.27 (1.33 -3.88) | 0.003 |
| Tumor stage | 0.42 (0.23 – 0.76) | 0.01 | 0.34 (0.18 – 0.65) | 0.001 |
| Tumor grade | 1.52 (0.88 – 2.63) | 0.14 | ||
| Age at diagnosis | 1.02 (0.99 – 1.05) | 0.29 | ||
| TNBC status | 0.75 (0.39 – 1.41) | 0.37 | ||
| B. Case-only approach Parameter | ||||
| Parameter | Univariate analysis | Multivariate analysis | ||
| HR (95 % CI) | p-value | HR (95 % CI) | p-value | |
| Risk score | 1.68 (0.99 – 2.82) | 0.05 | 1.85 (1.09 – 3.14) | 0.02 |
| Tumor stage | 0.42 (0.23 – 0.79) | 0.01 | 0.38 (0.20 – 0.71) | 0.003 |
| Tumor grade | 1.52 (0.88 – 2.63) | 0.14 | ||
| Age at diagnosis | 1.02 (0.99 – 1.05) | 0.29 | ||
| TNBC status | 0.75 (0.39 – 1.41) | 0.37 | ||
A and B: The two and four miRNAs from Table 2A and B respectively were used to construct risk scores. Receiver Operating Characteristics Curve was used to dichotomize samples into low and high-risk groups. Univariate Cox proportional hazards regression model was run for risk score and for other clinical parameters. In the multivariate analysis, risk score was significant with p < 0.05 after adjusting for confounders.
HR Hazard Ratio; CI Confidence Interval; TNBC Triple Negative Breast Cancer
Fig. 4Kaplan-Meier plots for Recurrence Free Survival (Discovery cohort). Kaplan-Meier plots were used to estimate RFS in Case–control approach (a) and Case-only approach (b). Log rank test was performed to assess differences in survival between the two risk groups. Patients belonging to the high-risk group had shorter RFS in both (a) and (b)
Fig. 5qRT-PCR validations of select miRNAs. a One up-regulated miRNA (miR-660-5p, FC = 12.8) was validated in a subset of samples (9 normal samples and 56 tumor samples). b Three down-regulated miRNAs (miR-574-3p, miR-99b-5p and miR-769-5p) were validated in a subset of samples (11 normal samples and 60 tumor samples). All the miRNAs were significantly (* = p < 0.05) differentially expressed, similar to the results obtained in NGS platform. miR-574-3p and miR-660-5p were also found to be associated with Overall Survival
Identification of mRNA targets for miRNAs significant for OS and/or RFS
| miRNA ID | Target Identification | Gene Ontology | ||
|---|---|---|---|---|
| Number of mRNA targets identified from TargetScan | Number of predicted targets overlapping with mRNA expression dataset | Total number of annotation clusters | Number of clusters with enrichment score ≥ 1.3 | |
| hsa-miR-15a-5p | 1275 | 181 | 48 | 15 |
| hsa-miR-27a-3p | 1212 | 183 | 47 | 13 |
| hsa-miR-193b-3p | 222 | 30 | 4 | 0 |
| hsa-miR-574-3p | 13 | 2 | 0 | 0 |
| hsa-miR-660-5p | 149 | 25 | 2 | 1 |
| hsa-miR-210-3p | 32 | 6 | 1 | 0 |
| hsa-miR-146b-5p | 226 | 31 | 4 | 1 |
| hsa-miR-374a-3p; hsa-miR-374a-5p | 680 | 110 | 22 | 9 |
| hsa-miR-221-3p | 446 | 60 | 25 | 13 |
| hsa-miR-196a-5p | 295 | 46 | 13 | 5 |
| hsa-miR-425-5p | 212 | 24 | 2 | 0 |
Gene targets for 12 miRNAs significant in survival analysis were identified using in silico prediction (TargetScan) and were confirmed with in-house mRNA-miRNA matched expression dataset (n = 17). Gene ontology terms were identified for mRNAs overlapping with in-house dataset using DAVID bioinformatics tool. Only annotation clusters with Enrichment Score (ES) ≥ 1.3 were considered. However, no cluster with ES ≥ 1.3 could be identified for miR-193b-3p, miR-574-3p, miR-210-3p and miR-425-5p
Fig. 6Kaplan-Meier plot for Overall Survival (External validation cohort). Kaplan-Meier plots were used to estimate OS in Case-only approach. Log rank test was performed to assess differences in survival between the two risk groups. Patients belonging to the high-risk group had shorter OS
Univariate and multivariate results for overall survival (External Validation cohort/TCGA)
| Parameter | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|
| HR (95 % CI) | p-value | HR (95 % CI) | p-value | |
| Risk score | 2.16 (0.92 – 5.05) | 0.08 | 2.07 (0.87 – 4.92) | 0.101 |
| Tumor stage | 0.32 (0.13 – 0.78) | 0.01 | 0.26 (0.1 – 0.67) | 0.005 |
| Age at diagnosis | 1.03 (1.003 – 1.06) | 0.03 | ||
| TNBC status | 0.63 (0.19 – 2.12) | 0.46 | ||
The eleven miRNAs identified as significant for OS in CO approach from the discovery set was validated using TCGA dataset. Risk score was constructed using the 11 miRNAs and an optimal cut-off point was estimated using Receiver Operating Characteristics Curve, which dichotomized the samples into low and high-risk groups. Univariate Cox proportional hazards regression model was run for risk score and for other clinical parameters. In the multivariate analysis, risk score was significant with p = 0.1 after adjusting for tumor stage.
HR Hazard Ratio; CI Confidence Interval; TNBC Triple Negative Breast Cancer