| Literature DB >> 31159741 |
A C Picornell1, I Echavarria2, E Alvarez3, S López-Tarruella4, Y Jerez4, K Hoadley5, J S Parker5, M Del Monte-Millán4, R Ramos-Medina4, J Gayarre4, I Ocaña4, M Cebollero6, T Massarrah4, F Moreno7, J A García Saenz7, H Gómez Moreno8, A Ballesteros9, M Ruiz Borrego10, C M Perou11, M Martin12.
Abstract
BACKGROUND: Full RNA-Seq is a fundamental research tool for whole transcriptome analysis. However, it is too costly and time consuming to be used in routine clinical practice. We evaluated the transcript quantification agreement between RNA-Seq and a digital multiplexed gene expression platform, and the subtype call after running the PAM50 assay in a series of breast cancer patients classified as triple negative by IHC/FISH. The goal of this study is to analyze the concordance between both expression platforms overall, and for calling PAM50 triple negative breast cancer intrinsic subtypes in particular.Entities:
Keywords: Breast cancer; Multiplexed gene expression; PAM50; RNA-Seq; Triple negative breast cancer
Mesh:
Substances:
Year: 2019 PMID: 31159741 PMCID: PMC6547580 DOI: 10.1186/s12864-019-5849-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1PAM50 subtype calls by technique. Barplot represents counts of samples per subtype and technique. The cross table shows in detail the discordances between both platforms
Centroid correlation for the potential discordant sample calls
These measures are extracted from the PAM50 assay outcome (Additional file 1: Table S1). The sample’s subtype classification is assigned to the centroid with the highest correlation (in bold red). When the second centroid has a value close to the highest one (difference less or equal to 0.1) the classification is ambiguous being possible any of both subtypes (bold *). The Discordance column summarizes whether a real discordance is observed in a sample or just a scenario where two centroid correlations are almost equivalent (HUGM-0047 in NanoString nCounter® is a paradigmatic case)
Fig. 2Separate centroid correlation when NanoString nCounter® and RNA-Seq platforms are compared. The blue line represents the linear regression. The grey area surrounding it represents the confidence interval
Fig. 3Correlation of the correlation to the centroids in both platforms obtained in the PAM50 subtype classifier
Fig. 4Correlation of ROR and ROR + PS and their associated Bland-Altman plots in both platforms. The upper/lower dashed lines in the Bland-Altman plots represent the mean difference +/− 1.96 * standard deviation. The central dashed line represents the mean difference
Fig. 5Normalized gene expression levels for each gene contained in the PAM50 assay. The log2 normalized counts for RNA-Seq are represented in the X-axis and those for NanoString nCounter® are represented in the Y-axis. The red line represents the LOWESS smoother, which uses locally weighted polynomial regression
Risk of Recurrence (ROR) for the potential discordant sample calls
| Sample ID | ROR | ROR-Group | ROR-PS | ROR + PS-Group | Discordance | |
|---|---|---|---|---|---|---|
| NanoString nCounter | HCSC-0027 | 51.10 | med | 43.42 | med | NO |
| HLPR-002 | 43.57 | med | 23.44 | med | YES | |
| HUGM-0022 | 51.60 | med | 42.28 | med | NO | |
| HUGM-0029 | 73.51 | high | 68.98 | high | NO | |
| HUGM-0047 | 81.24 | high | 72.48 | high | NO | |
| INEN-0017 | 66.66 | high | 55.051 | high | NO | |
| INEN-0021 | 72.50 | high | 65.40 | high | NO | |
| RNA-Seq | HCSC-0027 | 35.25 | med | 33.04 | med | NO |
| HLPR-002 | 16.87 | low | 10.68 | low | YES | |
| HUGM-0022 | 40.68 | med | 32.44 | med | NO | |
| HUGM-0029 | 75.02 | high | 69.59 | high | NO | |
| HUGM-0047 | 73.73 | high | 69.58 | high | NO | |
| INEN-0017 | 62.68 | high | 52.98 | high | NO | |
| INEN-0021 | 66.98 | high | 64.52 | high | NO |
These measures are extracted from the PAM50 assay outcome (Additional file 1: Table S1). Two measures regarding the risk of recurrence are reported: ROR, which takes into account only the subtype calling; and ROR + PS, which considers also the Proliferation Score. The latter is defined as the mean expression level for the proliferation genes: CCNB1, UBE2C, BIRC5, KNTC2, CDC20, PTTG1, RRM2, MKI67, TYMS, CEP55 and CDCA1. The ROR(+PS)-Group columns gives a categorical classification in terms of risk: high, medium (med) and low