| Literature DB >> 31805884 |
Michal Marczyk1,2, Chunxiao Fu3, Rosanna Lau3, Lili Du3, Alexander J Trevarton3, Bruno V Sinn4, Rebekah E Gould3, Lajos Pusztai1, Christos Hatzis1, W Fraser Symmans5.
Abstract
BACKGROUND: Utilization of RNA sequencing methods to measure gene expression from archival formalin-fixed paraffin-embedded (FFPE) tumor samples in translational research and clinical trials requires reliable interpretation of the impact of pre-analytical variables on the data obtained, particularly the methods used to preserve samples and to purify RNA.Entities:
Keywords: FFPE-based clinical assay; RNA extraction; RNA sequencing
Year: 2019 PMID: 31805884 PMCID: PMC6896723 DOI: 10.1186/s12885-019-6363-0
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Design of the study
Clinical-pathologic characteristics of the 12 breast cancer samples in this study
| Patient | Age | Grade | Stage | ER status | PR status | HER2 status |
|---|---|---|---|---|---|---|
| 16H | [50–60) | 3 | III | P | P | N |
| 16 J | [40–50) | 1 | II | P | P | N |
| 16 K | [40–50) | 2 | II | P | P | N |
| 16 W | [70–80) | 2 | I | P | N | N |
| 17E | [60–70) | 2 | I | P | P | N |
| 18Y | [40–50) | 2 | I | P | P | P |
| 19A | [50–60) | 2 | II | P | P | N |
| 19D | [80–90) | 3 | III | P | P | N |
| 19G | [50–60) | 2 | II | P | N | N |
| 19 K | [50–60) | 2 | II | P | P | N |
| 19 M | [40–50) | 3 | I | P | P | N |
| 19O | [50–60) | 2 | II | P | P | N |
Fig. 2Mapping of reads to genome and gene expression quantification results for wtRNAseq data. a Mapping summary statistics from STAR aligner. b Distribution of genomic regions in which sequencing reads were aligned. c PCA analysis based on expression levels of all protein-coding genes
Fig. 3Concordance of gene expression between FFPE and FF samples for wtRNAseq data. a Distribution of concordance correlation coefficient (CCC) for all genes within each RNA extraction kit used. b Association between gene expression and CCC value. c High expression (normalized expression higher than − 7.5) and high concordant (CCC > 0.5) genes between different kits. d Concordance of molecular signatures scores for 3 FFPE kits in comparison to FF
Descriptive statistics of concordance and LME analysis for all genes quantified by wtRNAseq in FFPE versus FF samples. Median values with median absolute deviation in brackets
| Kit | Concordance analysis | LME model | Replicates | |||
|---|---|---|---|---|---|---|
| CCC | R | Bias | Bias | Variance | Difference | |
| N | 0.629 (0.291) | 0.856 (0.122) | 0.778 (0.239) | 0.269 (0.605) | 0.123 (0.132) | 0.133 (0.097) |
| Q | 0.659 (0.266) | 0.859 (0.116) | 0.810 (0.206) | 0.229 (0.561) | 0.118 (0.129) | 0.177 (0.106) |
| R | 0.646 (0.278) | 0.863 (0.113) | 0.793 (0.226) | 0.192 (0.619) | 0.116 (0.127) | 0.216 (0.124) |
No. of differentially expressed genes (DEGs) in wtRNAseq
| Contrast | |log2FC| > 0 | |log2FC| > 0.5 | |log2FC| > 1 | |log2FC| > 2 |
|---|---|---|---|---|
| FF vs N | 12,387 | 6083 | 1125 | 31 |
| FF vs Q | 12,061 | 5361 | 757 | 14 |
| FF vs R | 12,383 | 6081 | 958 | 26 |
| N vs Q | 1 | 0 | 0 | 0 |
| N vs R | 774 | 20 | 0 | 0 |
| Q vs R | 212 | 0 | 0 | 0 |
Fig. 4Technical variance and reliability of mRNA transcripts for wtRNAseq data. a Bias estimate component of LME model (closer to 0; better). b Variance component of LME model (smaller is better) vs gene expression level. c Distribution of median of difference in expression between replicates for all genes within each RNA extraction kit. d Percentage difference in molecular signature scores between technical replicates
Fig. 5Robustness of targeted sequencing assay for SETER/PR index. a Concordance of SETER/PR between FFPE and FF samples. b Heatmap of correlation matrix between genes in SETER/PR index calculated on wtRNAseq and targeted RNAseq platforms. c Concordance of SETER/PR signature between two platforms (scatter plots on top and Bland-Altman plots on bottom)