| Literature DB >> 28036264 |
Rou Chen1, Qingzhou Guan1, Jun Cheng1, Jun He1, Huaping Liu1, Hao Cai1, Guini Hong1, Jiahui Zhang1, Na Li1, Lu Ao1, Zheng Guo1.
Abstract
Formalin-fixed paraffin-embedded (FFPE) samples represent a valuable resource for clinical researches. However, FFPE samples are usually considered an unreliable source for gene expression analysis due to the partial RNA degradation. In this study, through comparing gene expression profiles between FFPE samples and paired fresh-frozen (FF) samples for three cancer types, we firstly showed that expression measurements of thousands of genes had at least two-fold change in FFPE samples compared with paired FF samples. Therefore, for a transcriptional signature based on risk scores summarized from the expression levels of the signature genes, the risk score thresholds trained from FFPE (or FF) samples could not be applied to FF (or FFPE) samples. On the other hand, we found that more than 90% of the relative expression orderings (REOs) of gene pairs in the FF samples were maintained in their paired FFPE samples and largely unaffected by the storage time. The result suggested that the REOs of gene pairs were highly robust against partial RNA degradation in FFPE samples. Finally, as a case study, we developed a REOs-based signature to distinguish liver cirrhosis from hepatocellular carcinoma (HCC) using FFPE samples. The signature was validated in four datasets of FFPE samples and eight datasets of FF samples. In conclusion, the valuable FFPE samples can be fully exploited to identify REOs-based diagnostic and prognostic signatures which could be robustly applicable to both FF samples and FFPE samples with degraded RNA.Entities:
Keywords: RNA degradation; formalin-fixed paraffin-embedded samples; fresh-frozen samples; gene expression measurements; relative expression orderings
Mesh:
Substances:
Year: 2017 PMID: 28036264 PMCID: PMC5351660 DOI: 10.18632/oncotarget.14257
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Description of paired FF and FFPE sample data and normal sample data used in this study
| Dataset | Platform | Sample size | Tissue type | Storage type | RIN(FF) | RIN(FFPE) |
|---|---|---|---|---|---|---|
| #TCGA_ LUAD | IlluminaHiSeq_RNASeqV2 | 12 pairs | LUAD | FFPE and FF | 8.1~9.4 | 2.3~2.5 |
| #TCGA_ COAD | IlluminaHiSeq_RNASeqV2 | 10 pairs | COAD | FFPE and FF | 7.3~9.8 | 2.0~2.6 |
| #TCGA_ BRCA | IlluminaHiSeq_RNASeqV2 | 5 pairs | BRCA | FFPE and FF | 7.4~9.7 | 2.1~2.7 |
| GSE54809 | GPL6244 | 7 | normal prostate | FFPE | - | - |
| GSE6956 | GPL571 | 20 | FF | - | - | |
| GSE29079 | GPL5175 | 48 | FF | - | - | |
| GSE32448 | GPL570 | 40 | FF | - | - | |
| GSE46602 | GPL570 | 14 | FF | - | - | |
| GSE11682 | GPL4133 | 17 | FF | - | - | |
| GSE28204 | GPL6480 | 4 | FF | - | - | |
| GSE35988 | GPL6480/6848 | 28 | FF | - | - | |
| GSE38241 | GPL4133 | 21 | FF | - | - | |
| GSE55597 | GPL10558 | 16 | FF | - | - | |
| GSE70768 | GPL10558 | 73 | FF | - | - | |
| E-MTAB-2523 | IlluminaHiSeq 2000 | 4 | normal liver | FFPE | - | - |
| GSE41804 | GPL570 | 20 | FF | - | - | |
| GSE55092 | GPL570 | 80 | FF | - | - | |
| GSE46408 | GPL4133 | 6 | FF | - | - | |
| GSE50579 | GPL14550 | 7 | FF | - | - | |
| GSE54236 | GPL6480 | 80 | FF | - | - | |
| GSE36376 | GPL10558 | 193 | FF | - | - | |
| GSE39791 | GPL10558 | 72 | FF | - | - | |
| GSE57957 | GPL10558 | 37 | FF | - | - |
Note:#TCGA_LUAD, #TCGA_COAD and #TCGA_BRCA denote mRNA_seq data of paired FF and FFPE samples for lung adenocarcinoma, colon adenocarcinoma and breast invasive cancer samples from TCGA, respectively.
Figure 1The fluctuation degree of gene expression measurements affected by RNA degradation in FFPE samples
Fold changes of DEGs between FFPE samples and their paired FF samples for lung adenocarcinoma (LUAD, shown in blue bar), colon adenocarcinoma (COAD, shown in red bar) and breast invasive cancer samples (BRCA, shown in green bar).
Figure 2The consistency scores of REOs of gene pairs between every FFPE sample and its paired FF sample
The consistency scores between every paired FFPE and FF samples after excluding zero, 10% and 20% gene pairs with the smallest expression differences in each of the FF samples for lung adenocarcinoma A., colon adenocarcinoma B. and breast invasive cancer C.
The influence of the storage time on REOs of gene pairs in FFPE samples
| Dataset | Storage time(years) | Consistency score |
|---|---|---|
| GSE54809 | 11 | 0.9668 |
| 12 | 0.9565 | |
| 14 | 0.9666 | |
| 16 | 0.9464 | |
| 18 | 0.9468 | |
| 19 | 0.9652 | |
| 21 | 0.9611 | |
| E-MTAB-2523 | 0.17 | 0.9099 |
| 1.08 | 0.8644 | |
| 5.17 | 0.9757 | |
| 20.08 | 0.9244 |
Note: The average consistency scores between REOs of FFPE normal samples and stable REOs in normal FF samples.
The 5-gene-pair signature
| Signature | Gene A | Gene B |
|---|---|---|
| pair1 | CLDN10 | SPINK1 |
| pair2 | CLDN10 | CDKN3 |
| pair3 | CLDN10 | LCN2 |
| pair4 | VIPR1 | NEK2 |
| pair5 | CRHBP | NEK2 |
Note: Gene A has a higher expression level than Gene B in liver cirrhosis.
The prediction sensitivity scores of the signature in the validation datasets
| Sample type | Dataset | Platform | Number (Sensitivity) of HCC samples | Number (Sensitivity) ofHepatitis/cirrhotic liver samples |
|---|---|---|---|---|
| FFPE | GSE10142 | GPL5474 | - | 225(0.9289) |
| GSE10186 | GPL5474 | 118(0.9068) | - | |
| GSE19977 | GPL8432 | 164(0.9390) | - | |
| GSE20017 | GPL8432 | 135(0.9259) | - | |
| FF | GSE63898 | GPL13667 | 228(0.9342) | 168(0.9583) |
| GSE25097 | GPL10687 | 268(0.9478) | 40(1.0000) | |
| GSE56140 | GPL18461 | 35(0.8000) | 34(0.9706) | |
| GSE36411 | GPL10558 | 42(0.9524) | 21(0.9524) | |
| GSE6764 | GPL570 | 35(0.9429) | 13(1.0000) | |
| GSE9843 | GPL570 | 91(0.9780) | - | |
| GSE17967 | GPL571 | - | 47(1.0000) | |
| GSE57725 | GPL14951 | - | 23(0.9565) |