| Literature DB >> 26901863 |
Oystein Eikrem1, Christian Beisland2, Karin Hjelle2, Arnar Flatberg3, Andreas Scherer4, Lea Landolt1, Trude Skogstrand1, Sabine Leh1,5, Vidar Beisvag3, Hans-Peter Marti1,6.
Abstract
Formalin-fixed, paraffin-embedded (FFPE) tissues are an underused resource for molecular analyses. This proof of concept study aimed to compare RNAseq results from FFPE biopsies with the corresponding RNAlater® (Qiagen, Germany) stored samples from clear cell renal cell carcinoma (ccRCC) patients to investigate feasibility of RNAseq in archival tissue. From each of 16 patients undergoing partial or full nephrectomy, four core biopsies, such as two specimens with ccRCC and two specimens of adjacent normal tissue, were obtained with a 16g needle. One normal and one ccRCC tissue specimen per patient was stored either in FFPE or RNAlater®. RNA sequencing libraries were generated applying the new Illumina TruSeq® Access library preparation protocol. Comparative analysis was done using voom/Limma R-package. The analysis of the FFPE and RNAlater® datasets yielded similar numbers of detected genes, differentially expressed transcripts and affected pathways. The FFPE and RNAlater datasets shared 80% (n = 1106) differentially expressed genes. The average expression and the log2 fold changes of these transcripts correlated with R2 = 0.97, and R2 = 0.96, respectively. Among transcripts with the highest fold changes in both datasets were carbonic anhydrase 9 (CA9), neuronal pentraxin-2 (NPTX2) and uromodulin (UMOD) that were confirmed by immunohistochemistry. IPA revealed the presence of gene signatures of cancer and nephrotoxicity, renal damage and immune response. To simulate the feasibility of clinical biomarker studies with FFPE samples, a classifier model was developed for the FFPE dataset: expression data for CA9 alone had an accuracy, specificity and sensitivity of 94%, respectively, and achieved similar performance in the RNAlater dataset. Transforming growth factor-ß1 (TGFB1)-regulated genes, epithelial to mesenchymal transition (EMT) and NOTCH signaling cascade may support novel therapeutic strategies. In conclusion, in this proof of concept study, RNAseq data obtained from FFPE kidney biopsies are comparable to data obtained from fresh stored material, thereby expanding the utility of archival tissue specimens.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26901863 PMCID: PMC4764764 DOI: 10.1371/journal.pone.0149743
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristic patient features at the time of surgery.
eGFR was calculated with the MDRD formula. The staging was performed based on the EAU Guidelines on renal cell carcinoma: 2014 update [43].
| Patient number | Age, yr | Gender | BMI | Nephrectomy type | eGFR (ml/min/1.73m2) | TNM-stage | Size (mm) | Fuhrmann grade | Stage |
|---|---|---|---|---|---|---|---|---|---|
| 9 | 70 | Male | 24 | Partial | >60 | pT1AcN0cM0 | 18 | 2 | I |
| 10 | 69 | Male | 34 | Partial | >60 | pT3AcN0cM0 | 15 | 2 | III |
| 11 | 37 | Male | 27 | Partial | >60 | pT1AcN0cM0 | 19 | 2 | I |
| 13 | 63 | Male | 24 | Full | 40 | pT3AcN0cM0 | 69 | 4 | III |
| 15 | 68 | Male | 28 | Partial | >60 | pT1AcN0cM0 | 21 | 2 | I |
| 16 | 53 | Male | 33 | Full | 56 | pT3bN0M1 | 100 | 2 | IV |
| 18 | 78 | Male | 27 | Full | 47 | T3AcN0cM0 | 60 | 2 | III |
| 19 | 71 | Female | 22 | Full | >60 | pT2aN0cM0 | 90 | 1 | II |
| 21 | 53 | Female | 25 | Full | 55 | pT1BcN0cM0 | 65 | 2 | I |
| 22 | 49 | Male | 25 | Partial | >60 | pT1BcN0cM0 | 50 | 2 | I |
| 24 | 69 | Male | 27 | Partial | >60 | pT1AcN0cM0 | 25 | 2 | I |
| 27 | 46 | Male | 31 | Full | >60 | pT2BcN0cM0 | 117 | 3 | II |
| 29 | 54 | Female | 29 | Partial | >60 | pT1AcN0cM0 | 15 | 2 | I |
| 31 | 67 | Male | 25 | Partial | >60 | pT1AcN0cM0 | 18 | 1 | I |
| 32 | 36 | Male | 23 | Partial | >60 | pT1AcN0cM0 | 18 | 3 | I |
| 33 | 48 | Male | 28 | Partial | >60 | pT1AcN0cM0 | 38 | 1 | I |
Fig 1Multidimensional scaling (MDS) analysis of gene expression data.
MDS analysis based on all commonly detected genes shows that samples segregate by diagnosis (A) and not by storage condition (B). Distances correspond to leading log-fold-changes between each pair of samples. MDS based on differentially expressed genes demonstrates less within-group variance compared to MDS with all detected genes in the RNAlater® (C) and FFPE (D) datasets. NF: Normal, FFPE; NR: Normal, RNAlater; TF: Tumor, FFPE; TR: Tumor, RNAlater. NO = Normal; TU = Tumor.
Fig 2Correlation of gene expression data.
The correlation of commonly differentially expressed genes is given with respect to (A) average expression and (B) log2 fold changes.
Gene expression analyses.
The 20 most up- or down-regulated genes in the FFPE data set with corresponding RNAlater® values (upper panel), and the 20 most up- or down regulated genes in the RNAlater® dataset with corresponding FFPE values (lower panel), filtered by adjusted p-value≤0.05. Rank indicates the rank of the gene within the list of differentially genes sorted by largest to smallest absolute fold change. 14 genes are shared between the two lists. TU: tumour, NO: normal, FC: fold change, ND: not detected, did not pass the expression filter.
| ENSG00000169344 | UMOD | -183,2 | 2,40E-07 | -158,7 | 8,06E-08 | 1 | 3 |
| ENSG00000106236 | NPTX2 | 140,9 | 6,67E-07 | 220,1 | 2,29E-08 | 2 | 2 |
| ENSG00000107159 | CA9 | 121,2 | 5,50E-06 | 304,4 | 3,65E-09 | 3 | 1 |
| ENSG00000074803 | SLC12A1 | -91,9 | 1,59E-07 | -78,5 | 1,15E-07 | 4 | 7 |
| ENSG00000169550 | MUC15 | -82,1 | 3,20E-07 | -66,6 | 1,27E-06 | 5 | 8 |
| ENSG00000142319 | SLC6A3 | 76,6 | 2,17E-06 | 101,7 | 6,53E-07 | 6 | 4 |
| ENSG00000169347 | GP2 | -57,2 | 1,13E-06 | -52,7 | 4,52E-07 | 7 | 10 |
| ENSG00000107165 | TYRP1 | -56,1 | 5,91E-06 | ND | ND | 8 | ND |
| ENSG00000088836 | SLC4A11 | -54,2 | 1,14E-07 | -62,7 | 2,16E-05 | 9 | 9 |
| ENSG00000130822 | PNCK | 53,3 | 1,42E-06 | 92,0 | 1,89E-07 | 10 | 5 |
| ENSG00000198691 | ABCA4 | -52,4 | 3,12E-07 | ND | ND | 11 | 30 |
| ENSG00000165973 | NELL1 | -51,4 | 2,72E-07 | -35,8 | 8,78E-07 | 12 | 16 |
| ENSG00000186510 | CLCNKA | -50,3 | 1,61E-08 | -39,7 | 9,73E-08 | 13 | 13 |
| ENSG00000215644 | GCGR | -49,7 | 1,52E-07 | -33,9 | 2,68E-06 | 14 | 18 |
| ENSG00000164893 | SLC7A13 | -49,3 | 3,87E-04 | -43,6 | 9,14E-06 | 15 | 11 |
| ENSG00000138798 | EGF | -47,9 | 1,43E-07 | -37,3 | 2,26E-07 | 16 | 15 |
| ENSG00000150201 | FXYD4 | -47,8 | 1,89E-05 | -8,1 | 1,51E-02 | 17 | 134 |
| ENSG00000184956 | MUC6 | -47,1 | 1,14E-05 | ND | ND | 18 | ND |
| ENSG00000100362 | PVALB | -45,7 | 5,83E-07 | ND | ND | 19 | ND |
| ENSG00000130829 | DUSP9 | -45,0 | 7,90E-07 | -24,4 | 1,56E-06 | 20 | 36 |
| ENSG00000107159 | CA9 | 304,4 | 3,65E-09 | 121,2 | 5,50E-06 | 1 | 3 |
| ENSG00000106236 | NPTX2 | 220,1 | 2,29E-08 | 140,9 | 6,67E-07 | 2 | 2 |
| ENSG00000169344 | UMOD | -158,7 | 8,06E-08 | -183,2 | 2,40E-07 | 3 | 1 |
| ENSG00000142319 | SLC6A3 | 101,7 | 6,53E-07 | 76,6 | 2,17E-06 | 4 | 6 |
| ENSG00000130822 | PNCK | 92,0 | 1,89E-07 | 53,3 | 1,42E-06 | 5 | 10 |
| ENSG00000185633 | NDUFA4L2 | 87,6 | 6,30E-10 | 20,9 | 5,88E-06 | 6 | 50 |
| ENSG00000074803 | SLC12A1 | -78,5 | 1,15E-07 | -91,9 | 1,59E-07 | 7 | 4 |
| ENSG00000169550 | MUC15 | -66,6 | 1,27E-06 | -82,1 | 3,20E-07 | 8 | 5 |
| ENSG00000088836 | SLC4A11 | -62,7 | 2,16E-05 | -54,2 | 1,14E-07 | 9 | 9 |
| ENSG00000169347 | GP2 | -52,7 | 4,52E-07 | -57,2 | 1,13E-06 | 10 | 7 |
| ENSG00000164893 | SLC7A13 | -43,6 | 9,14E-06 | -49,3 | 3,87E-04 | 11 | 15 |
| ENSG00000130208 | APOC1 | 40,0 | 7,15E-09 | 9,1 | 6,01E-05 | 12 | 136 |
| ENSG00000186510 | CLCNKA | -39,7 | 9,73E-08 | -50,3 | 1,61E-08 | 13 | 13 |
| ENSG00000123610 | TNFAIP6 | 37,8 | 2,98E-08 | 33,6 | 1,68E-07 | 14 | 26 |
| ENSG00000138798 | EGF | -37,3 | 2,26E-07 | -47,9 | 1,43E-07 | 15 | 16 |
| ENSG00000165973 | NELL1 | -35,8 | 8,78E-07 | -51,4 | 2,72E-07 | 16 | 12 |
| ENSG00000113889 | KNG1 | -34,9 | 7,04E-07 | -35,6 | 5,31E-07 | 17 | 25 |
| ENSG00000215644 | GCGR | -33,9 | 2,68E-06 | -49,7 | 1,52E-07 | 18 | 14 |
| ENSG00000008196 | TFAP2B | -32,5 | 4,04E-06 | -29,9 | 7,56E-06 | 19 | 31 |
| ENSG00000184661 | CDCA2 | 32,4 | 1,77E-07 | 28,9 | 1,13E-06 | 20 | 33 |
Fig 3Immunohistochemistry and mRNA plots.
(A) Immunohistochemistry of UMOD, NTPX2 and CA9. Magnification x20, scale bar 50 μm. (B) Respective mRNA abundance plots in the FFPE and in the RNAlater® datasets.
Pathway analysis.
The 20 most affected canonical pathways in each NGS dataset with the corresponding values and ranks. Rank indicates the place of the pathway within the list of pathways sorted by largest to smallest –log(adjusted p-value). 12 of 20 pathways are shared between both datasets. TU: tumour, NO: normal, FC: fold change, ND: not detected, did not pass the expression filter.
| Antigen Presentation Pathway | 13,90 | 9,13 | 1 | 3 |
| Hepatic Fibrosis / Hepatic Stellate Cell Activation | 13,90 | 14,60 | 2 | 2 |
| LXR/RXR Activation | 7,53 | 6,67 | 3 | 4 |
| Leukocyte Extravasation Signaling | 7,13 | 4,55 | 4 | 9 |
| Coagulation System | 6,78 | 6,59 | 5 | 5 |
| Communication between Innate and Adaptive Immune Cells | 6,60 | 3,58 | 6 | 17 |
| Caveolar-mediated Endocytosis Signaling | 6,54 | 3,69 | 7 | 12 |
| Atherosclerosis Signaling | 6,50 | 6,04 | 8 | 6 |
| Dendritic Cell Maturation | 6,50 | 4,18 | 9 | 10 |
| Crosstalk between Dendritic Cells and Natural Killer Cells | 6,31 | 3,62 | 10 | 14 |
| Graft-versus-Host Disease Signaling | 5,80 | 3,02 | 11 | 35 |
| Complement System | 5,78 | 4,55 | 12 | 8 |
| Autoimmune Thyroid Disease Signaling | 5,78 | 3,49 | 13 | 23 |
| Virus Entry via Endocytic Pathways | 5,78 | 2,92 | 14 | 38 |
| OX40 Signaling Pathway | 5,78 | 3,34 | 15 | 28 |
| Intrinsic Prothrombin Activation Pathway | 5,44 | 4,15 | 16 | 11 |
| Allograft Rejection Signaling | 5,44 | 3,49 | 17 | 25 |
| Fcγ Receptor-mediated Phagocytosis in Macrophages and Monocytes | 4,85 | 3,52 | 18 | 22 |
| Granulocyte Adhesion and Diapedesis | 4,36 | 2,74 | 19 | 47 |
| iCOS-iCOSL Signaling in T Helper Cells | 4,35 | 2,86 | 20 | 41 |
| EIF2 Signaling | 14,60 | ND | 1 | ND |
| Hepatic Fibrosis / Hepatic Stellate Cell Activation | 14,60 | 13,90 | 2 | 2 |
| Antigen Presentation Pathway | 9,13 | 13,90 | 3 | 1 |
| LXR/RXR Activation | 6,67 | 7,53 | 4 | 3 |
| Coagulation System | 6,59 | 6,78 | 5 | 5 |
| Atherosclerosis Signaling | 6,04 | 6,50 | 6 | 8 |
| LPS/IL-1 Mediated Inhibition of RXR Function | 5,23 | 3,87 | 7 | 27 |
| Complement System | 4,55 | 5,78 | 8 | 12 |
| Leukocyte Extravasation Signaling | 4,55 | 7,13 | 9 | 4 |
| Dendritic Cell Maturation | 4,18 | 6,50 | 10 | 9 |
| Intrinsic Prothrombin Activation Pathway | 4,15 | 5,44 | 11 | 16 |
| Caveolar-mediated Endocytosis Signaling | 3,69 | 6,54 | 12 | 7 |
| Ethanol Degradation II | 3,62 | 1,49 | 13 | 77 |
| Crosstalk between Dendritic Cells and Natural Killer Cells | 3,62 | 6,31 | 14 | 10 |
| Histamine Degradation | 3,58 | 0,41 | 15 | 267 |
| B Cell Development | 3,58 | 3,15 | 16 | 32 |
| Communication between Innate and Adaptive I Immune Cells | 3,58 | 6,60 | 17 | 6 |
| eNOS Signaling | 3,58 | 2,78 | 18 | 41 |
| Valine Degradation I | 3,57 | 1,48 | 19 | 78 |
| mTOR Signaling | 3,57 | ND | 20 | ND |
Comparison of our gene expression data with data from literature [17].
Twenty genes with smallest p-values and largest absolute fold changes in a meta-analysis of five microarray studies are compared to the corresponding genes and their fold changes and p-values of the NGS datasets. The median fold changes and standard deviations for the meta-analysis are presented. All shown genes were differentially expressed in only 2 or 3 microarray datasets. Large standard deviations indicate a large spread of values in the individual microarray studies. 17 of the 20 genes were found differentially expressed in both NGS datasets, 13 of these with fold changes within the fold change range of the microarray meta-analysis. ND: not detected, did not pass initial expression filter.
| Zaravinos et al. [ | Eikrem et al. (present study) | ||||||
|---|---|---|---|---|---|---|---|
| NDUFA4L2 | 53,94±58,53 | <0.01 | 20,9 | 4,09E-07 | 87,6 | 6.85E-14 | yes |
| PLIN2 | 27,86±27,89 | <0.01 | 4,6 | 2,82E-05 | 4,7 | 1,03E-04 | yes |
| NNMT | 20,86±9,84 | <0.01 | 9,0 | 2,25E-07 | 15,8 | 5,47E-10 | yes |
| ENO2 | 19,97±9,82 | <0.01 | 6,3 | 7,10E-08 | 7,3 | 1,39E-10 | no |
| AHNAK2 | 16,62±2,23 | <0.01 | 12,2 | 8,66E-09 | 16,0 | 1,96E-08 | yes |
| NETO2 | 15,8±13,8 | <0.01 | 10,6 | 5,10E-10 | 11,7 | 5,06E-13 | yes |
| CA9 | 14,48±4,40 | <0.01 | 121,2 | 3,72E-07 | 304,4 | 3,17E-12 | no |
| VWF | 13,06±2,61 | <0.01 | 4,9 | 3,84E-08 | 13,7 | 1,06E-09 | yes |
| COL23A1 | 12,75±5,10 | <0.01 | 22,1 | 6,99E-09 | 20,9 | 5,05E-09 | no |
| EHD2 | 12,70±13,94 | <0.01 | 3,9 | 2,26E-10 | 4,0 | 2,96E-08 | yes |
| ATP6V0A4 | -19,70±32,54 | <0.01 | -10,4 | 5,39E-08 | -7,4 | 2,28E-05 | yes |
| CA10 | -21,45±8,80 | <0.01 | ND | ND | |||
| SLC12A3 | -23,67±31,69 | <0.01 | -10,5 | 6,59E-05 | -18,9 | 1,39E-06 | yes |
| CLDN8 | -27,11±95,38 | <0.01 | ND | ND | |||
| SERPINA5 | -35,45±32,90 | <0.01 | -13,7 | 3,34E-05 | -16,4 | 9,39E-07 | yes |
| KNG1 | -38,45±51,67 | <0.01 | -35,6 | 1,15E-08 | -34,9 | 9,64E-09 | yes |
| KCNJ1 | -50,79±59,09 | <0.01 | -2,4 | 1,48E-09 | -2,1 | 1,48E-04 | yes |
| RALYL | -53,58±11,02 | <0.01 | ND | ND | |||
| CALB1 | -103,68±156,0 | <0.01 | -12,00 | 1,45E-03 | -8,4 | 7,88E-05 | yes |
| NPHS2 | -159,10±155,4 | <0.01 | -3,8 | 4,63E-03 | -4,4 | 1,76E-03 | no |
Fig 4Pathway signature of VEGF and NOTCH mediated EMT in ccRCC.
Comparison of gene expression data from the FFPE and from the RNAlater® dataset with published results [20] and between themselves. F = FFPE samples, R = RNAlater samples, Numbers = fold change of up-regulation (red) or down-regulation (blue).
Fig 5Gene network.
The most differentially affected network with the central role of TGFB1 in (A) FFPE samples and B) RNAlater data sets. Proteins with cancer involvement are marked with purple outline. Red fill indicates overrepresentation of the gene in ccRCC, green indicates under-representation. Color intensity reflects range of fold change.
Fig 6Development of a candidate marker for ccRCC.
(A) Expression values of CA9 correctly classified 30 of 32 samples in our FFPE dataset. (B) Whisker plot of expression value distribution in our FFPE dataset for CA9. (C) Scatterplot for the expression values of CA9 in our FFPE and in our RNAlater dataset. (D) CA9 expression values correctly classify 139 out of 144 samples in a microarray dataset of ccRCC (GSE53757). (E) Distribution of CA9 expression values for normal (NO) and ccRCC tumor samples (TU) in the GSE53757 dataset. (F) Stratification of the expression values of overexpressed CA9 into all four stages of ccRCC [14].