| Literature DB >> 26361796 |
A Francina Webster1, Paul Zumbo2, Jennifer Fostel3, Jorge Gandara2, Susan D Hester4, Leslie Recio5, Andrew Williams6, Charles E Wood4, Carole L Yauk7, Christopher E Mason8.
Abstract
Formalin-fixed paraffin-embedded (FFPE) tissue samples represent a potentially invaluable resource for transcriptomic research. However, use of FFPE samples in genomic studies has been limited by technical challenges resulting from nucleic acid degradation. Here we evaluated gene expression profiles derived from fresh-frozen (FRO) and FFPE mouse liver tissues preserved in formalin for different amounts of time using 2 DNA microarray protocols and 2 whole-transcriptome sequencing (RNA-seq) library preparation methodologies. The ribo-depletion protocol outperformed the other methods by having the highest correlations of differentially expressed genes (DEGs), and best overlap of pathways, between FRO and FFPE groups. The effect of sample time in formalin (18 h or 3 weeks) on gene expression profiles indicated that test article treatment, not preservation method, was the main driver of gene expression profiles. Meta- and pathway analyses indicated that biological responses were generally consistent for 18 h and 3 week FFPE samples compared with FRO samples. However, clear erosion of signal intensity with time in formalin was evident, and DEG numbers differed by platform and preservation method. Lastly, we investigated the effect of time in paraffin on genomic profiles. Ribo-depletion RNA-seq analysis of 8-, 19-, and 26-year-old control blocks resulted in comparable quality metrics, including expected distributions of mapped reads to exonic, untranslated region, intronic, and ribosomal fractions of the transcriptome. Overall, our results indicate that FFPE samples are appropriate for use in genomic studies in which frozen samples are not available, and that ribo-depletion RNA-seq is the preferred method for this type of analysis in archival and long-aged FFPE samples.Entities:
Keywords: FFPE; RNA-seq; archival RNA; biorepositories; microarray; toxicogenomics
Mesh:
Substances:
Year: 2015 PMID: 26361796 PMCID: PMC4659533 DOI: 10.1093/toxsci/kfv195
Source DB: PubMed Journal: Toxicol Sci ISSN: 1096-0929 Impact factor: 4.849
FIG. 1.Differential gene expression. Expression profiles following exposure to furan (8 mg/kg/day) in FRO, 18 h in formalin (FFPE-18 h), or 3 weeks in formalin (FFPE-3 weeks) liver samples evaluated using different DNA microarray and RNA-seq protocols. (A) Correlation analysis of DEG fold changes for FRO versus FFPE-18 h (top row) and FRO versus FFPE-3 weeks (bottom row) (P < 0.0001 for all linear regressions). Genes were significantly changed in at least one list (FDR P < 0.05, fold change > ± 1.5 compared with control). (B) Hierarchical clustering of all DEGs (FDR P < 0.05, fold change > ± 2).
FIG. 2.Pathway analysis for (A) ribo-depletion RNA-seq, (B) poly-A-enrichment RNA-seq, (C) 1-color microarrays, and (D) 2-color microarrays. The top 15 pathways for the FRO groups (blue) are listed vertically. The level of enrichment of the corresponding FFPE groups (white-FFPE-18 h; grey-FFPE-3 weeks) is plotted. Pathway significance was calculated in IPA using a Fisher Exact test; −log(P-value) = 1.3 corresponds to a P = 0.05 (indicated in red). Venn diagrams indicate the overlap of each full list. For all analyses, only mapped DEGs with FDR P < 0.05 and fold change > ± 1.5 were used and only pathways with at least 4 DEGs were considered. Nrf2 Oxidative Stress Response was the only pathway that was enriched in all 12 groups (blue chevrons); p53 was enriched in 11/12 groups (grey chevrons). Full color version available online.
Number of DEGs (by unique gene symbol) in each microarray and RNA-seq experiment using different filtering thresholds
| Fold change cut-off | Fresh Frozen | FFPE | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 18 h | 3 weeks | ||||||||||||
| FDR | Unadj. | FDR | Unadj. | FDR | Unadj. | FDR | Unadj. | FDR | Unadj. | FDR | Unadj. | ||
| Mapped RefSeq | RNA-seq | ||||||||||||
| Ribo− | 768 | 129 | 207 | 814 | 189 | 248 | 928 | 77 | 451 | ||||
| PolyA+ | 675 | 189 | 199 | 741 | 160 | 249 | 1447 | 51 | 812 | ||||
| Microarray | |||||||||||||
| 1 color | 489 | 103 | 111 | 1847 | 302 | 323 | 739 | 72 | 118 | ||||
| 2 color | 234 | 47 | 53 | 798 | 77 | 77 | 152 | 12 | 12 | ||||
| Mapped RefSeq | RNA-seq | ||||||||||||
| Ribo– | 316 (44) | 836 | 133 | 226 | 619 (46) | 883 | 208 | 265 | 212 (25) | 986 | 76 | 483 | |
| PolyA+ | 659 (58) | 728 | 206 | 217 | 455 (37) | 783 | 165 | 257 | 136 (11) | 1519 | 58 | 870 | |
| Microarray | |||||||||||||
| 1 color | 667 (28) | 811 | 157 | 173 | 3543 (133) | 3753 | 626 | 640 | 491 (40) | 887 | 69 | 105 | |
| 2 color | 339 (32) | 405 | 84 | 93 | 1854 (196) | 1853 | 192 | 188 | 266 (48) | 272 | 25 | 24 | |
The corresponding number of enriched IPA pathways (with P < 0.05 and at least 4 molecules) is indicated in brackets. Lists that were used to derive enriched pathways are indicated by italics text.
Chemical signature analysis
| Time in formalin | Time in formalin | |||||
|---|---|---|---|---|---|---|
| FRO | 18 h* | 3 weeks* | FRO | 18 h | 3 weeks | |
| Thioacetamide | ||||||
| 1,5-Naphthalenediamine | 6 | 31 | ||||
| Dimethylnitrosamine | 35 | 25 | ||||
| Methapyrilene | ||||||
| Diethylnitrosamine | 13 | 26 | 10 | |||
| Anastrozole | 6 | 14 | 14 | 17 | 6 | |
| Tunicamycin | 7 | 19 | 6 | 7 | 10 | 31 |
| Bleomycin | 8 | 9 | 91 | — | — | — |
| Benzo(a)pyrene | 9 | 4 | 11 | — | — | — |
| Bromobenzene | 10 | 18 | 8 | 8 | 15 | 7 |
| FRO | 18 h* | 3 weeks | FRO | 18 h* | 3 weeks* | |
| Malathion (14,800 ppm) | ||||||
| Benzofuran (240 mg/kg) | ||||||
| Methylene chloride (4000 ppm) | 6 | 14 | 14 | 14 | ||
| 1,5-Naphthalenediamine (2000 ppm) | ||||||
| Iodoform (93 mg/kg) | – | |||||
| Methylene chloride (3000 ppm) | 6 | 10 | 6 | 10 | 10 | |
| Methylene chloride (2000 ppm) | 7 | 12 | 11 | 9 | 11 | 11 |
| 1,4-Dichlorobenzene (600 mg/kg) | 8 | 7 | ||||
| 1,2,3-Trichloropropane (60 mg/kg) | 9 | 11 | 13 | 10 | 13 | 13 |
| 1,4-Dichlorobenzene (500 mg/kg) | 10 | 10 | 8 | 8 | 8 | 8 |
Kendall’s rank correlation for all NextBio (where * indicates a significant correlation with respect to FRO): Ribo- RNA-seq: FRO versus 18 h (tau = 0.674; = 0.0071); FRO versus 3 weeks (tau = 0.511; = 0.0466); 1 C microarray: FRO versus 18 h (tau = 0.214; P = 0.5484); FRO versus 3 weeks (tau = 0.429; P = 0.1789).
Kendall’s rank correlation for GSE18858: Ribo- RNA-seq: FRO versus 18 h (tau = 0.556; = 0.0446); FRO versus 3 weeks (tau = 0.467; P = 0.0726); 1C microarray: FRO versus 18 h (tau = 0.6; = 0.0167); FRO versus 3 weeks (tau = 0.6; = 0.0167). The expression values of the top 250 mapped DEGs (FC > 1.5, FDR P < 0.05 for furan-treated mice versus respective controls) stratified by sample group (FRO, FFPE-18-h, FFPE-3-week) were compared with liver gene expression signatures from all NextBio mouse studies (top) and GSE18858 (Thomas ; bottom). In the latter, female B6C3F1 mice were exposed to one of 26 chemicals for 13 weeks. The top 5 most correlated datasets for each group are indicated by italics text (where 1 indicates the most correlated study, 2 the second most correlated, etc.); only positively correlated chemicals with at least 3 supporting studies were included (all correlations are P < 0.05, which was calculated in NextBio using a Fisher’s exact test).
Disease Signature Analysis
| Time in formalin | ||||||
|---|---|---|---|---|---|---|
| Treatment | Time of tissue collection | Mouse details | GEO | FRO | 18 h* | 3 weeks* |
| Partial hepatectomy | 48 h post-PH | 5–6 m/CB6F1 | GSE20427 | |||
| Partial hepatectomy | 48 h post-PH | 25–27 m/CB6F1 | GSE20427 | |||
| Partial hepatectomy | 18 h post-PH | Adult/wildtype C57BL/6 | GSE51801 | |||
| CCl4-exposure | 2 d post-exposure | BALB/c (IL4 IL13 double knockout) | GSE45002 | |||
| CCl4-exposure | 2 d post-exposure | BALB/c (wildtype) | GSE45002 | 6 | ||
| Partial hepatectomy | 2 h post-PH | 25–27 m/CB6F1 | GSE20427 | 6 | 5 | |
| Partial hepatectomy | 2 h post-PH | 5–6 m/CB6F1 | GSE20427 | 7 | 9 | - |
| Partial hepatectomy | 24 h post-PH | Tob1-null | GSE21836 | 8 | 8 | 7 |
| Partial hepatectomy | 38 h post-PH | 5–6 m/CB6F1 | GSE20427 | 9 | 7 | 9 |
| Partial hepatectomy | 24 h post-PH | 5–6 m/CB6F1 | GSE20427 | 10 | 10 | 8 |
| FRO | 18 h | 3 weeks* | ||||
| Partial hepatectomy | 48 h post-PH | 5–6 m/CB6F1 | GSE20427 | |||
| Partial hepatectomy | 48 h post-PH | 25–27 m/CB6F1 | GSE20427 | |||
| CCl4-exposure | 2 d post CCl4-exposure | BALB/c (wildtype mice) | GSE45002 | 9 | ||
| CCl4-exposure | 2 d post CCl4-exposure | BALB/c (IL4/IL13 double knockout mice) | GSE45002 | |||
| Partial hepatectomy | 18 h post-PH | Adult/wildtype C57BL/6 | GSE51801 | |||
| Partial hepatectomy | 38 h post-PH | 5–6 m/CB6F1 | GSE20427 | 6 | 7 | 6 |
| Partial hepatectomy | 24 h post-PH | Tob1-null (Tob1 = a repressor of liver regeneration) | GSE21836 | 7 | 8 | |
| Ehrlichia chaffeensis induced Hepatitis | 15 d Wakulla strain | Severe combined immunodeficient mice | GSE8966 | 8 | 8 | 9 |
| Partial hepatectomy | 24 h post-PH | 5–6 m/CB6F1 | GSE20427 | 9 | 6 | 7 |
| Partial hepatectomy | 2 h post-PH | 25–27 m/CB6F1 | GSE20427 | 10 | 10 | 10 |
PH, partial hepatectomy; CCl4, carbon tetrachloride; m, month.
All samples are: mouse; liver; treated versus control.
Kendall’s rank correlation (where *indicates a significant correlation with respect to FRO): Ribo- RNA-seq: FRO versus 18 h (tau = 0.778; = 0.0009); FRO versus 3 weeks (tau = 0.611; = 0.0247); 1 C microarray: FRO versus 18 h (tau = 0.378; P = 0.1557); FRO versus 3 weeks (tau = 0.644; = 0.0091). The expression values of the top 250 mapped DEGs (FC > 1.5, FDR P < 0.05) in each group (FRO, FFPE-18-h, FFPE-3-week) determined by RNA-seq or microarray analysis were compared with gene expression signatures derived from publically available mouse liver and biliary disease datasets using NextBio. The top 5 most correlated datasets for each group are indicated by italics text (where 1 indicates the most correlated study, 2 the second most correlated, etc.); only positively correlated studies were included. All correlations have a P < 0.05, which was calculated in NextBio using a Fisher’s Exact Test.
FIG. 3.Examination of RNA-seq data quality metrics. (A–C) Mapping distributions and GC content distributions for the furan-treated sampled. (A) The percentage of reads that covers each nucleotide position of all genes scaled to 100 bins, from 5′ UTR to 3′ UTR for FRO and FFPE samples. The ribo-depleted RNA samples consistently showed more uniform gene coverage than did their poly-A–selected counterparts. (B) GC content distributions for FRO and FFPE samples. (C) The percentage of FRO and FFPE reads that map to various genic regions. Overall, we detected more intronic reads from ribo-depleted RNA samples than from poly-A–enriched libraries; this trend increased as a function of time in formalin. (D,E) Mapping distributions and GC content distributions, expressed as a percentage, for 8-, 19-, and 26-year-old control FFPE tissues. (F) Inter-group Kendell's tau comparison.