| Literature DB >> 23641797 |
Marlene Thomas1, Manuela Poignée-Heger, Martin Weisser, Stephanie Wessner, Anton Belousov.
Abstract
BACKGROUND: Whole genome microarray gene expression profiling is the 'gold standard' for the discovery of prognostic and predictive genetic markers for human cancers. However, suitable research material is lacking as most diagnostic samples are preserved as formalin-fixed, paraffin-embedded tissue (FFPET). We tested a new workflow and data analysis method optimized for use with FFPET samples.Entities:
Year: 2013 PMID: 23641797 PMCID: PMC3660273 DOI: 10.1186/2043-9113-3-10
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Figure 1Plot showing (A) typical poor-quality Affymetrix probe set and (B) the revised probe set used in this study.* OY axis corresponds to probe-dependent signal component, also known as affinity profile. Signals are normalized by RMA method. Black dots = ER–, red dots = ER+. * The set of 11 probes shown in Figure 1A contains seven probes that either do not match the genome and transcriptome (probes 1 to 4) or that are not sufficiently specific for their target sequence and may, therefore, detect multiple genes (probes 6 to 8). Concentrating on an unambiguous part of the probe set (Figure 1B) significantly improves the power of the statistical tests used (eg, a t-statistic of 3.07 comparing expression in estrogen receptor-negative versus expression in estrogen receptor-positive tumors as opposed to a t-statistic of 0.98 when the original Affymetrix probe set is used).
RNA concentration, RNA integrity number and 3′/5′ ratios for fresh-frozen and formalin-fixed, paraffin-embedded tissue samples
| 1 | 153 | 7.8 | 1.0 | 258 | 2.1 | 4.2 |
| 2 | 27 | 9.0 | 1.1 | 71 | 2.2 | 6.8 |
| 3 | 40 | 6.2 | 1.4 | 156 | 2.5 | 24.2 |
| 4 | 98 | 7.0 | 1.2 | 87 | 1.8 | 7.4 |
| 5 | 49 | 8.6 | 1.0 | 127 | NA | 5.1 |
| 6 | 73 | 7.3 | 1.0 | 96 | NA | 90.5 |
| 7 | 326 | 8.6 | 1.0 | 467 | NA | 90.8 |
| 8 | 87 | 6.8 | 1.2 | 168 | 6.2 | 44.3 |
| 9 | 34 | 5.4 | 1.5 | 74 | NA | 151.3 |
| 10 | 55 | 8.3 | 1.0 | 100 | NA | 90.8 |
| 11 | 23 | 6.9 | 1.2 | 53 | 2.6 | 9.0 |
| 12 | 120 | 7.9 | 1.0 | 162 | 2.8 | 72.2 |
| 13 | 58 | 7.7 | 1.1 | 71 | 2.6 | 17.1 |
| 14 | 110 | 6.9 | 1.1 | 178 | 2.5 | 11.3 |
| 15 | 121 | 7.8 | 1.1 | 219 | 2.3 | 16.6 |
| 16 | 175 | 7.1 | 1.2 | 349 | 2.3 | 16.1 |
FF, fresh-frozen; FFPET, formalin-fixed, paraffin-embedded tissue; NA, not assessable; RIN, RNA integrity number.
* RNA integrity is gauged using the RIN, which indicates the intactness of the sample by assigning a grade from 1 to 10, 10 being the most intact.
Figure 2Pre-normalization microarray plots. (A) Fresh-frozen tissue, (B) Formalin-fixed, paraffin-embedded tissue (FFPET) processed using the Affymetrix Two-Cycle kit, (C) FFPET tissue processed using the NuGEN kit, and (D) FFPET tissue processed using the Sigma/Rubicon Genomics kit.
Quality measures of microarray gene expression values*
| FF: | N/A | 49.7 (47.3, 50.6) | 51.4 | 17.4 |
| FFPET: Affymetrix | 0.78 (0.72, 0.81) | 33.4 (21.8, 38.1) | 36.2 | 19.4 |
| FFPET: NuGEN | 0.88 (0.84, 0.89) | 47.2 (44.1, 58.1) | 52.7 | 17.9 |
| FFPET: Sigma/Rubicon | 0.48 (0.47, 0.52) | 22.9 (22.1, 24.6) | 16.4 | 28.0 |
FF, fresh-frozen; FFPET, formalin-fixed, paraffin-embedded tissue; LOB, limit of blank; N/A, not applicable.
* For the genes represented by at least two probe sets the analysis of consistency of probe set expression is performed. As an overall measure of the consistency we have estimated the percentage of genes for which the Pearson correlation coefficient between expression of all pairs of related probe sets did not exceed 0.5.
† As defined by the Affymetrix algorithm.
Figure 3Estrogen receptor status of FFPET samples: sequence detection sensitivity and specificity by different methods. Each point of a particular color in the plot represents the effect size (difference between mean expression in ER+ and ER– samples) found in FF samples and FFPET samples using a particular workflow. Different quadrants of the plot are marked so as to assist in classifying the findings. For example, false negative (FN) findings are those which are present in FF but absent in FFPET. Presence and absence is decided based on the observed effect size with the widely accepted threshold of 2-fold change (up or down). One particular case, referred to as a wrong positive finding (WP), not observed in our experiments, is when a relevant (>1 in absolute value) effect in FF stays relevant but changes its direction in FFPET.
Figure 4Principal components analysis plots. (A) Fresh-frozen tissue, (B) Formalin-fixed, paraffin-embedded tissue (FFPET) samples processed using the NuGEN RNA amplification kit, (C) FFPET samples processed using the Affymetrix Two-Cycle RNA amplification kit, and (D) FFPET samples processed using the Sigma/Rubicon Genomics RNA amplification kit. Note the extremely similar shape of fresh-frozen and FFPET data clusters associated with the NuGEN workflow. The NuGEN kit also provides the most parsimonious description: in order to capture the effect of estrogen receptor (ER) status on gene expression, both the Two-Cycle and the Sigma/Rubicon Genomics kit would require at least two leading principal components. Circles and triangles represent ER– and ER+ samples correspondingly. Technical replicates are shown in red.