| Literature DB >> 30087585 |
Paul D Piehowski1, Vladislav A Petyuk1, Ryan L Sontag1, Marina A Gritsenko1, Karl K Weitz1, Thomas L Fillmore1, Jamie Moon1, Hala Makhlouf2, Rodrigo F Chuaqui2, Emily S Boja3, Henry Rodriguez3, Jerry S H Lee4, Richard D Smith1, Danielle M Carrick5, Tao Liu1, Karin D Rodland1.
Abstract
BACKGROUND: Mass spectrometry-based proteomics has become a powerful tool for the identification and quantification of proteins from a wide variety of biological specimens. To date, the majority of studies utilizing tissue samples have been carried out on prospectively collected fresh frozen or optimal cutting temperature (OCT) embedded specimens. However, such specimens are often difficult to obtain, in limited in supply, and clinical information and outcomes on patients are inherently delayed as compared to banked samples. Annotated formalin fixed, paraffin embedded (FFPE) tumor tissue specimens are available for research use from a variety of tissue banks, such as from the surveillance, epidemiology and end results (SEER) registries' residual tissue repositories. Given the wealth of outcomes information associated with such samples, the reuse of archived FFPE blocks for deep proteomic characterization with mass spectrometry technologies would provide a valuable resource for population-based cancer studies. Further, due to the widespread availability of FFPE specimens, validation of specimen integrity opens the possibility for thousands of studies that can be conducted worldwide.Entities:
Keywords: Formalin fixed paraffin embedded; Phosphoproteomics; Proteomics; Surveillance, epidemiology and end results; Tandem mass tags
Year: 2018 PMID: 30087585 PMCID: PMC6074037 DOI: 10.1186/s12014-018-9202-4
Source DB: PubMed Journal: Clin Proteomics ISSN: 1542-6416 Impact factor: 3.988
FFPE specimens selected for analysis
| Specimen time in storage | |||||
|---|---|---|---|---|---|
| 3–12 years | 13–22 years | 23–32 years | Age not provided | Total | |
| RTR site 1 | 8 | 9 | 3 | 0 | 20 |
| RTR site 2 | 4 | 11 | 5 | 0 | 20 |
| RTR site 3 | 1 | 11 | 7 | 1 | 20 |
| Total | 13 | 31 | 15 | 1 | 60 |
Fig. 1Effect of storage time on peptide yield. a Average peptide yield per sample at each RTR. b Total peptide extraction yield (µg) from SEER specimens versus specimen age, evaluated by BCA assay. Each point represents total yield from five 10-µm FFPE sections, scraped and pooled in a single tube for sample processing. Different colors are used for each RTR. c Peptide yield normalized to tumor volume (surface area × 0.01 mm depth) versus time in storage for the two RTRs reporting tumor dimensions
Fig. 2Comparison of FFPE (SEER) and OCT (TCGA) proteome coverage. The comparative analysis was done using both the spectrum identification rates (a) and unique peptide identification rates (b). The identification rate is calculated as the number of PSM passing a 1% FDR cutoff (a) or unique peptide sequences (b) divided by the total number of MS/MS spectra taken by the mass spectrometer. The difference in assignment of PSMs and peptides between the TMT-10 (SEER) and iTRAQ-4 (TCGA) labeled samples is also adjusted to account for the known differences as described above
Fig. 3PCA analysis of proteomic results grouped by SEER RTR site: a expression proteome results, and b phosphoproteome. The lack of any statistically significant effect of RTR on protein abundances within this set of samples was confirmed with ANOVA analysis
Fig. 4PCA analysis of TMT results analyzed by time in storage: a expression protein abundance, and b phosphopeptides. Linear regression of relative protein abundance for three proteins with highest significance (c) and top 3 most significant phosphosites (d)
Fig. 5Comparison of GSEA enrichment in FFPE versus OCT samples. a and b are examples of the GSEA plots for the most depleted and enriched GO terms comparing FFPE to OCT. The significance of the test depends on the degree of concordance between changes in protein abundance within a GO term. Particular groupings of proteins at the low and high ends of the ranked list indicated non-random depletion and enrichment of the corresponding GO terms. The significance of the enrichment is based on one million permutations which were computed using FGSEA R package. c is the volcano plot reflecting significance and estimate fold of change of the GO term. The list of significantly affected GO terms is available in the Additional file 7: Table S6. The pattern and most significant GO terms remained consistent when FFPE was compared to OCT data, independent of the OCT analysis site (PNNL vs. JHU). Noteworthy, comparing two OCT datasets as negative control test set yielded no significantly different GO terms (right panel), thus demonstrating the validity of the statistical test