| Literature DB >> 26653538 |
David L Tabb, Xia Wang1, Steven A Carr2, Karl R Clauser2, Philipp Mertins2, Matthew C Chambers, Jerry D Holman, Jing Wang, Bing Zhang, Lisa J Zimmerman, Xian Chen3, Harsha P Gunawardena3, Sherri R Davies4, Matthew J C Ellis4, Shunqiang Li4, R Reid Townsend4, Emily S Boja5, Karen A Ketchum6, Christopher R Kinsinger5, Mehdi Mesri5, Henry Rodriguez5, Tao Liu7, Sangtae Kim7, Jason E McDermott7, Samuel H Payne7, Vladislav A Petyuk7, Karin D Rodland7, Richard D Smith7, Feng Yang7, Daniel W Chan8, Bai Zhang8, Hui Zhang8, Zhen Zhang8, Jian-Ying Zhou8, Daniel C Liebler.
Abstract
The NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) employed a pair of reference xenograft proteomes for initial platform validation and ongoing quality control of its data collection for The Cancer Genome Atlas (TCGA) tumors. These two xenografts, representing basal and luminal-B human breast cancer, were fractionated and analyzed on six mass spectrometers in a total of 46 replicates divided between iTRAQ and label-free technologies, spanning a total of 1095 LC-MS/MS experiments. These data represent a unique opportunity to evaluate the stability of proteomic differentiation by mass spectrometry over many months of time for individual instruments or across instruments running dissimilar workflows. We evaluated iTRAQ reporter ions, label-free spectral counts, and label-free extracted ion chromatograms as strategies for data interpretation (source code is available from http://homepages.uc.edu/~wang2x7/Research.htm ). From these assessments, we found that differential genes from a single replicate were confirmed by other replicates on the same instrument from 61 to 93% of the time. When comparing across different instruments and quantitative technologies, using multiple replicates, differential genes were reproduced by other data sets from 67 to 99% of the time. Projecting gene differences to biological pathways and networks increased the degree of similarity. These overlaps send an encouraging message about the maturity of technologies for proteomic differentiation.Entities:
Keywords: CPTAC; Differential proteomics; iTRAQ; label-free; quality control; technology assessment; xenografts
Mesh:
Substances:
Year: 2015 PMID: 26653538 PMCID: PMC4779376 DOI: 10.1021/acs.jproteome.5b00859
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1High-level view of the bioinformatics pipeline employed in this study. Six instruments analyzed the same xenograft pair. Label-free sets were processed once by a spectral counts method and once by extracted ion chromatograms. While instrument-specific assemblies of PSMs were used for repeatability analysis, an all-instrument assembly was analyzed for reproducibility and biological pathway and network enrichment.
Figure 2Number of identified distinct peptides per sample/replicate. When data from each instrument were assembled separately with 0.5% PSM FDR and <5% empirical protein FDR, identification sensitivity varied considerably by site. Ailing instrument OElite@65 yielded the lowest sensitivity by far, whereas QExac@56 produced a remarkable number of identifications. Label-free instruments ran WHIM2 and WHIM16 separately, whereas iTRAQ instruments combined these samples into a single 4plex.
Figure 3How many of the differential genes from each experiment are confirmed by at least one other replicate experiment? Blue differential genes are found in common by another experiment in this instrument, whereas orange ones are unique to a single replicate. For iTRAQ, the confirmation must come from a different set of LC–MS/MS data altogether. Note that instruments that produced more replicates were likely to have a higher proportion of common differential genes by random chance.
Figure 4Rank correlations compare the ordering of genes by signed posterior probabilities. OVelos@10 illustrates the extent to which differential probabilities within an iTRAQ 4plex are more similar than those across multiple 4plexes. The QExac@56 shows an exception to this behavior in replicate B. Spectral count-based differentiation produces similar overall correlations to iTRAQ, though without the benefit of being able to compare within LC–MS/MS experiments. Replicate G from OVelos@65 correlated more poorly. The declining sensitivity of identification in QExac@98 gave low correlation values other than the one between the first two replicates.
Figure 5To what extent are the differential genes found for each data set confirmed by other data sets? Those represented by blue were found to be differential in common with another data set in that graph panel, as well. Orange genes, on the other hand, were unique to a particular data set. High identification sensitivity for QExac@56 led to many instrument-specific differences.
Consistency of Enriched Pathways in Genes Expressed More Highly in WHIM16a
| pathway | QExac@56 | OVelos@10 | OVelos@45 | XICs: OVelos@65 (A–J) | SPC: OVelos@65 (A–J) | XICs: QExac@98 (A–D) | SPC: QExac@98 (A–D) |
|---|---|---|---|---|---|---|---|
| glycolysis/gluconeogenesis | × | × | × | × | × | + | × |
| arginine and proline metabolism | + | + | × | × | × | × | × |
| valine, leucine, and isoleucine degradation | × | + | × | × | × | × | |
| ECM receptor interaction | × | × | × | × | × | ||
| focal adhesion | × | × | × | × | × | ||
| endocytosis | × | × | × | × | + | ||
| antigen processing and presentation | + | + | × | + | + | × | × |
| glutathione metabolism | + | + | × | + | + | × | + |
| amino sugar and nucleotide sugar metabolism | + | × | × | + | + | + | |
| fructose and mannose metabolism | + | + | + | + | × | × | |
| propanoate metabolism | + | + | + | + | × | × | |
| cell adhesion molecules | + | × | × | + | + | ||
| hematopoietic cell lineage | + | × | × | + | + | ||
| regulation of actin cytoskeleton | + | + | + | × | × | ||
| starch and sucrose metabolism | + | + | × | × | + | ||
| butanoate metabolism | + | + | × | × | |||
| citrate cycle/TCA_cycle | + | + | × | × | |||
| complement and coagulation cascades | × | + | + | × | |||
| fatty acid metabolism | + | × | × | ||||
| graft versus host disease | + | + | × | + | + | + | |
| allograft rejection | + | × | + | + | + | ||
| tryptophan metabolism | + | + | + | + | × | ||
| lysosome | + | + | × | + | |||
| pentose and glucuronate interconversions | + | + | + | × | |||
| glycosaminoglycan degradation | × | + | + |
× indicates that the pathway was in the top 10 most significant pathways for that data set. + indicates that the pathway was significant (corrected p value < 0.05) in that data set.
Consistency of Enriched Network Modules in Gene Expressed More Highly in WHIM16a
| module | enriched function | QExac@56 | OVelos@10 | OVelos@45 | XIC OVelos@65 (A–J) | SPC: OVelos@65 (A–J) | XICs: QExac@98 (A–D) | SPC: QExac@98 (A–D) |
|---|---|---|---|---|---|---|---|---|
| Level_2_Module_3 | response to wounding | × | × | × | × | × | ||
| Level_2_Module_11 | immune system process | × | × | × | × | × | ||
| Level_3_Module_37 | cell-substrate adhesion | × | × | × | × | × | ||
| Level_4_Module_32 | microtubule-based process | × | × | × | × | × | ||
| Level_2_Module_2 | proteolysis | × | × | × | × |
Enriched function is the most significant GO (gene ontology) biological process term enriched with genes in the module. × indicates that the pathway was in the top 10 most significant pathways for that data set. All modules can be found in NetGestalt (www.netgestalt.org).
Figure 6NetGestalt-based analysis of functional consistency for up-in-WHIM16 genes from seven data sets. (a) Proteins in the iRef network are placed in a linear order together with the hierarchical modular organization of the network. Alternating green and orange bar colors distinguish neighboring modules. (b–g) For each data set, the network modules found to be enriched are colored light red. Within these modules, up-in-WHIM16 genes are represented by a dark red stripe.