| Literature DB >> 34813325 |
James A Sanford1, Yang Wang1, Joshua R Hansen1, Marina A Gritsenko1, Karl K Weitz1, Tyler J Sagendorf1, Cristina E Tognon2,3, Vladislav A Petyuk1, Wei-Jun Qian1, Tao Liu1, Brian J Druker2,3, Karin D Rodland1,2, Paul D Piehowski4.
Abstract
Global and phosphoproteome profiling has demonstrated great utility for the analysis of clinical specimens. One barrier to the broad clinical application of proteomic profiling is the large amount of biological material required, particularly for phosphoproteomics─currently on the order of 25 mg wet tissue weight. For hematopoietic cancers such as acute myeloid leukemia (AML), the sample requirement is ≥10 million peripheral blood mononuclear cells (PBMCs). Across large study cohorts, this requirement will exceed what is obtainable for many individual patients/time points. For this reason, we were interested in the impact of differential peptide loading across multiplex channels on proteomic data quality. To achieve this, we tested a range of channel loading amounts (approximately the material obtainable from 5E5, 1E6, 2.5E6, 5E6, and 1E7 AML patient cells) to assess proteome coverage, quantification precision, and peptide/phosphopeptide detection in experiments utilizing isobaric tandem mass tag (TMT) labeling. As expected, fewer missing values were observed in TMT channels with higher peptide loading amounts compared to lower loadings. Moreover, channels with a lower loading have greater quantitative variability than channels with higher loadings. A statistical analysis showed that decreased loading amounts result in an increase in the type I error rate. We then examined the impact of differential loading on the detection of known differences between distinct AML cell lines. Similar patterns of increased data missingness and higher quantitative variability were observed as loading was decreased resulting in fewer statistical differences; however, we found good agreement in features identified as differential, demonstrating the value of this approach.Entities:
Keywords: TMT; acute myeloid leukemia; clinical proteomics; differential loading; isobaric labeling; phosphoproteomics
Mesh:
Substances:
Year: 2021 PMID: 34813325 PMCID: PMC8739833 DOI: 10.1021/jasms.1c00169
Source DB: PubMed Journal: J Am Soc Mass Spectrom ISSN: 1044-0305 Impact factor: 3.109
Figure 1Clinical proteomics workflow. Diagram illustrating the steps involved in sample processing and data acquisition for our clinical proteomics workflow.
Figure 2Design of TMT multiplexes with differential peptide loading. (A) Protein extraction yields obtained from cell pellets of decreasing cell counts. (B) Design of the two TMT11 multiplexes with differential amounts of peptide loaded per channel. (C) Total TMT reporter ion intensity obtained per channel from global proteomics data sets. TMT labeling efficiency was determined to be >99% for each plex and is reported in the plot headers. (D) Relationship between the amount of peptide loaded per channel and the median TMT reporter ion intensities acquired.
Peptide and Protein Identificationsa
| global proteomics | phosphoproteomics | |
|---|---|---|
| unique peptide identifications (total) | 138 373 | 27 351 |
| unique peptides quantified in >25% of samples | 137 302 (99.2%) | 26 753 (97.8%) |
| unique peptides quantified in >33% of samples | 134 399 (97.1%) | 24 230 (88.6%) |
| unique peptides quantified in >50% of samples | 118 418 (85.6%) | 17 409 (63.7%) |
| unique peptides quantified in 100% of samples | 45 925 (33.2%) | 3 366 (12.3%) |
| unique protein identifications (total) | 8926 | NA |
| unique proteins quantified in >25% of samples | 8910 (99.8%) | NA |
| unique proteins quantified in >33% of samples | 8887 (99.6%) | NA |
| unique proteins quantified in >50% of samples | 8722 (97.7%) | NA |
| unique proteins quantified in 100% of samples | 7641 (85.6%) | NA |
Number of unique peptides and proteins identified from global proteomics data sets and phosphoproteomics data sets across the two experimental TMT11 multiplexes. Unique peptide counts were filtered based on presence in >25%, 33%, 50%, and 100% of sample channels, and percentages displayed represent the fraction of overall unique peptide identifications that pass the filtering criteria.
Figure 3Missing data increase in channels with lower peptide loadings. Percentage of the total identified features [peptides in global proteomics (A), proteins in global proteomics (B), or phosphopeptides in phosphoproteomics (C)] that are quantified in the 4 replicates of each peptide loading group. (D, E) Upset plots demonstrating the overlap of peptides that were quantified (in all four replicates) of each differential peptide loading group. (F–H) Comparison of rates of missing data across channels of TMT plexes loaded with differential amounts of peptide (differential loading) vs loaded with equal peptide amounts (standard loading). Missing data were evaluated at the peptide level in phosphoproteomics data sets (F), or the peptide level (G) or protein level (H) in global proteomics data sets.
Figure 4Impact of loading quantity on data reproducibility and statistics. (A, B) Visualization of intragroup data reproducibility through principal component analysis (PCA) and coefficients of variation (%CV) plots calculated across the four replicates of each loading group. (C, D) Density plots of p-value histograms resulting from unequal variance t tests comparing individual loading groups with the 400 μg standard loading amount at the protein level from global proteomics data sets or peptide level from phosphoproteomics data sets.
Figure 5Differential loading affects the detection of true biological differences. (A) Density plots of adjusted p-values from comparisons of global protein abundances in MOLM-14 and K652 cells with the indicated peptide loadings. (B) Upset plots comparing the statistically significant protein-level results from each of the indicated MOLM-14 vs K652 comparisons. (C–E) Correlation of the calculated fold change for each protein when comparing various K652 peptide loadings with 400 μg of MOLM-14 peptides. (F) Density plots of adjusted p-values from comparisons of phosphopeptide abundances in MOLM-14 and K652 cells with the indicated peptide loadings. (G) Upset plots comparing the statistically significant phosphopeptide-level results from each of the indicated MOLM-14 vs K652 comparisons. (H–J) Correlation of the calculated fold change for each phosphopeptide when comparing various K652 peptide loadings with 400 μg of MOLM-14 peptides.