| Literature DB >> 32398793 |
Jörg Reinders1,2, Michael Altenbuchinger3, Katharina Limm4, Philipp Schwarzfischer4, Tamara Scheidt5, Lisa Strasser5, Julia Richter6, Monika Szczepanowski6, Christian G Huber5, Wolfram Klapper6, Rainer Spang3, Peter J Oefner4.
Abstract
Diffuse large B-cell lymphoma (DLBCL) is commonly classified by gene expression profiling according to its cell of origin (COO) into activated B-cell (ABC)-like and germinal center B-cell (GCB)-like subgroups. Here we report the application of label-free nano-liquid chromatography - Sequential Window Acquisition of all THeoretical fragment-ion spectra - mass spectrometry (nanoLC-SWATH-MS) to the COO classification of DLBCL in formalin-fixed paraffin-embedded (FFPE) tissue. To generate a protein signature capable of predicting Affymetrix-based GCB scores, the summed log2-transformed fragment ion intensities of 780 proteins quantified in a training set of 42 DLBCL cases were used as independent variables in a penalized zero-sum elastic net regression model with variable selection. The eight-protein signature obtained showed an excellent correlation (r = 0.873) between predicted and true GCB scores and yielded only 9 (21.4%) minor discrepancies between the three classifications: ABC, GCB, and unclassified. The robustness of the model was validated successfully in two independent cohorts of 42 and 31 DLBCL cases, the latter cohort comprising only patients aged >75 years, with Pearson correlation coefficients of 0.846 and 0.815, respectively, between predicted and NanoString nCounter based GCB scores. We further show that the 8-protein signature is directly transferable to both a triple quadrupole and a Q Exactive quadrupole-Orbitrap mass spectrometer, thus obviating the need for proprietary instrumentation and reagents. This method may therefore be used for robust and competitive classification of DLBCLs on the protein level.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32398793 PMCID: PMC7217957 DOI: 10.1038/s41598-020-64212-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Protein identities, regression weights and intercept of the zero-sum model trained on the development cohort.
| UniProt protein ID | Regression weight |
|---|---|
| P04233 | HG2A_HUMAN | +4.489047570 |
| Q15063 | POSTN_HUMAN | +1.551003753 |
| P51884 | LUM_HUMAN | +1.490571730 |
| Q9NS69 | TOM22_HUMAN | +1.019845999 |
| P62841 | RPS15_HUMAN | −0.888995137 |
| P16070 | CD44_HUMAN | −1.624465232 |
| P01871 | IGHM_HUMAN | −2.556118873 |
| P18031 | PTN1_HUMAN | −3.480889810 |
| Intercept | −1.258554293 |
Figure 1DLBCL subtyping using different methods of gene and protein expression profiling and tissue preservation. Plot (a) shows the GCB gold-standard scores based on Affymetrix GeneChip expression profiling of cryo-preserved tissue sections versus zero-sum scores predicted in a leave-one-out cross validation on nanoLC-SWATH-MS proteome data acquired on FFPE sections that had been prepared in parallel for a total of 42 DLBCL cases. The scores from both technologies agree well with a Pearson correlation coefficient of r = 0.873 despite the differently preserved tissue sections used. The dashed lines are classification boundaries for GCB, unclassified and ABC, respectively. The color bars below the plot contrast the respective classifications, with the absence of major disagreements underscoring the good agreement between the Affymetrix and the nanoLC-SWATH-MS based COO assignments. Plot (b) shows the correlation between nCounter and nanoLC-SWATH-MS based GCB scores with a Pearson correlation coefficient of r = 0.846 and the respective COO assignments obtained for an independent set of 42 FPPE tissue sections. ABCs are indicated by blue circles, unclassified cases by green crosses, and GCBs by red triangles, respectively.
Figure 2Reproducibility of GCB scores using microLC-SWATH-MS. (a) Correlation (r = 0.962, n = 29) between nano- and microLC-SWATH-MS GCB scores based on the summed signal intensities of the 8 proteins constituting the cell-of-origin signature (Table 1). The offset ß0 was estimated by calculating the mean difference between nano- and microLC-SWATH-MS GCB scores. (b) Correlation (r = 0.815) and respective COO assignments obtained by nCounter and microLC-SWATH-MS based COO subtyping of 31 FFPE tissue sections obtained fromDLBCL patients aged >75 years at the time of diagnosis.
Amino acid sequences, nanoLC retention times, and SWATH-MS precursor and three most intense fragment ions of the 17 proteotypic peptides used for the analysis of the cell-of-origin protein signature by selected and parallel reaction monitoring.
| Protein | Peptide sequence | RT (min) | Precursor (m/z2+) | Three most intensive fragment ions (m/z1+) |
|---|---|---|---|---|
| IGHM | QIQVSWLR | 80 | 515.2957 | 788.4413 (y6); 561.3143 (y4); 660.3828 (y5) |
| IGHM | [PGQ]-QIQVSWLRa | 114 | 506.7823 | 561.3143 (y4); 660.3828 (y5); 788.4413 (y6) |
| POSTN | AAAITSDILEALGR | 116 | 700.8908 | 1074.5790 (y10); 973.5313 (y9); 886.4993 (y8) |
| PTN1 | MGLIQTADQLR | 82 | 623.3347 | 831.4319 (y7); 703.3734 (y6); 944.5160 (y8) |
| PTN1 | FSYLAVIEGAK | 100 | 599.3293 | 687.4036 (y7); 800.4876 (y8); 963.5510 (y9) |
| CD44 | YGFIEGHVVIPR | 82 | 462.9225 | 906.5156 (y8); 583.3926 (y5); 484.3242 (y4) |
| CD44 | ALSIGFETC[PPa]Rb | 79 | 584.2950 | 783.3454 (y6); 983.4615 (y8) 726.3239 (y5) |
| CD44 | ESSETPDQFMTADETR | 71 | 922.3862 | 1310.5681 (y11); 970.4299 (y8); 1411.6158 (y12) |
| HG2A | DLISNNEQLPMLGR | 103 | 800.4116 | 1258.6208 (y11); 814.4604 (y7); 1171.5889 (y10) |
| HG2A | LTVTSQNLQLENLR | 87 | 814.9520 | 1315.6964 (y11); 1214.6488 (y10); 999.5582 (y8) |
| LUM | FNALQYLR | 90 | 512.7823 | 763.4461 (y6); 579.3250 (y4); 692.4090 (y5) |
| LUM | LPSGLPVSLLTLYLDNNK | 142 | 653.0383 | 766.3730 (y6); 864.5189 (b6); 980.5048 (y8) |
| RPS15 | GVDLDQLLDMSYEQLMQLYSAR | 171 | 863.4172 | 981.5186 (y8); 1109.5771 (y9); 1238.6198 (y10) |
| RPS15 | GVDLDQLLDM[Oxi]SYEQLMQLYSARc | 166 | 868.7488 | 981.5186 (y8); 1109.5771 (y9); 1238.6198 (y10) |
| RPS15 | DMIILPEMVGSMVGVYN[Dea]GKd | 142 | 685.3384 | 1012.4768 (y10); 737.3611 (b14); 734.8442 (y14) |
| TOM22 | LQMEQQQQLQQR | 50 | 779.3937 | 1316.6376 (y10); 1056.5544 (y8); 1185.5970 (y9) |
| TOM22 | LWGLTEMFPER | 115 | 689.8448 | 1079.5190 (y9); 909.4135 (y7); 808.3658 (y6) |
a[PGQ]-Q, cyclized N-terminal glutamine; bC[PPA], propionamidated cysteine; cM[Oxi], oxidized methionine; dN[Dea], deamidated asparagine.
Figure 3Replication of SWATH-MS GCB scores by targeted analysis of 17 peptides proteotypic for the 8 proteins constituting the cell-of-origin protein signature. (a) Plots show the correlation between the nanoLC-SWATH-MS based GCB scores and the GCB scores obtained by the targeted analysis of the 17 proteotypic peptides listed in Table 2 by either (a) nanoLC-QQQ-MS (r = 0.971, n = 38) or (b) nanoLC Q Exactive™ Hybrid Quadrupole-Orbitrap™ mass spectrometer -MS (r = 0.981, n = 36).