| Literature DB >> 19594898 |
Monika Pietrowska1, Lukasz Marczak, Joanna Polanska, Katarzyna Behrendt, Elzbieta Nowicka, Anna Walaszczyk, Aleksandra Chmura, Regina Deja, Maciej Stobiecki, Andrzej Polanski, Rafal Tarnawski, Piotr Widlak.
Abstract
BACKGROUND: Mass spectrometric analysis of the blood proteome is an emerging method of clinical proteomics. The approach exploiting multi-protein/peptide sets (fingerprints) detected by mass spectrometry that reflect overall features of a specimen's proteome, termed proteome pattern analysis, have been already shown in several studies to have applicability in cancer diagnostics. We aimed to identify serum proteome patterns specific for early stage breast cancer patients using MALDI-ToF mass spectrometry.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19594898 PMCID: PMC2725033 DOI: 10.1186/1479-5876-7-60
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Figure 1Estimation of the performance of classification of breast cancer samples. A – The total error rate was plotted against the number of features (i.e. spectral components) in the classifier. Shown are average error rates and 95% confidence intervals calculated based on 1000 random validation experiments with 50:50 training/validation split of data. B – Estimation of the sensitivity and specificity of the classification for classifiers built of three or four spectral components. The ROC curve was computed by changing the value of the probability threshold in the SVM classifier from 0.0 to 1.0, and averaging the specificity obtained versus sensitivity rate over 1000 random repeats of training and validation.
Figure 2Characterization of spectral components essential for cancer classification. A – The three most frequent differentiating components are marked with arrows along the mass spectra of serum samples of cancer patients (red lines) and healthy controls (green lines). B – Actual spectral plots of three selected components for cancer patients (red lines) and healthy controls (green lines), as well as modeled Gaussian kernels (blue curves); X-axes represent the m/z values, Y-axes represent intensities. Box-plots on the right represent quantification of the abundance of spectral components in samples from cancer patients (red) and healthy controls (green) (shown are minimum, lower quartile, median, upper quartile and maximum values; outliers are marked by asterisks).
Characteristics of spectral components that differentiated samples from breast cancer patients and healthy controls.
| Component | -95% CI | + 95% CI | S.D. | p-value | Corrected | Frequency | Change |
| 2294.67 | 2283.38 | 2305.96 | 5.76 | 1.28e-12 | 3.84e-10 | 46% | D |
| 2296,88 | 2310,09 | 3,37 | D | ||||
| 2554.37 | 2540.32 | 2568.41 | 7.16 | 4.13e-07 | 1.24e-04 | 1% | U |
| 2845.58 | 2838.34 | 2852.81 | 3.69 | 3.59e-12 | 1.08e-09 | 21% | D |
| 2864.46 | 2866.62 | 7.73 | D | ||||
| 3283.73 | 3265.34 | 3302.13 | 9.39 | 6.60e-07 | 1.98e-04 | 1% | U |
| 3360.19 | 3352.06 | 3368.31 | 4.15 | 5.69e-11 | 1.71e-08 | 22% | D |
| 3427.46 | 3401.71 | 3453.21 | 13.14 | 8.11e-11 | 2.43e-08 | 7% | D |
| 3577.42 | 3580.04 | 9.36 | D | ||||
| 3874.18 | 3863.89 | 3884.47 | 5.25 | 8.08e-09 | 2.42e-06 | 3% | D |
| 3895.05 | 3882.03 | 3908.06 | 6.64 | 1.58e-11 | 4.74e-09 | 6% | D |
| 4965.77 | 4945.35 | 4986.19 | 10.42 | 1.91e-08 | 5.73e-06 | 5% | D |
| 6061.80 | 6050.15 | 6073.45 | 5.94 | 9.53e-09 | 2.86e-06 | 5% | D |
| 6743.99 | 6734.13 | 6753.85 | 5.03 | 2.99e-08 | 8.97e-06 | 2% | D |
Shown are the most frequent spectral components (m/z values), their 95% confidence intervals, standard deviations of the corresponding model Gaussians, and relative frequencies in cancer classifiers built of 4 features. The p-values are for differences between patients and healthy controls measured by the Mann-Whitney U test for each individual component (also shown after the Bonferroni correction against multiple testing). The change refers to either increased (U) or decreased (D) abundance of a given peptide in cancer samples compared to control samples. The three most frequent components are underlined.
Comparison of discriminating spectral components/peptide peaks found in this study and in other published work.
| This study | Other studies | |||||||
| m/z value | p-value | Change | m/z value | p-value | Change | Ref. | Study design | Identity |
| 2303.48 | 6.25e-14 | D | 2306.20 | 1.09e-06 | U | 35 | MALDI/serum/A | C4a |
| 2356.91 | 2.47e-04 | D | 2359.09 | 4.07e-12 | U | 35 | MALDI/serum/A | ITIH4 |
| 2378.80 | 8.91e-06 | D | 2380.03 | 1.26e-07 | U | 35 | MALDI/serum/A | Fibrinogen |
| 2510.80 | 4.65e-08 | D | 2509.16 | 5.56e-13 | U | 35 | MALDI/serum/A | ApoA-IV |
| 2599.75 | 6.03e-04 | U | 2603.15 | 2.08e-07 | U | 35 | MALDI/serum/A | Factor XIIIa |
| 3020.51 | 5.49e-03 | U | 3017.85 | 1.50e-03 | U | 43 | SELDI/NAF/M | |
| 3273.96 | 1.08e-03 | U | 3278.71 | 1.05e-05 | D | 42 | SELDI/serum/M | |
| 3283.73 | 6.80e-07 | U | 3281.5 | 1.77e-04 | U | 38 | MALDI/serum/M | |
| 3973.35 | 1.51e-06 | D | 3284.74 | 3.00e-04 | U | 43 | SELDI/NAF/M | |
| 4648.09 | 3.48e-07 | U | 3975.99 | 3.06e-05 | D | 42 | SELDI/serum/M | |
| 5105.44 | 4.66e-03 | U | 4648 | 4.13e-03 | D | 42 | SELDI/serum/M | |
| 6802.40 | 1.42e-03 | D | 5101.8 | 4.90e-03 | U | 43 | SELDI/NAF/M | |
| 8116.60 | 3.41e-04 | D | 6807.26 | 1.90e-03 | D | 42 | SELDI/serum/M | |
| 8134.75 | 9.61e-04 | D | 8116 | 1.00e-06 | U | 29,33 | SELDI/serum/M | C3a |
| 8656.46 | 2.73e-04 | U | 8138.56 | 7.89e-07 | U | 42 | SELDI/serum/M | |
| 8657.2 | 1.00e-03 | U | 37 | SELDI/Serum/E | ||||
Uncorrected p-values are based on Mann-Whitney U tests in this study. Correspondence of peptide peaks is based on a difference of less than ± 0.2% of the m/z values of [M+H]+ ions. The column "Change" refers to an increased (U) or decreased (D) abundance of a given peptide in breast cancer samples comparing to control samples, and the column "Identity" shows the protein from which the corresponding fragment is derived. Corresponding peptide peaks were found in six studies based on either MALDI or SELDI spectrometry; patient groups consisted of either early (E), advanced (A) or mixed (M) stages. One study analyzed the nipple aspirate fluid (NAF).
Figure 3Estimation of differences of serum proteome patterns between sub-groups of breast cancers patients. Patients were differentiated by age, primary tumor size (T), lymph nodal status (N), histopathological grade (G), and estrogen (ER) and progesterone (PR) receptor expression. A – The total error rates of classification plotted against the number of features in the classifiers as in Fig. 1A; the actual line width corresponded to 95% confidence intervals. B – ROC curves computed for classifiers built of 15 spectral components for each comparison (computation was done as described in Fig. 1B).
Comparison of serum proteome patterns among different sub-groups of breast cancer patients.
| 5353.64 | 11.54 | 0.020 | 48.4 |
| 2475.96 | 2.37 | 0.029 | 19.9 |
| 4098.35 | 25.90 | 0.032 | 22.5 |
| 3024.45 | 11.71 | 0.035 | 20.2 |
| 4070.43 | 5.58 | 0.048 | 19.2 |
| 5353.64 | 11.54 | 0.033 | 21.9 |
| 2873.93 | 7.13 | 0.038 | 20.5 |
| 5343.98 | 6.26 | 0.051 | 15.8 |
| 3024.45 | 11.71 | 0.073 | 32.8 |
| 3249.64 | 5.24 | 0.075 | 12.4 |
| 8618.25 | 10.83 | 0.024 | 45.1 |
| 8602.25 | 29.35 | 0.036 | 29.5 |
| 2909.04 | 10.58 | 0.038 | 32.5 |
| 8607.98 | 5.60 | 0.040 | 25.7 |
| 8682.92 | 8.83 | 0.047 | 28.2 |
| 2937.50 | 6.76 | 0.004 | 34.1 |
| 2556.63 | 8.07 | 0.007 | 44.1 |
| 2909.04 | 10.58 | 0.011 | 36.8 |
| 7547.58 | 12.44 | 0.022 | 37.7 |
| 4793.63 | 5.91 | 0.026 | 8.1 |
| 7915.93 | 29.15 | 0.038 | 42.4 |
| 6302.67 | 4.03 | 0.039 | 23.6 |
| 8246.58 | 14.13 | 0.043 | 27.5 |
| 2599.96 | 3.93 | 0.044 | 28.9 |
| 3367.65 | 13.43 | 0.061 | 22.4 |
| 7101.57 | 8.39 | 0.002 | 50.7 |
| 9965.62 | 16.77 | 0.014 | 37.3 |
| 7750.49 | 24.72 | 0.015 | 30.6 |
| 3367.65 | 13.43 | 0.018 | 30.5 |
| 9934.38 | 23.21 | 0.020 | 28.3 |
The five spectral components with the lowest p-values were selected for each comparison. Shown are spectral components (m/z values), S.Ds. of the corresponding model Gaussians, and their relative frequencies in classifiers. The p-values (uncorrected) are for differences measured by the Mann-Whitney U test for each individual component.
Levels of tumor markers in plasma of breast cancer patients and healthy controls.
| healthy controls | 58 | 1.13 | 1.62 | 1.46 | 0.84 – 1.75 | 0.04 |
| cancer patients | 37 | 1.54 | 2.45 | 3.11 | 1.00 – 2.11 | |
| healthy controls | 58 | 12.3 | 13.28 | 5.45 | 9.5 – 16.4 | 0.63 |
| cancer patients | 37 | 14.0 | 13.74 | 5.79 | 8.3 – 18.5 | |
| healthy controls | 58 | 0.41 | 0.53 | 0.48 | 0.24 – 0.60 | 0.06 |
| cancer patients | 37 | 0.54 | 0.63 | 0.44 | 0.35 – 0.75 | |
| healthy controls | 58 | 27.70 | 33.51 | 23.01 | 17.80 – 41.80 | 0.05 |
| cancer patients | 37 | 23.01 | 24.19 | 16.09 | 10.02 – 31.11 | |
| healthy controls | 50 | 45.90 | 47.13 | 11.9 | 38.70 – 52.20 | 0.0003 |
| cancer patients | 73 | 54.73 | 59.47 | 15.37 | 47.13 – 66.98 | |
Shown are median, mean and S.D. values, as well as lower and upper quartiles. The p-values are for differences between patients and healthy controls measured by the Kruskal-Wallis test.