| Literature DB >> 22253617 |
Nai-Jun Fan1, Chun-Fang Gao, Xiu-Li Wang.
Abstract
Background. To explore the application of serum proteomic patterns for the preoperative detection of regional lymph node involvement of colorectal cancer (CRC). Methods. Serum samples were applied to immobilized metal affinity capture ProteinChip to generate mass spectra by Surface-Enhanced Laser Desorption/ionization Time-of-Flight Mass Spectrometry (SELDI-TOF-MS). Proteomic spectra of serum samples from 70 node-positive CRC patients and 75 age- and gender-matched node-negative CRC patients were employed as a training set, and a classification tree was generated by using Biomarker Pattern Software package. The validity of the classification tree was then challenged with a blind test set including another 65 CRC patients. Results. The software identified an average of 46 mass peaks/spectrum and 5 of the identified peaks at m/z 3,104, 3,781, 5,867, 7,970, and 9,290 were used to construct the classification tree. The classification tree separated effectively node-positive CRC patients from node-negative CRC patients, achieving a sensitivity of 94.29% and a specificity of 100.00%. The blind test challenged the model independently with a sensitivity of 91.43% a specificity of 96.67%. Conclusions. The results indicate that SELDI-TOF-MS can correctly distinguish node-positive CRC patients from node-negative ones and show great potential for preoperative screening for regional lymph node involvement of CRC.Entities:
Year: 2011 PMID: 22253617 PMCID: PMC3255105 DOI: 10.1155/2011/784967
Source DB: PubMed Journal: Gastroenterol Res Pract ISSN: 1687-6121 Impact factor: 2.260
Figure 1Representative serum protein profiling spectra of node-positive CRC patients and node-negative CRC patients. The peaks at m/z 7,970 were compared. m/z represents the mass to charge ratio.
SELDI protein peak intensities in serum with significant differences (P < 0.05) between node-positive CRC patients and node-negative CRC patients.
| Peak, m/z | Protein quantification ( |
|
| |
|---|---|---|---|---|
| Node (+) | Node (−) | |||
| 5,867 | 3.534 ± 0.547 | 0.230 ± 0.107 | 58.749 | 0 |
| 9,290 | 3.271 ± 0.587 | 1.443 ± 0.103 | 73.932 | 0 |
| 7,970 | 12.364 ± 1.08 | 7.913 ± 0.668 | 61.235 | 1 |
| 14,433 | 0.891 ± 0.168 | 0.404 ± 0.102 | 42.436 | 1.8 |
| 15,621 | 1.782 ± 0.609 | 5.873 ± 0.655 | 23.125 | 2.16 |
| 15,824 | 0.729 ± 0.111 | 2.072 ± 0.255 | 20.218 | 4.25 |
| 13,652 | 1.388 ± 0.168 | 2.423 ± 0.259 | 26.276 | 7.58 |
| 3,781 | 4.268 ± 0.216 | 6.799 ± 0.977 | 33.346 | 9.28 |
| 15,006 | 0.641 ± 0.150 | 1.824 ± 0.084 | 19.552 | 2.26 |
| 7,802 | 6.580 ± 0.504 | 5.170 ± 0.635 | 16.319 | 5 |
| 4,497 | 9.154 ± 0.639 | 5.139 ± 0.916 | 22.740 | 0.000106 |
| 7,504 | 0.034 ± 0.005 | 0.704 ± 0.271 | 10.960 | 0.000106 |
| 5,967 | 8.809 ± 0.705 | 6.021 ± 0.879 | 18.793 | 0.000124 |
| 5,808 | 4.947 ± 0.797 | 2.252 ± 0.990 | 19.989 | 0.001097 |
| 10,057 | 1.809 ± 0.301 | 1.206 ± 0.245 | 8.139 | 0.001261 |
| 7,597 | 44.557 ± 1.121 | 37.993 ± 1.763 | 13.041 | 0.001549 |
| 5,786 | 16.269 ± 1.880 | 11.966 ± 0.994 | 10.461 | 0.003002 |
| 15,089 | 0.708 ± 0.198 | 1.678 ± 0.042 | 7.895 | 0.003002 |
| 3,104 | 4.719 ± 0.478 | 6.626 ± 0.791 | 7.436 | 0.012677 |
| 14,869 | 1.922 ± 0.653 | 4.278 ± 1.079 | 6.209 | 0.0134 |
| 7,620 | 12.534 ± 2.920 | 8.908 ± 1.474 | 13.036 | 0.017595 |
| 5,662 | 1.760 ± 0.601 | 0.266 ± 0.131 | 12.497 | 0.035988 |
Peaks were named by their mass to charge ratio (m/z).
Figure 2The pattern-matching algorithm to distinguish node-positive CRC patients from node-negative CRC patients on learning mode of training set. The left branch node after Node 1 is the cases of the linear combination: −0.075 (m/z 3,104) −0.231 (m/z 3,781) + 0.157 (m/z 5,867) + 0.086 (m/z 7,970) + 0.953 (m/z 9,290) ≤1.122, and the right one is >1.122. The left branch node after the Node 2 is the linear combination: 0.410 (m/z 3,781) −0.912 (m/z 5,867) ≤0.248, and the right one is >0.248. The cases met to the conditions of Terminal Node 1 and Terminal Node 3 were diagnosed as node-positive CRC patients, and those met to the conditions of Terminal Node 2 were diagnosed as node-negative CRC patients. N represents the number of samples. m/z represents the mass to charge ratio.
Important peaks selected by the biomarker pattern software.
| Peak, m/z | Score | |
|---|---|---|
| 5,867 | 100.00 | ∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣ |
| 9,290 | 100.00 | ∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣ |
| 7,970 | 78.81 | ∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣ |
| 15,823 | 67.55 | ∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣ |
| 15,621 | 63.21 | ∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣∣ |
Peaks were named by their mass to charge ratio (m/z).
*The most important peak was assigned an importance index of 100. The importance of other peaks was compared with the top peak and a number below 100 was given for each peak.
Performance of the classification tree analysis of node-positive CRC patients in training set and test set.
| Sets | Sensitivity (%) | Specificity (%) | Accuracy rate (%) |
|---|---|---|---|
| Training set | |||
| Learning mode | 100.00% (70/70) | 100.00% (75/75) | 100.00% (145/145) |
| Test mode | 94.29% (66/70) | 100.00% (75/75) | 97.24% (141/145) |
| Test set | 91.43% (32/35) | 96.67% (29/30) | 93.85% (61/65) |
Number in parentheses denotes the number of correctly classified samples of the total number of samples in the group.