| Literature DB >> 28386191 |
Abstract
We aimed to evaluate the specificity of 12 tumor markers related to colon carcinoma and identify the most sensitive index. Logistic regression and Bhattacharyya distance were used to evaluate the index. Then, different index combinations were used to establish a support vector machine (SVM) diagnosis model of malignant colon carcinoma. The accuracy of the model was checked. High accuracy was assumed to indicate the high specificity of the index. Through Logistic regression, three indexes, CEA, HSP60 and CA199, were screened out. Using Bhattacharyya distance, four indexes with the largest Bhattacharyya distance were screened out, including CEA, NSE, AFP, and CA724. The specificity of the combination of the above six indexes was higher than that of other combinations, so did the accuracy of the established SVM identification model. Using Logistic regression and Bhattacharyya distance for detection and establishing an SVM model based on different serum marker combinations can increase diagnostic accuracy, providing a theoretical basis for application of mathematical models in cancer diagnosis.Entities:
Keywords: Bhattacharyya distance; Colon carcinoma; Logistic regression; Specificity; Support vector machine; Tumor marker
Year: 2017 PMID: 28386191 PMCID: PMC5372389 DOI: 10.1016/j.sjbs.2017.01.037
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 2213-7106 Impact factor: 4.219
Analysis of 12 serum markers in the two groups (means ± standard deviations).
| Indexes groups | Colon cancer group | Control group |
|---|---|---|
| CEA | 29.31 ± 8.31 (ng/mL) | 4.28 ± 1.39 (ng/mL) |
| NSE | 11.76 ± 2.33 (ng/mL) | 2.45 ± 1.01 (ng/mL) |
| HSP60 | 587.29 ± 477.44 (pg/mL) | 201.45 ± 120.97 (pg/mL) |
| CYFRA21-I | 8.75 ± 2.22 (ng/mL) | 1.98 ± 1.04 (ng/mL) |
| TPA | 0.87 ± 1.25 (U/mL) | 0.081 ± 0.54 (U/mL) |
| AFP | 17.68 ± 5.15 (ng/mL) | 2.78 ± 0.98 (ng/mL) |
| CA199 | 52.03 ± 38.34 (U/mL) | 24.03 ± 12.22 (U/mL) |
| CA242 | 18.55 ± 10.09 (U/mL) | 5.06 ± 1.47 (U/mL) |
| CA724 | 5.87 ± 1.25 (U/mL) | 1.06 ± 0.77 (U/mL) |
| CA125 | 43.05 ± 9.73 (U/mL) | 10.31 ± 7.65 (U/mL) |
| CA153 | 21.40 ± k8.63 (U/mL) | 15.14 ± 2.83 (U/mL) |
| UGT1A8 | 8.52 ± 2.03 (ng/mL) | 34.6 ± 12.16 (ng/mL) |
Variables in Logistic regression equation.
| B | S.E | Wals | df | Sig. | Exp (B) | ||
|---|---|---|---|---|---|---|---|
| Step 1 | CA199 | 1.839 | 0.420 | 19.158 | 1 | 0.000 | 6.291 |
| Constant | 0.024 | 0.220 | 0.012 | 1 | 0.913 | 1.024 | |
| Step 2 | CEA | 1.806 | 0.450 | 16.138 | 1 | 0.000 | 6.086 |
| CA199 | 1.922 | 0.447 | 18.508 | 1 | 0.000 | 6.834 | |
| Constant | −0.640 | 0.283 | 5.102 | 1 | 0.024 | 0.527 | |
| Step 3 | CEA | 1.721 | 0.462 | 13.911 | 1 | 0.000 | 5.592 |
| HSP60 | 1.252 | 0.472 | 7.044 | 1 | 0.008 | 3.496 | |
| CA199 | 1.920 | 0.459 | 17.502 | 1 | 0.000 | 6.823 | |
| Constant | −0.996 | 0.325 | 9.371 | 1 | 0.002 | 0.369 | |
Variable(s) entered on step 1: CA199.
Variable(s) entered on step 2:CEA.
Variable(s) entered on step 3: HSP60.
Bhattacharyya distances of tumor markers in the two groups.
| Index | CEA | NSE | HSP60 | CYFRA21-I | TPA | AFP | CA 199 | CA242 | CA724 | CA125 | CA153 | UGT1A8 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bhattacharyya Distance | 3.4608 | 4.2107 | 1.2176 | 2.7314 | 0.9357 | 3.2135 | 1.0877 | 1.7578 | 3.4332 | 2.4567 | 1.0739 | 2.3742 |
Figure 1Analysis of the accuracy of the SVM model established using 12 tumor marker indexes.
Figure 2Analysis of the accuracy of the SVM model established using 4 tumor marker indexes.