| Literature DB >> 27800494 |
Hiroyuki Ogihara1, Norio Iizuka2, Yoshihiko Hamamoto3.
Abstract
We discuss a novel diagnostic method for predicting the early recurrence of liver cancer with high accuracy for personalized medicine. The difficulty with cancer treatment is that even if the types of cancer are the same, the cancers vary depending on the patient. Thus, remarkable attention has been paid to personalized medicine. Unfortunately, although the Tokyo Score, the Modified JIS, and the TNM classification have been proposed as liver scoring systems, none of these scoring systems have met the needs of clinical practice. In this paper, we convert continuous and discrete data to categorical data and keep the natively categorical data as is. Then, we propose a discrete Bayes decision rule that can deal with the categorical data. This may lead to its use with various types of laboratory data. Experimental results show that the proposed method produced a sensitivity of 0.86 and a specificity of 0.49 for the test samples. This suggests that our method may be superior to the well-known Tokyo Score, the Modified JIS, and the TNM classification in terms of sensitivity. Additional comparative study shows that if the numbers of test samples in two classes are the same, this method works well in terms of the F1 measure compared to the existing scoring methods.Entities:
Mesh:
Year: 2016 PMID: 27800494 PMCID: PMC5075355 DOI: 10.1155/2016/8567479
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Divisions and numbers of patients with each marker.
| Division of markers |
|
|
|---|---|---|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| ⋮ | ⋮ | ⋮ |
|
| ||
|
|
|
|
|
| ||
| ⋮ | ⋮ | ⋮ |
|
| ||
|
|
|
|
|
| ||
| ⋮ | ⋮ | ⋮ |
Figure 1Arrangement of training samples of the recurrence class for markers x 1 and x 2.
Division and number of training samples in each class.
| Division of markers | Number of samples | |
|---|---|---|
| 29 | 89 | |
| Recurrence within 1 year | Nonrecurrence within 1 year | |
|
| 15 | 60 |
|
| 14 | 29 |
|
| ||
|
| 6 | 47 |
|
| 11 | 29 |
|
| 12 | 13 |
|
| ||
|
| 10 | 18 |
|
| 19 | 71 |
|
| ||
|
| 14 | 44 |
|
| 15 | 45 |
|
| ||
|
| 9 | 16 |
|
| 20 | 73 |
|
| ||
|
| 16 | 70 |
|
| 13 | 19 |
|
| ||
|
| 18 | 67 |
|
| 11 | 22 |
|
| ||
|
| 17 | 61 |
|
| 12 | 28 |
|
| ||
|
| 25 | 79 |
|
| 4 | 10 |
|
| ||
|
| 15 | 60 |
|
| 14 | 29 |
Figure 2Selection of optimal markers.
Figure 3Flow of classifier design and evaluation.
Figure 4Relationships between the training sample subsets.
Optimal combinations of markers per number of markers and their discrimination performances obtained using training samples.
| Number of markers | Sensitivity | Specificity | Youden index | Combination of markers | |||||
|---|---|---|---|---|---|---|---|---|---|
| 3 | 0.79 | 0.50 | 0.29 | Tumor number × tumor size | vp | Liver damage | |||
|
| |||||||||
| 4 | 0.80 | 0.50 | 0.30 | Tumor number × tumor size | vp | ICG | Liver damage | ||
|
| |||||||||
| 5 | 0.75 | 0.50 | 0.25 | ALB | Tumor number × tumor size | vv | Degree of differentiation | Liver damage | |
|
| |||||||||
| 6 | 0.74 | 0.51 | 0.24 | ALB | Tumor number × tumor size | vp | ICG | Degree of differentiation | Liver damage |
|
| |||||||||
| 7 | — | — | — | ||||||
| 8 | — | — | — | ||||||
| 9 | — | — | — | ||||||
Figure 5Relationship between the number of training samples and sensitivity.
Figure 6ROC curve.
Figure 7Relation between the number of markers and discrimination time.
(a) Breakdown of training samples
| Virus type | Recurrence within 1 year | Nonrecurrence within 1 year |
|---|---|---|
| B | 6 | 16 |
| C | 18 | 56 |
| Samples that are neither B type nor C type | 5 | 17 |
|
| ||
| Total number of samples | 29 | 89 |
(b) Breakdown of test samples
| Virus type | Recurrence within 1 year | Nonrecurrence within 1 year |
|---|---|---|
| B | 6 | 16 |
| C | 17 | 55 |
| Samples that are neither B type nor C type | 5 | 17 |
|
| ||
| Total number of samples | 28 | 88 |
(a) Results using 28 recurrence test samples and 88 nonrecurrence test samples
| Index | Proposed method | Modified JIS with 3 | TNM classification with 2 | Tokyo score with 2 |
|---|---|---|---|---|
| Accuracy | 0.58 | 0.77 | 0.34 | 0.49 |
| Sensitivity, recall | 0.86 | 0.57 | 0.96 | 0.71 |
| Specificity | 0.49 | 0.83 | 0.14 | 0.42 |
|
| 0.49 | 0.54 | 0.41 | 0.40 |
| Youden index | 0.35 | 0.40 | 0.10 | 0.13 |
| Diagnostic odds ratio | 5.73 | 6.49 | 4.26 | 1.81 |
(b) Results using 28 test samples/class obtained by resampling
| Index | Proposed method | Modified JIS with 3 | TNM classification with 2 | Tokyo score with 2 |
|---|---|---|---|---|
| Accuracy | 0.67 [0.66, 0.69] | 0.70 [0.69, 0.71] | 0.55 [0.54, 0.56] | 0.57 [0.55, 0.58] |
| Sensitivity, recall | 0.86 | 0.57 | 0.96 | 0.71 |
| Specificity | 0.49 [0.46, 0.52] | 0.83 [0.81, 0.85] | 0.14 [0.12, 0.16] | 0.42 [0.39, 0.44] |
|
| 0.73 [0.72, 0.73] | 0.66 [0.65, 0.67] | 0.68 [0.68, 0.69] | 0.62 [0.62, 0.63] |
| Youden index | 0.35 [0.32, 0.37] | 0.40 [0.38, 0.42] | 0.10 [0.08, 0.12] | 0.13 [0.11, 0.16] |
| Diagnostic odds ratio | 6.03 [5.39, 6.67] | 7.88 [6.22, 9.53] | 4.62 [3.86, 5.38] | 1.95 [1.74, 2.15] |