| Literature DB >> 36011082 |
Valentin Bejan1, Elena-Niculina Dragoi2, Silvia Curteanu2, Viorel Scripcariu1, Bogdan Filip1.
Abstract
The incidence of colon, rectal, and colorectal cancer is very high, and diagnosis is often made in the advanced stages of the disease. In cases where peritoneal carcinomatosis is limited, patients can benefit from newer treatment options if the disease is promptly identified, and they are referred to specialized centers. Therefore, an essential diagnostic benefit would be identifying those factors that could lead to early diagnosis. A retrospective study was performed using patient data gathered from 2010 to 2020. The collected data were represented by routine blood tests subjected to stringent inclusion and exclusion criteria. In order to determine the presence or absence of peritoneal carcinomatosis in colorectal cancer patients, three types of machine learning approaches were applied: a neuro-evolutive methodology based on artificial neural network (ANN), support vector machines (SVM), and random forests (RF), all combined with differential evolution (DE). The optimizer (DE in our case) determined the internal and structural parameters that defined the ANN, SVM, and RF in their optimal form. The RF strategy obtained the best accuracy in the testing phase (0.75). Using this RF model, a sensitivity analysis was applied to determine the influence of each parameter on the presence or absence of peritoneal carcinomatosis.Entities:
Keywords: colon cancer; differential evolution algorithm; neural networks; peritoneal carcinomatosis; rectal cancer
Year: 2022 PMID: 36011082 PMCID: PMC9407908 DOI: 10.3390/healthcare10081425
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Statistics of the parameters considered for carcinomatosis and colorectal cancer study group.
| N | Minimum | Maximum | Mean | Std. Deviation | |
|---|---|---|---|---|---|
| Sex | 46 | 1 | 2 | 1.41 | 0.50 |
| Age (years old) | 46 | 36 | 84 | 62.22 | 11.50 |
| Hb (g/DL) | 46 | 6.5 | 16.5 | 12.20 | 2.66 |
| Ht (%) | 46 | 19.9 | 49 | 37.18 | 7.00 |
| PLT (/mm3) | 46 | 152,000 | 702,000 | 339,347.83 | 112,512.96 |
| WBC (/mm3) | 46 | 2830 | 85,700 | 11,187.37 | 11,761.72 |
| Neutr (/mm3) | 46 | 10.07 | 64,700 | 9082.39 | 11,336.75 |
| Lymph (/mm3) | 46 | 180 | 5700 | 1610.43 | 905.72 |
Figure 1Flowchart of patient inclusion and exclusion algorithm.
Statistics of the parameters considered for colorectal cancer study group.
| N | Minimum | Maximum | Mean | Std. Deviation | |
|---|---|---|---|---|---|
| Sex | 49 | 1 | 2 | 1.55 | 0.50 |
| Age (years old) | 49 | 29 | 91 | 69.94 | 12.84 |
| Hb (g/DL) | 49 | 4.7 | 16 | 10.58 | 2.59 |
| Ht (%) | 49 | 17.1 | 46.9 | 33.11 | 6.64 |
| PLT(/mm3) | 49 | 139,000 | 792,000 | 340,551.00 | 129,012.30 |
| WBC(/mm3) | 49 | 3820 | 19,170 | 9127.14 | 3523.92 |
| Neutr(/mm3) | 49 | 2200 | 16,540 | 6627.14 | 3280.35 |
| Lymph(/mm3) | 49 | 177 | 3140 | 1530.35 | 607.70 |
Figure 2Workflow for data classification.
Confusion matrix for the ANN.
| PC (Predicted) | Non-PC (Predicted) | ||
|---|---|---|---|
| Training | PC (actual) | 27 (TP) | 9 (FN) |
| Non-PC (actual) | 3 (FP) | 27 (TN) | |
| Testing | PC (actual) | 8 (TP) | 5 (FN) |
| Non-PC (actual) | 10 (FP) | 6 (TN) |
Figure 3Exploration exploitation balance of the DE algorithm.
Confusion matrix for the SVM.
| PC (Predicted) | Non-PC (Predicted) | ||
|---|---|---|---|
| Training | PC (actual) | 28 (TP) | 8 (FN) |
| Non-PC (actual) | 9 (FP) | 21 (TN) | |
| Testing | PC (actual) | 7 (TP) | 6 (FN) |
| Non-PC (actual) | 2 (FP) | 14 (TN) |
Confusion matrix for the RF.
| PC (Predicted) | Non-PC (Predicted) | ||
|---|---|---|---|
| Training | PC (actual) | 36 (TP) | 0 (FN) |
| Non-PC (actual) | 0 (FP) | 30 (TN) | |
| Testing | PC (actual) | 10 (TP) | 3 (FN) |
| Non-PC (actual) | 4 (FP) | 12 (TN) |
Summary statistics for RF.
| Precision | Recall (or Sensitivity) | F1-Score | Accuracy | Specificity | |
|---|---|---|---|---|---|
| ANN | 0.44 | 0.61 | 0.651 | 0.48 | 0.38 |
| SVM | 0.77 | 0.53 | 0.63 | 0.72 | 0.88 |
| RF | 0.71 | 0.77 | 0.74 | 0.76 | 0.75 |
Sensitivity values.
| Input | Sensitivity Coefficient |
|---|---|
| Age | 0.217220 |
| Ht | 0.153470 |
| Hb | 0.144125 |
| Lymph | 0.133404 |
| Neutr | 0.117887 |
| WBC | 0.112012 |
| PLT | 0.102972 |
| Sex | 0.018910 |