| Literature DB >> 35740581 |
Victor Ianole1,2, Mihai Danciu1,2, Constantin Volovat3, Cipriana Stefanescu4, Paul-Corneliu Herghelegiu5, Florin Leon5, Adrian Iftene6, Ciprian-Gabriel Cusmuliuc6, Bogdan Toma1, Vasile Drug7, Delia Gabriela Ciobanu Apostol1,2.
Abstract
AIM: The need for predictive and prognostic biomarkers in colorectal carcinoma (CRC) brought us to an era where the use of artificial intelligence (AI) models is increasing. We investigated the expression of Claudin-7, a tight junction component, which plays a crucial role in maintaining the integrity of normal epithelial mucosa, and its potential prognostic role in advanced CRCs, by drawing a parallel between statistical and AI algorithms.Entities:
Keywords: Claudin-7; advanced colorectal carcinoma; artificial intelligence models; cancer; immunohistochemistry; prognostic biomarkers
Year: 2022 PMID: 35740581 PMCID: PMC9221359 DOI: 10.3390/cancers14122915
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Visualization of independent inputs vs. outputs, for the classification problem (Cldn7—Claudin-7; P—percentage, I—intensity, D—membranous staining pattern: discontinuous vs. continuous).
Figure 2Visualization of independent inputs vs. outputs, for the regression problem (Cldn7—Claudin-7; P—percentage, I—intensity, D—membranous staining pattern: discontinuous vs. continuous).
Comparison of Cldn7 expression between tumor core and invasive front.
| n | Min | Max | Mean | |
|---|---|---|---|---|
| Cldn7 (Core) P | 84 | 2 | 3 | 2.99 |
| Cldn7 (Core) I | 84 | 1 | 3 | 2.42 |
| Cldn7 (Front) P | 84 | 1 | 3 | 2.90 |
| Cldn7 (Front) I | 84 | 1 | 3 | 1.81 |
| Control group | 20 | 0 | 3 | 3 |
Abbreviations: Cldn7, Claudin-7; Cldn7 (Core) P, the proportion of Claudin-7 stained tumor cells in the tumor core; Cldn7 (Core) I, the intensity of Claudin-7 immunoreaction in the tumor core; Cldn7 (Front) P, the proportion of Claudin-7 stained tumor cells in invasive front; Cldn7 (Front) I, the intensity of Claudin-7 immunoreaction in invasive front.
Comparison of Cldn7 staining pattern between tumor core and invasive front.
| Staining Pattern | Cldn7 (Core) | Cldn7 (Front) | ||
|---|---|---|---|---|
| n | % | n | % | |
| Continuous | 62 | 73.8 | 15 | 17.9 |
| Discontinuous | 22 | 26.2 | 69 | 82.1 |
| Total | 84 | 100.0 | 84 | 100.0 |
Abbreviations: Cldn7, Claudin-7.
Figure 3Immunohistochemical profile of Claudin-7 (IHC, anti-Claudin-7 monoclonal antibody) in: (a) normal colonic mucosa (magnification ×50); (b) CRC, intensity decreasing from core to invasive front (magnification ×50); (c) CRC, continuous membranous staining pattern in the tumor core and invasive front (magnification ×50); (d) CRC, continuous membranous staining pattern in the tumor core (magnification ×200); (e) CRC, discontinuous membranous staining pattern in the tumor core (magnification ×200); (f) CRC, discontinuous membranous staining pattern in invasive front (magnification ×100).
Univariate analysis of Claudin-7 and clinicopathological parameters.
| Clinicopathological Parameters | Cldn7 (Core) P | Cldn7 | Cldn7 | Cldn7 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 3 | 1 | 2 | 3 | 1 | 2 | 3 | 1 | 2 | 3 | ||||||
|
|
|
|
|
|
|
|
|
|
|
| ||||||
| Age | 21–30 | 0 | 1 | 0.310 | 0 | 1 | 0 | 0.655 | 0 | 0 | 1 | 0.227 | 0 | 1 | 0 | 0.647 |
| 31–40 | 0 | 2 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | |||||
| 41–50 | 1 | 11 | 2 | 3 | 7 | 0 | 1 | 11 | 4 | 7 | 1 | |||||
| 51–60 | 0 | 24 | 2 | 9 | 13 | 0 | 1 | 23 | 9 | 12 | 3 | |||||
| 61–70 | 0 | 34 | 1 | 18 | 15 | 0 | 2 | 32 | 10 | 17 | 7 | |||||
| 71–80 | 0 | 10 | 1 | 5 | 4 | 1 | 1 | 8 | 3 | 7 | 0 | |||||
| 81–90 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | |||||
| Sex | F | 1 | 35 | 0.429 | 3 | 16 | 17 | 1.000 | 1 | 3 | 32 | 0.523 | 13 | 18 | 5 | 0.953 |
| M | 0 | 48 | 3 | 21 | 24 | 0 | 3 | 45 | 15 | 26 | 7 | |||||
| T stage | T2 | 1 | 10 | 2 | 6 | 3 | 0 | 1 | 10 | 3 | 7 | 1 | ||||
| T3 | 0 | 41 | 3 | 21 | 17 | 0 | 4 | 37 | 17 | 17 | 7 | |||||
| T4a | 0 | 27 | 0.190 | 1 | 8 | 18 | 0.206 | 1 | 1 | 25 | 0.744 | 8 | 15 | 4 | 0.389 | |
| T4b | 0 | 5 | 0 | 2 | 3 | 0 | 0 | 5 | 0 | 5 | 0 | |||||
| N stage | N0 | 0 | 19 | 0 | 12 | 7 | 1 | 1 | 17 | 7 | 11 | 1 | ||||
| N1a | 0 | 6 | 1 | 1 | 4 | 0 | 1 | 5 | 3 | 2 | 1 | |||||
| N1b | 0 | 18 | 1 | 8 | 9 | 0 | 1 | 17 | 8 | 9 | 1 | |||||
| N1c | 0 | 7 | 0.298 | 1 | 2 | 4 | 0.613 | 0 | 0 | 7 | 0.677 | 2 | 4 | 1 | 0.438 | |
| N2a | 1 | 11 | 1 | 5 | 6 | 0 | 2 | 10 | 4 | 7 | 1 | |||||
| N2b | 0 | 22 | 2 | 9 | 11 | 0 | 1 | 21 | 4 | 11 | 7 | |||||
| Grading | High Grade | 0 | 15 | 1 | 6 | 8 | 0 | 1 | 14 | 3 | 8 | 4 | ||||
| Low Grade | 1 | 68 | 0.821 | 5 | 31 | 33 | 0.906 | 1 | 5 | 63 | 1.000 | 25 | 36 | 8 | 0.278 | |
| Tumor Location | LeftColon | 1 | 42 | 4 | 20 | 19 | 1 | 3 | 39 | 13 | 27 | 3 | ||||
| Rectum | 0 | 10 | 1.000 | 0 | 6 | 4 | 0.625 | 0 | 2 | 8 | 0.312 | 6 | 2 | 2 | 0.053 | |
| RightColon | 0 | 31 | 2 | 11 | 18 | 0 | 1 | 30 | 9 | 15 | 7 | |||||
| Venous Invasion | V0 | 0 | 20 | 0 | 12 | 8 | 0 | 1 | 19 | 5 | 10 | 5 | ||||
| V1 | 1 | 63 | 0.762 | 6 | 25 | 33 | 0.168 | 1 | 5 | 58 | 1.000 | 23 | 34 | 7 | 0.255 | |
| Lymphovascular Invasion | L0 | 0 | 9 | 0 | 7 | 2 | 0 | 1 | 8 | 3 | 6 | 0 | ||||
| L1 | 1 | 74 | 0.893 | 6 | 30 | 39 | 0.122 | 1 | 5 | 69 | 0.562 | 25 | 38 | 12 | 0.553 | |
| Perineural Invasion | Pn0 | 0 | 20 | 2 | 11 | 7 | 0 | 1 | 19 | 5 | 12 | 3 | ||||
| Pn1 | 1 | 63 | 0.762 | 4 | 26 | 34 | 0.352 | 1 | 5 | 58 | 1.000 | 23 | 32 | 9 | 0.640 | |
| Tumour Growth Pattern | Expansive | 0 | 10 | 1 | 4 | 5 | 0 | 1 | 9 | 2 | 6 | 2 | ||||
| Infiltrative | 1 | 73 | 0.881 | 5 | 33 | 36 | 0.882 | 1 | 5 | 68 | 0.603 | 26 | 38 | 10 | 0.651 | |
| Tumor Deposits | Absent | 1 | 42 | 4 | 22 | 17 | 1 | 2 | 40 | 15 | 23 | 5 | ||||
| Present | 0 | 41 | 0.512 | 2 | 15 | 24 | 0.244 | 0 | 4 | 37 | 0.427 | 13 | 21 | 7 | 0.832 | |
| Tumor Budding | Bd1 | 0 | 2 | 0 | 2 | 0 | 0 | 0 | 2 | 1 | 1 | 0 | ||||
| Bd2 | 0 | 12 | 1.000 | 1 | 7 | 4 | 0.354 | 0 | 2 | 10 | 0.438 | 3 | 6 | 3 | 0.743 | |
| Bd3 | 1 | 69 | 5 | 28 | 37 | 1 | 4 | 65 | 24 | 37 | 9 | |||||
| Leukocyte Infiltrate | 3 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | ||||
| 5 | 0 | 23 | 1 | 10 | 12 | 0 | 2 | 21 | 5 | 15 | 3 | |||||
| 10 | 0 | 26 | 3 | 10 | 13 | 1 | 3 | 22 | 13 | 11 | 2 | |||||
| 15 | 1 | 8 | 2 | 2 | 5 | 0 | 1 | 8 | 5 | 2 | 2 | |||||
| 20 | 0 | 11 | 0 | 5 | 6 | 0 | 0 | 11 | 3 | 5 | 3 | |||||
| 25 | 0 | 1 | 0.143 | 0 | 0 | 1 | 0.366 | 0 | 0 | 1 | 0.876 | 1 | 0 | 0 | 0.033 | |
| 30 | 0 | 12 | 0 | 9 | 3 | 0 | 0 | 12 | 1 | 10 | 1 | |||||
| 50 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | |||||
Abbreviations: Cldn7, Claudin-7; Cldn7 (Core) P, the proportion of Claudin-7 stained tumor cells in the tumor core; Cldn7 (Core) I, the intensity of Claudin-7 immunoreaction in the tumor core; Cldn7 (Front) P, the proportion of Claudin-7 stained tumor cells in invasive front; Cldn7 (Front) I, the intensity of Claudin-7 immunoreaction in invasive front.
Figure 4Prognostic significance of Claudin-7 intensity (I) expression regarding overall survival as calculated by Kaplan–Meier analysis and scatter plots: (a) invasive front; (b) tumor core. The intensity of immunoreaction: 1—weak, 2—moderate, 3—strong.
The results of different algorithms for the classification problem.
| Algorithm | Training | Cross Validation | Leave One Out |
|---|---|---|---|
| NN | 60.7143% | 36.9048% | 40.4762% |
| kNN, k =10, w = 1/d | 60.7143% | 38.0952% | 42.8571% |
| NNGE | 48.8095% | 45.2381% | 42.8571% |
| C4.5 | 51.1905% | 40.4762% | 38.0952% |
| Random Forest, 100 trees | 60.7143% | 39.2857% | 45.2381% |
| SVM, RBF kernel | 60.7143% | 34.5238% | 39.2857% |
| Linear model | 59.5238% | 45.2381% | 47.619% |
The results of different algorithms for the regression problem.
| Algorithm | Training | Cross Validation | Leave One Out |
|---|---|---|---|
| NN | 0.5803 | 0.3225 | 0.2759 |
| kNN, k =10, w = 1/d | 0.5801 | 0.2981 | 0.2524 |
| Random Forest, 100 trees | 0.5715 | 0.3069 | 0.2843 |
| SVR, RBF kernel | 0.55 | 0.156 | 0.1066 |
| Linear regression | 0.498 | 0.3268 | 0.2529 |
The information gain values.
| Clinicopathological Parameters | O1 | O2 | O3 | O4 | |
|---|---|---|---|---|---|
| Cldn7 | Cldn7 (Core) I | Cldn7 (Front) P | Cldn7 (Front) I | ||
| I1 | Tumor Location | 0.01328 | 0.02698 | 0.03564 | 0.07025 |
| I2 | Grading | 0.00389 | 0.00277 | 0.00406 | 0.02741 |
| I3 | T-stage | 0.03746 |
| 0.03254 | 0.05841 |
| I4 | N-stage |
|
|
|
|
| I5 | Venous Invasion | 0.00492 | 0.04001 | 0.00640 | 0.01225 |
| I6 | Lymphovascular Invasion | 0.00241 | 0.05595 | 0.00358 | 0.03046 |
| I7 | Perineural Invasion | 0.00525 | 0.02838 | 0.00738 | 0.00748 |
| I8 | Leukocyte Infiltrate |
|
|
|
|
| I9 | Tumor Deposits | 0.01328 | 0.04099 | 0.01975 | 0.01006 |
| I10 | Tumor Budding | 0.00389 | 0.05448 | 0.01788 | 0.01834 |
| I11 | Tumor Growth Pattern | 0.00274 | 0.00181 | 0.00324 | 0.01211 |
| I12 | Sex | 0.01600 | 0.00256 | 0.01735 | 0.00179 |
| I13 | Age Group |
| 0.10040 |
|
|
Abbreviations: Cldn7—Claudin-7; P—percentage; I—intensity. In each column corresponding to an output Oi, the red values marked by bold typeface represent the first (i.e., best) value and the blue ones represent the second and third values.
Figure 5The increase of classification accuracy for each output by adding input attributes in the order specified by information gain: (a) output O1; (b) output O2; (c) output O3; (d) output O4.
Figure 6The increase of classification accuracy for output O2 for a bounded subset of inputs.
Figure 7The increase of classification accuracy for output O4 using Random Forest.