| Literature DB >> 35812896 |
Xuan Li1, Haiyan Zhou2, Xianhui Zhao1, Huan Peng1, Shanshan Luo1, Juan Feng3, Jianfu Heng3, Heli Liu1,4, Jie Ge1.
Abstract
Lymph node metastasis (LNM) is considered to be one of the important factors in determining the optimal treatment for early gastric cancer (EGC). This study aimed to develop and validate a nomogram to predict LNM in patients with EGC. A total of 842 cases from the Surveillance, Epidemiology, and End Results (SEER) database were divided into training and testing sets with a ratio of 6 : 4 for model development. Clinical data (494 patients) from the hospital were used for external validation. Univariate and multivariate logistic regression analyses were used to identify the predictors using the training set. Logistic regression, LASSO regression, ridge regression, and elastic-net regression methods were used to construct the model. The performance of the model was quantified by calculating the area under the receiver operating characteristic curve (AUC) with 95% confidence intervals (CIs). Results showed that T stage, tumor size, and tumor grade were independent predictors of LNM in EGC patients. The AUC of the logistic regression model was 0.766 (95% CI, 0.709-0.823), which was slightly higher than that of the other models. However, the AUC of the logistic regression model in external validation was 0.625 (95% CI, 0.537-0.678). A nomogram was drawn to predict LNM in EGC patients based on the logistic regression model. Further validation based on gender, age, and grade indicated that the logistic regression predictive model had good adaptability to the population with grade III tumors, with an AUC of 0.803 (95% CI, 0.606-0.999). Our nomogram showed a good predictive ability and may provide a tool for clinicians to predict LNM in EGC patients.Entities:
Mesh:
Year: 2022 PMID: 35812896 PMCID: PMC9259240 DOI: 10.1155/2022/8399822
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 3.822
Figure 1Flow diagram of patients extracted from Surveillance, Epidemiology, and End Results (SEER) database and clinical data.
Characteristics of all patients.
| Variables | Clinical data ( | SEER data | ||
|---|---|---|---|---|
| Total ( | Training set ( | Test set ( | ||
| Age (years), mean ± SD | 53.80 ± 10.60 | 69.4 ± 11.3 | 69.05 ± 11.76 | 69.80 ± 10.71 |
|
| ||||
|
| ||||
| Female | 218 (44.13) | 357 (42.40) | 214 (42.38) | 143 (42.43) |
| Male | 276 (55.87) | 485 (57.60) | 291 (57.62) | 194 (57.57) |
|
| ||||
|
| ||||
| T1a | 243 (49.19) | 306 (36.34) | 176 (34.85) | 130 (38.58) |
| T1b | 251 (50.81) | 536 (63.66) | 329 (65.15) | 207 (61.42) |
|
| ||||
|
| ||||
| Overlap | 13 (2.63) | 59 (7.01) | 41 (8.12) | 18 (5.34) |
| Lower | 348 (70.45) | 455 (54.04) | 267 (52.87) | 188 (55.79) |
| Middle | 120 (24.29) | 303 (35.99) | 185 (36.63) | 118 (35.01) |
| Upper | 13 (2.63) | 25 (2.97) | 12 (2.38) | 13 (3.86) |
|
| ||||
|
| ||||
| Diffuse | 302 (61.13) | 69 (8.19) | 36 (7.13) | 33 (9.79) |
| Intestinal | 16 (3.24) | 700 (83.14) | 424 (83.96) | 276 (81.90) |
| Others | 176 (35.63) | 73 (8.67) | 45 (8.91) | 28 (8.31) |
|
| ||||
|
| ||||
| 1- | 130 (26.32) | 148 (17.58) | 135 (26.73) | 110 (32.64) |
| 2- | 151 (30.57) | 245 (29.10) | 117 (23.17) | 57 (16.91) |
| 3- | 122 (24.70) | 174 (20.67) | 60 (11.88) | 53 (15.73) |
| 4- | 57 (11.54) | 113 (13.42) | 101 (20.00) | 61 (18.10) |
| <1 | 34 (6.88) | 162 (19.24) | 92 (18.22) | 56 (16.62) |
|
| ||||
|
| ||||
| I | 314 (63.56) | 143 (16.98) | 78 (15.45) | 65 (19.29) |
| II | 128 (25.91) | 334 (39.67) | 206 (40.79) | 128 (37.98) |
| III | 52 (10.53) | 365 (43.35) | 221 (43.76) | 144 (42.73) |
|
| ||||
|
| ||||
| No | 361 (73.08) | 666 (79.10) | 398 (78.81) | 268 (79.53) |
| Yes | 133 (26.92) | 176 (20.90) | 107 (21.19) | 69 (20.47) |
Note: LNM: lymph node metastasis.
Difference analysis of patients with or without LNM in the training set.
| Variables | Total ( | Non-LNM ( | LNM ( | Statistic |
|
|---|---|---|---|---|---|
| Age (years), mean ± SD | 69.05 ± 11.76 | 69.41 ± 11.62 | 67.74 ± 12.24 |
| 0.193 |
| Gender, |
| 0.606 | |||
| Female | 214 (42.38) | 171 (42.96) | 43 (40.19) | ||
| Male | 291 (57.62) | 227 (57.04) | 64 (59.81) | ||
| T stage, |
| <0.001 | |||
| T1a | 176 (34.85) | 163 (40.95) | 13 (12.15) | ||
| T1b | 329 (65.15) | 235 (59.05) | 94 (87.85) | ||
| Primary site, |
| 0.930 | |||
| Overlap | 41 (8.12) | 31 (7.79) | 10 (9.35) | ||
| Lower | 267 (52.87) | 210 (52.76) | 57 (53.27) | ||
| Middle | 185 (36.63) | 147 (36.93) | 38 (35.51) | ||
| Upper | 12 (2.38) | 10 (2.51) | 2 (1.87) | ||
| Type, |
| 0.354 | |||
| Diffuse | 36 (7.13) | 26 (6.53) | 10 (9.35) | ||
| Intestinal | 424 (83.96) | 339 (85.18) | 85 (79.44) | ||
| Others | 45 (8.91) | 33 (8.29) | 12 (11.21) | ||
| Tumor size (cm), |
| <0.001 | |||
| <1 | 135 (26.73) | 110 (27.64) | 25 (23.36) | ||
| 1- | 117 (23.17) | 92 (23.12) | 25 (23.36) | ||
| 2- | 60 (11.88) | 43 (10.80) | 17 (15.89) | ||
| 3- | 101 (20.00) | 66 (16.58) | 35 (32.71) | ||
| 4- | 92 (18.22) | 87 (21.86) | 5 (4.67) | ||
| Grade, |
| <0.001 | |||
| I | 78 (15.45) | 72 (18.09) | 6 (5.61) | ||
| II | 206 (40.79) | 167 (41.96) | 39 (36.45) | ||
| III | 221 (43.76) | 159 (39.95) | 62 (57.94) |
Note: LNM: lymph node metastasis.
Univariate and multivariate logistic regression analyses of factors associated with LNM.
| Variables | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|
| OR (95% CI) |
| OR (95% CI) |
| |
| Age | 0.99 (0.97–1.01) | 0.193 | — | — |
|
| ||||
|
| ||||
| Female | Ref | |||
| Male | 1.12 (0.73–1.73) | 0.606 | — | — |
|
| ||||
|
| ||||
| T1a | Ref | Ref | ||
| T1b | 5.02 (2.72–9.26) | <0.001 | 3.84 (2.04–7.21) | <0.001 |
|
| ||||
|
| ||||
| Lower | Ref | |||
| Overlap | 1.19 (0.55–2.57) | 0.661 | — | — |
| Middle | 0.95 (0.60–1.51) | 0.836 | — | — |
| Upper | 0.74 (0.16–3.46) | 0.699 | — | — |
|
| ||||
|
| ||||
| Diffuse | Ref | |||
| Intestinal | 0.65 (0.30–1.40) | 0.274 | — | — |
| Others | 0.95 (0.35–2.53) | 0.911 | — | — |
|
| ||||
|
| ||||
| <1 | Ref | |||
| 1- | 3.95 (1.45–10.75) | 0.007 | 2.70 (0.97–7.54) | 0.058 |
| 2- | 4.73 (1.73–12.90) | 0.002 | 3.07 (1.10–8.61) | 0.033 |
| 3- | 6.88 (2.38–19.89) | <0.001 | 4.73 (1.58–14.13) | 0.005 |
| 4- | 9.23 (3.43–24.84) | <0.001 | 5.75 (2.08–15.92) | <0.001 |
|
| ||||
|
| ||||
| I | Ref | Ref | ||
| II | 2.80 (1.14–6.91) | 0.025 | 2.04 (0.80–5.21) | 0.137 |
| III | 4.68 (1.94–11.32) | <0.001 | 3.19 (1.27–8.00) | 0.014 |
Note: LNM: lymph node metastasis; OR: odds ratio; CI: confidence interval; Ref: reference.
The area under the receiver operating characteristic curve (AUC) for different models.
| Models | Training set | Test set |
|---|---|---|
| AUC (95% CI) | AUC (95% CI) | |
| Logistic regression | 0.731 (0.682–0.779) | 0.766 (0.709–0.823) |
| Ridge regression | 0.730 (0.681–0.779) | 0.740 (0.681–0.799) |
| LASSO regression | 0.721 (0.671–0.771) | 0.737 (0.676–0.797) |
| Elastic-net regression | 0.735 (0.686–0.783) | 0.749 (0.691–0.807) |
Nomogram for prediction of LNM in EGC patients.
The performance of the logistic regression prediction model.
| Parameter (95% CI) | SEER data | External validation | |
|---|---|---|---|
| Training set | Test set | ||
| AUC | 0.731 (0.682–0.779) | 0.766 (0.709–0.823) | 0.625 (0.573–0.678) |
| Accuracy | 0.588 (0.544–0.631) | 0.588 (0.533–0.641) | 0.617 (0.573–0.660) |
| Sensitivity | 0.850 (0.769–0.912) | 0.899 (0.802–0.958) | 0.391 (0.308–0.479) |
| Specificity | 0.518 (0.467–0.568) | 0.507 (0.446–0.569) | 0.701 (0.651–0.748) |
| PPV | 0.322 (0.267–0.379) | 0.320 (0.255–0.390) | 0.325 (0.253–0.403) |
| NPV | 0.928 (0.886–0.958) | 0.951 (0.902–0.980) | 0.757 (0.708–0.802) |
| Cutoff | 0.1607 | 0.1607 | 0.1607 |
Note: AUC: the area under the receiver operating characteristic curve; PPV: positive predictive value; NPV: negative predictive value.
Figure 2Receiver operator characteristic (ROC) curves and the area under the ROC curve (AUC) for the logistic regression prediction model in the training set, test set, and external validation. (a) ROC curves in the training set; (b) ROC curves in the test set; (c) ROC curves in the external validation.
Figure 3Nomogram for predicting lymph node metastasis (LNM) in early gastric cancer (EGC) patients. (a) Nomogram; (b) example of the nomogram.
The performance of the prediction model based on different populations.
| Subgroup | Parameter (95% CI) | Test set | External validation |
|---|---|---|---|
| Gender (males) | AUC | 0.793 (0.720–0.866) | 0.603 (0.530–0.675) |
| Sensitivity | 0.919 (0.781–0.983) | 0.621 (0.493–0.738) | |
| Specificity | 0.580 (0.498–0.658) | 0.571 (0.502–0.639) | |
| PPV | 0.340 (0.248–0.442) | 0.313 (0.235–0.400) | |
| NPV | 0.968 (0.910–0.993) | 0.828 (0.756–0.885) | |
| Accuracy | 0.644 (0.573–0.712) | 0.583 (0.523–0.642) | |
|
| |||
| Gender (females) | AUC | 0.729 (0.635–0.822) | 0.673 (0.597–0.749) |
| Sensitivity | 0.875 (0.710–0.965) | 0.358 (0.245–0.485) | |
| Specificity | 0.441 (0.347–0.539) | 0.821 (0.751–0.879) | |
| PPV | 0.311 (0.218–0.417) | 0.471 (0.329–0.615) | |
| NPV | 0.925 (0.818–0.979) | 0.743 (0.669–0.807) | |
| Accuracy | 0.538 (0.453–0.622) | 0.679 (0.613–0.740) | |
|
| |||
| Age (≥65 years) | AUC | 0.755 (0.688–0.821) | 0.478 (0.339–0.617) |
| Sensitivity | 0.896 (0.773–0.965) | 1.000 (0.832–1.000) | |
| Specificity | 0.548 (0.476–0.618) | 1.000 (0.937–1.000) | |
| PPV | 0.323 (0.245–0.410) | 0.740 (0.628–0.834) | |
| NPV | 0.956 (0.901–0.986) | — | |
| Accuracy | 0.615 (0.552–0.676) | 0.740 (0.628–0.834) | |
|
| |||
| Age (<65 years) | AUC | 0.794 (0.681–0.907) | 0.644 (0.587–0.700) |
| Sensitivity | 0.905 (0.696–0.988) | 0.398 (0.307–0.495) | |
| Specificity | 0.464 (0.343–0.588) | 0.734 (0.680–0.782) | |
| PPV | 0.339 (0.218–0.478) | 0.357 (0.274–0.447) | |
| NPV | 0.941 (0.803–0.993) | 0.766 (0.713–0.814) | |
| Accuracy | 0.567 (0.458–0.671) | 0.643 (0.595–0.689) | |
|
| |||
| Grade (I) | AUC | 0.722 (0.583–0.861) | 0.695 (0.635–0.755) |
| Sensitivity | 1.000 (0.158–1.000) | 0.500 (0.395–0.605) | |
| Specificity | 0.905 (0.804–0.964) | 0.764 (0.702–0.818) | |
| PPV | 1.000 (0.541–1.000) | 0.475 (0.373–0.578) | |
| NPV | 0.966 (0.883–0.996) | 0.781 (0.720–0.835) | |
| Accuracy | 0.877 (0.772–0.945) | 0.685 (0.630–0.736) | |
|
| |||
| Grade (II) | AUC | 0.645 (0.540–0.750) | 0.632 (0.523–0.742) |
| Sensitivity | 0.870 (0.664–0.972) | 0.667 (0.482–0.820) | |
| Specificity | 0.429 (0.332–0.529) | 0.474 (0.370–0.579) | |
| PPV | 0.250 (0.160–0.359) | 0.306 (0.202–0.425) | |
| NPV | 0.938 (0.828–0.987) | 0.804 (0.676–0.898) | |
| Accuracy | 0.508 (0.418–0.597) | 0.523 (0.433–0.612) | |
|
| |||
| Grade (III) | AUC | 0.731 (0.647–0.815) | 0.803 (0.606–0.999) |
| Sensitivity | 0.955 (0.845–0.994) | 0.833 (0.359–0.996) | |
| Specificity | 0.310 (0.221–0.410) | 0.457 (0.309–0.610) | |
| PPV | 0.378 (0.288–0.475) | 0.167 (0.056–0.347) | |
| NPV | 0.939 (0.798–0.993) | 0.955 (0.772–0.999) | |
| Accuracy | 0.507 (0.422–0.591) | 0.500 (0.358–0.642) | |
Note: AUCarea under the curve; PPVpositive predictive value; NPVnegative predictive value.