| Literature DB >> 35480078 |
Mengfei Liu1, Ren Zhou1, Zhen Liu1, Chuanhai Guo1, Ruiping Xu2, Fuyou Zhou2, Anxiang Liu3, Haijun Yang4, Fenglei Li5, Liping Duan6, Lin Shen7, Qi Wu8, Hongchen Zheng1, Hongrui Tian1, Fangfang Liu1, Ying Liu1, Yaqi Pan1, Huanyu Chen1, Zhe Hu1, Hong Cai1, Zhonghu He1, Yang Ke1.
Abstract
Background: Previous risk prediction models taking esophageal malignant lesions detected during endoscopy as the primary outcome are not always sufficient to identify prevalent cases which are "overlooked" at screening. We aimed to update and externally validate our previous risk prediction model for malignant esophageal lesions by redefining the predicted outcome.Entities:
Keywords: ESECC; Esophageal malignant lesions; Precision screening; Risk stratification
Year: 2022 PMID: 35480078 PMCID: PMC9035729 DOI: 10.1016/j.eclinm.2022.101394
Source DB: PubMed Journal: EClinicalMedicine ISSN: 2589-5370
Selected demographic characteristics and life-style variables among subjects in the training set and the validation set.
| Variables | Training Set, n (%) | Validation Set, n (%) | |
|---|---|---|---|
| Age (y) | |||
| Median (interquartile range) | 58 (50, 63) | 54 (49, 60) | <0·01 |
| Gender | |||
| Female | 7716 (50·79) | 2468 (53·93) | <0·01 |
| Male | 7476 (49·21) | 2108 (46·07) | |
| Family history of ESCC | |||
| 0 | 13,523 (89·01) | 4069 (88·92) | 0·06 |
| 1 | 1356 (8·93) | 431 (9·42) | |
| 2 | 244 (1·61) | 67 (1·46) | |
| 3–5 | 69 (0·45) | 9 (0·20) | |
| BMI (kg/m2) | |||
| >22 | 12,462 (82·03) | 3600 (78·67) | <0·01 |
| <=22 | 2730 (17·97) | 976 (21·33) | |
| Smoking | |||
| No | 11,350 (74·71) | 3435 (75·07) | 0·63 |
| Yes | 3842 (25·29) | 1141 (24·93) | |
| Use of coal or wood as main cooking fuel | |||
| No | 7505 (49·40) | 849 (18·55) | <0·01 |
| Yes | 7687 (50·60) | 3727 (81·45) | |
| Pesticide exposure | |||
| No | 5762 (37·93) | 1764 (38·55) | 0·45 |
| Yes | 9430 (62·07) | 2812 (61·45) | |
| Food temperature | |||
| Low | 1739 (11·45) | 607 (13·26) | 0·01 |
| High | 13,453 (88·55) | 3969 (86·64) | |
| Eating speed | |||
| Slow | 2373 (15·62) | 887 (19·38) | <0·01 |
| Fast | 12,819 (84·38) | 3689 (80·62) | |
| Ingestion of leftover food | |||
| No | 9750 (64·18) | 2396 (52·36) | <0·01 |
| Yes | 5442 (35·82) | 2180 (47·64) | |
| SDA detected within 1 year | |||
| No | 15,069 (99·19) | 4520 (98·78) | 0·03 |
| At screening | 113 (0·74) | 52 (1·14) | |
| Interval cancer within 1 year | 10 (0·07) | 4 (0·08) | |
Number of ESCC cases in family members within 3 generations.
P-values reached a significance level of 0.05.
Structure of the prediction model for predicting ESCC within 1 year based on 15,192 subjects enrolled from the screening arm of the ESECC trial.
| Predictors | Total ( | Case ( | Univariate coefficients (95% CI) | Multivariate coefficients (95% CI) |
|---|---|---|---|---|
| Age (continuous) | 58 (50, 63) | 63 (60, 66) | 0·14 (0·11, 0·18) | 0·77 (0·11, 1·52) |
| Age^2 | – | – | 1.19*10 -3 (9.20*10 -4, 1.47*10 -3) | −0·01 (−0·01, 0·00) |
| Family history of ESCC | 0 (0, 0) | 0 (0, 0) | 0.51 (0·23, 0·75) | 0·58 (0·30, 0·82) |
| BMI (kg/m2) | ||||
| >22 | 12,462 (82·03) | 90 (73·17) | Ref | ref |
| <=22 | 2730 (17·97) | 33 (26·83) | 0.52 (0·11, 0·91) | 0·40 (−0·01, 0·80) |
| Use coal or wood as main cooking fuel | ||||
| No | 7505 (49·40) | 39 (31·71) | Ref | ref |
| Yes | 7687 (50·60) | 84 (68·29) | 0·75 (0·38, 1·14) | 0·39 (0·01, 0·79) |
| Eating speed | ||||
| Slow | 2373 (15·62) | 11 (8·94) | Ref | ref |
| Fast | 12,819 (84·38) | 112 (91·06) | 0·64 (0·06, 1·32) | 0·82 (0·24, 1·50) |
| Ingestion of leftover food | ||||
| No | 9750 (64·18) | 64 (52·03) | Ref | ref |
| Yes | 5442 (35·82) | 59 (47·97) | 0·51 (0·15, 0·86) | 0·47 (0·11, 0·83) |
| Constant | – | – | – | −33·22 (−56·10, −13·45) |
Age, gender, family history of ESCC, BMI, cigarette smoking, alcohol drinking, unhealthy dietary habits, ESCC related symptoms, use of coal or wood as main cooking fuel, exposure to fumes in the kitchen, source of drinking water and pesticide exposure were included in a 2-step variable selection method. Variables were first evaluated in the univariate logistic regression model, and variables with odds ratio (OR) >1.3 and P-value<0.5 were subjected to multivariate logistic regression model for further selection. The structure of the final prediction model was determined by the Akaike Information Criterion (AIC). Only variables included in the final prediction model are shown in this Table.
Number of ESCC cases in family members within 3 generations.
Figure 1Receiver operating characteristic (ROC) curves of the risk prediction model for ESCC in the training, internal validation and external validation cohorts
This figure showed ROC curves of the risk prediction model for ESCC in the training, internal validation and external validation cohorts, respectively. Area under the curves (AUCs) were also calculated to quantify the performance of the risk prediction model in discriminating high-risk individuals for ESCC. The AUC during the model development was 0·77 (95% CI: 0·73–0·80), and leave-one-out cross-validation generated a slightly lower AUC of 0·75 (95% CI: 0·72–0·79). When applied to the external validation cohort, the prediction model still showed ideal performance, with an AUC of 0·71 (95% CI: 0·65–0·78).
Abbreviations: AUC, area under the curve; CI, confidence interval; ESCC, esophageal squamous cell carcinoma; ROC, receiver operating characteristic.
Application performance of the established model in different scenarios in the training set and the validation set.
| Proportion of high-risk subjects (%) | Training Set (15,192 subjects, 123 cases) | Validation Set (4576 subjects, 56 cases) | ||||||
|---|---|---|---|---|---|---|---|---|
| No. high-risk subjects | No. SDA | No. endoscopies per case | Detection rate ratio | No. high-risk subjects | No. SDA | No. endoscopies per case | Detection rate ratio | |
| 100 | 15,192 | 123 | 124 | ref | 4576 | 56 | 82 | ref |
| 90 | 13,672 | 123 | 111 | 1·11 | 4118 | 53 | 78 | 1·05 |
| 80 | 12,152 | 122 | 100 | 1·24 | 3660 | 53 | 69 | 1·18 |
| 70 | 10,633 | 120 | 89 | 1·39 | 3202 | 51 | 63 | 1·30 |
| 60 | 9114 | 117 | 78 | 1·59 | 2744 | 50 | 55 | 1·49 |
| 50 | 7595 | 105 | 72 | 1·71 | 2286 | 47 | 49 | 1·68 |
| 40 | 6076 | 99 | 61 | 2·01 | 1828 | 41 | 45 | 1·83 |
| 30 | 4557 | 85 | 54 | 2·30 | 1371 | 35 | 39 | 2·09 |
| 20 | 3038 | 62 | 49 | 2·52 | 914 | 25 | 37 | 2·24 |
| 10 | 1519 | 42 | 36 | 3·42 | 457 | 12 | 38 | 2·15 |
scenario 1, 80% population coverage for endoscopic screening.
scenario 2, 50% population coverage for endoscopic screening.
scenario 3, 10% population coverage for endoscopic screening.