| Literature DB >> 35070749 |
Boyoung Park1,2,3, Yeol Kim1,2, Jaeho Lee1,2, Nayoung Lee1,2, Seung Hun Jang4.
Abstract
BACKGROUND: This study developed a new lung cancer risk prediction model for the Korean population and evaluated the performance, compared to the previously reported risk models developed in Western countries.Entities:
Keywords: Lung cancer; National Lung Screening Trial criteria (NLST criteria); low-dose computed tomography screening (LDCT screening); prediction model
Year: 2021 PMID: 35070749 PMCID: PMC8743515 DOI: 10.21037/tlcr-21-566
Source DB: PubMed Journal: Transl Lung Cancer Res ISSN: 2218-6751
Figure 1Flow diagram of the selection of the eligible population. *, the number of missing values for each variable includes people with missing information for two or more variables. Thus, the total does not match 453,507.
Baseline characteristics of ever-smokers in the National Health Screening Program 2007–2008
| Characteristics | Total (N=969,351) | Training dataset (N=678,407) | Validation dataset (N=290,994) |
|---|---|---|---|
| Age, years | |||
| Mean (standard deviation) | 54.9 (9.3) | 54.9 (9.3) | 54.9 (9.3) |
| 40–49 | 323,722 (33.4%) | 226,492 (33.4%) | 97,230 (33.4%) |
| 50–59 | 350,747 (36.2%) | 245,545 (36.2%) | 105,202 (36.2%) |
| 60–69 | 195,808 (20.2%) | 137,092 (20.2%) | 58,716 (20.2%) |
| 70–79 | 99,074 (10.2%) | 69,278 (10.2%) | 29,796 (10.2%) |
| Sex | |||
| Male | 893,906 (92.2%) | 625,650 (92.2%) | 268,256 (92.2%) |
| Female | 75,445 (7.8%) | 52,757 (7.8%) | 22,688 (7.8%) |
| Smoking status | |||
| Past | 364,768 (37.6%) | 255,331 (37.6%) | 109,487 (37.6%) |
| Current | 604,583 (62.4%) | 423,076 (62.4%) | 181,507 (62.4%) |
| Pack-year | |||
| Mean (standard deviation) | 23.1 (15.2) | 23.1 (15.2) | 23.1 (15.2) |
| <10 pack-years | 157,453 (16.2%) | 110,149 (16.2%) | 47,304 (16.3%) |
| 10–19.9 pack-years | 265,981 (27.4%) | 186,561 (27.5%) | 79,420 (27.3%) |
| 20–29.9 pack-years | 245,179 (25.3%) | 171,334 (25.3%) | 73,845 (25.4%) |
| ≥30 pack-years | 300,738 (31.0%) | 210,363 (31.0%) | 90,375 (31.1%) |
| Years since cessation | |||
| Current | 604,583 (62.4%) | 423,076 (62.4%) | 18,1507 (62.4%) |
| <5 years | 111,446 (11.5%) | 77,983 (11.5%) | 33,463 (11.5%) |
| <15 years | 163,007 (16.8%) | 114,229 (16.8%) | 48,778 (16.8%) |
| ≥15 years | 90,315 (9.3%) | 63,119 (9.3%) | 27,196 (9.4%) |
| Number of days of alcohol consumption | |||
| <5/week | 855,843 (88.3%) | 599,042 (88.3%) | 256,801 (88.3%) |
| ≥5/week | 113,508 (11.7%) | 79,365 (11.7%) | 34,143 (11.7%) |
| Number of days of sweating exercise | |||
| <3/week | 675,201 (69.7%) | 472,545 (69.7%) | 202,656 (69.7%) |
| ≥3/week | 294,150 (30.4%) | 205,862 (30.3%) | 88,288 (30.4%) |
| Body mass index, kg/m2 | |||
| Mean (standard deviation) | 24.0 (3.0) | 24.0 (2.9) | 24.0 (3.1) |
| <20.0 | 23,317 (2.4%) | 16,258 (2.4%) | 7,059 (2.4%) |
| 20.0–24.9 | 595,045 (61.4%) | 416,797 (61.4%) | 178,248 (61.3%) |
| ≥25 | 350,989 (36.2%) | 245,352 (36.2%) | 105,637 (36.3%) |
| Family history of cancer | |||
| No | 746,127 (77.0%) | 522,416 (77.0%) | 223,711 (76.9%) |
| Yes | 223,224 (23.0%) | 155,991 (23.0%) | 67,233 (23.1%) |
| History of chronic pulmonary obstructive disease | |||
| No | 948,895 (97.9%) | 664,079 (97.9%) | 284,816 (97.9%) |
| Yes | 20,456 (2.1%) | 14,328 (2.1%) | 6,128 (2.1%) |
| History of emphysema | |||
| No | 966,894 (99.8%) | 676,738 (99.8%) | 290,156 (99.7%) |
| Yes | 2,457 (0.3%) | 1,669 (0.3%) | 788 (0.3%) |
| History of pneumoconiosis | |||
| No | 968,848 (100%) | 678,045 (100%) | 290,803 (100%) |
| Yes | 503 (0.1%) | 362 (0.1%) | 141 (0.1%) |
| History of interstitial pulmonary disease | |||
| No | 968,454 (99.9%) | 677,769 (99.9%) | 290,685 (99.9%) |
| Yes | 897 (0.1%) | 638 (0.1%) | 259 (0.1%) |
| History of cancer | |||
| No | 961,814 (99.2%) | 673,163 (99.2%) | 288,651 (99.2%) |
| Yes | 7,537 (0.8%) | 5,244 (0.8%) | 2,293 (0.8%) |
N, number.
Predictive performance of previously developed models
| Model | Area under the curve | Expected/observed ratio |
|---|---|---|
| Bach | 0.661 (0.598–0.665) | 2.23 (2.07–2.40) |
| LCRAT | 0.811 (0.807–0.814) | 4.73 (4.50–4.96) |
| PLCOM2012 2013† | 0.772 (0.768–0.777) | 1.24 (1.22–1.26) |
| Simplified PLCOM2012 2013‡ | 0.781 (0.776–0.785) | 1.10 (1.08–1.16) |
| Pittsburgh 2015 | 0.781 (0.778–0.784) | 1.21 (1.19–1.23) |
| LLPi 2015§ | 0.803 (0.800–0.806) | 3.25 (3.20–3.31) |
| Simplified LLPi 2015¶ | 0.803 (0.800–0.806) | 3.21 (3.16–3.26) |
| Korean model | 0.816 (0.810–0.822) | 0.995 (0.973–1.017) |
†, the race was assumed to be Asian. The educational level was assumed to be some college. A family history of lung cancer was imputed as a family history of any cancer. ‡, the race was assumed to be Asian. The educational level was assumed to be some college. A family history of lung cancer was imputed as none. §, family history of lung cancer was imputed as a family history of cancer. All participants with a family history of any cancer were assumed to have a late onset. ¶, family history of lung cancer was imputed as none. LCRAT, Lung Cancer Risk Models for Screening; PLCOM2012, the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial Model 2012; LLPi, Liverpool Lung Project models.
Multivariate lung cancer prediction model in Korean ever-smokers
| Variables | Beta | HR (95% CI) |
|---|---|---|
| Age | ||
| Age-mean | 0.14618 | 1.157 (1.150–1.164) |
| (Age-mean)2 | −0.00242 | 0.998 (0.997–0.998) |
| Sex | ||
| Male | 0 | 1 (ref) |
| Female | −0.38713 | 0.679 (0.611–0.754) |
| Pack-year | ||
| Square root | 0.17456 | 1.191 (1.174–1.208) |
| Smoking status and years since cessation in past smokers | ||
| Current smokers | 1.03818 | 2.824 (2.543–3.137) |
| <5 years | 0.62246 | 1.864 (1.644–2.112) |
| 5–14.9 years | 0.34404 | 1.411 (1.248–1.595) |
| ≥15 years | 0 | 1 (ref) |
| Physical activity | ||
| <3/week | 0 | 1 (ref) |
| ≥3/week | −0.06768 | 0.935 (0.889–0.983) |
| Number of days of alcohol consumption | ||
| <5/week | 0 | 1 (ref) |
| ≥5/week | 0.05952 | 1.061 (1.001–1.125) |
| Body mass index | ||
| <18.5 kg/m2 | 0.26841 | 1.308 (1.189–1.439) |
| 18.5–24.9 kg/m2 | 0 | 1 (ref) |
| ≥25 kg/m2 | −0.24695 | 0.781 (0.741–0.824) |
| History of chronic pulmonary obstructive disease | ||
| No | 0 | 1 (ref) |
| Yes | 0.36037 | 1.434 (1.316–1.563) |
| History of emphysema | ||
| No | 0 | 1 (ref) |
| Yes | 0.25687 | 1.293 (1.029–1.624) |
| History of pneumoconiosis | ||
| No | 0 | 1 (ref) |
| Yes | 0.7179 | 2.05 (1.335–3.149) |
| History of interstitial pulmonary disease | ||
| No | 0 | 1 (ref) |
| Yes | 1.48709 | 4.424 (3.475–5.633) |
Prediction performance of the lung cancer prediction model in Korean ever-smokers
| Statistic | Value (95% CI) |
|---|---|
| Harrell’s C-index in the training dataset | |
| Ever-smokers | 0.816 (0.810–0.822) |
| Current smokers | 0.816 (0.808–0.824) |
| Past smokers | 0.804 (0.790–0.818) |
| Smokers with <10 pack-years | 0.787 (0.760–0.814) |
| Smokers with 10–19.9 pack-years | 0.812 (0.800–0.818) |
| Smokers with 20–29.9 pack-years | 0.826 (0.820–0.829) |
| Smokers with ≥30 pack-years | 0.754 (0.746–0.762) |
| E/O ratio in the training dataset | |
| Ever-smokers | 1.002 (0.979–1.024) |
| Current smokers | 0.881 (0.858–0.904) |
| Past smokers | 1.510 (1.444–1.580) |
| Smokers with <10 pack-years | 1.143 (1.043–1.252) |
| Smokers with 10–19.9 pack-years | 0.919 (0.915-0.924) |
| Smokers with 20–29.9 pack-years | 0.984 (0.979-0.989) |
| Smokers with ≥30 pack-years | 0.958 (0.930–0.988) |
| Harrell’s C-index in the validation dataset | |
| Ever-smokers | 0.816 (0.806–0.826) |
| Current smokers | 0.816 (0.804–0.828) |
| Past smokers | 0.803 (0.783–0.823) |
| Smokers with <10 pack-years | 0.797 (0.758–0.836) |
| Smokers with 10–19.9 pack-years | 0.819 (0.801-0.829) |
| Smokers with 20–29.9 pack-years | 0.823 (0.809-0.830) |
| Smokers with ≥30 pack-years | 0.753 (0.739–0.767) |
| E/O ratio in the validation dataset | |
| Ever-smokers | 0.989 (0.956–1.023) |
| Current smokers | 0.824 (0.793–0.857) |
| Past smokers | 1.504 (1.404–1.611) |
| Smokers with <10 pack-years | 1.072 (0.936–1.227) |
| Smokers with 10–19.9 pack-years | 0.913 (0.906-0.919) |
| Smokers with 20–29.9 pack-years | 1.036 (1.029-1.044) |
| Smokers with ≥30 pack-years | 0.962 (0.919–1.007) |
E/O, expected/observed.
Model accuracy of lung cancer classification compared with the current guidelines for lung cancer screening in Korean ever-smokers
| Criteria | Lung cancer development | Without lung cancer development | Predictive value |
|---|---|---|---|
| Lung cancer screening criteria in the training dataset | |||
| Positive | 3,448 | 115,197 | PPV: 2.9 (2.8–3.0) |
| Negative | 4,319 | 555,443 | NPV: 99.2 (99.2–99.3) |
| Sensitivity | 44.4 (95% CI: 43.2–45.5) | ||
| Specificity | 82.8 (95% CI: 82.7–82.9) | ||
| Lung cancer prediction model in the training dataset | |||
| Positive | 4,708 | 113,402 | PPV: 4.0 (3.9–4.1) |
| Negative | 3,059 | 557,238 | NPV: 99.5 (99.4–99.5) |
| Sensitivity | 60.6 (95% CI: 59.5–61.7) | ||
| Specificity | 83.1 (95% CI: 83.0–83.2) | ||
| Lung cancer screening criteria in the validation dataset | |||
| Positive | 1,491 | 49,328 | PPV: 2.9 (2.8–3.1) |
| Negative | 1,877 | 238,248 | NPV: 99.2 (99.2–99.3) |
| Sensitivity | 44.3 (95% CI: 42.6–46.0) | ||
| Specificity | 82.8 (95% CI: 82.7–83.0) | ||
| Lung cancer prediction model in the validation dataset | |||
| Positive | 2,053 | 48,695 | PPV: 4.0 (3.9–4.2) |
| Negative | 1,315 | 238,881 | NPV: 99.5 (99.4–99.5) |
| Sensitivity | 60.1 (95% CI: 59.2–62.6) | ||
| Specificity | 83.0 (95% CI: 82.9–83.2) | ||
PPV, positive predictive value; NPV, negative predictive value; CI, confidence interval.