| Literature DB >> 31099339 |
Xiaofang Wang1,2, Yan Zhang3, Shiying Hao4,5, Xuefeng B Ling2,5, Le Zheng4,5, Jiayu Liao6,7, Chengyin Ye8, Minjie Xia9, Oliver Wang9, Modi Liu9, Ching Ho Weng2, Son Q Duong10, Bo Jin9, Shaun T Alfreds11, Frank Stearns9, Laura Kanov9, Karl G Sylvester2, Eric Widen9, Doff B McElhinney4,5.
Abstract
BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate.Entities:
Keywords: electronic health records; lung cancer; prospective study; risk prediction model
Mesh:
Year: 2019 PMID: 31099339 PMCID: PMC6542253 DOI: 10.2196/13260
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1A workflow to develop the new incident lung cancer risk prediction model. EHR: electronic health record; HIE: health information exchange; PPV: positive predictive value. XGBoost: Extreme Gradient Boosting.
Baseline characteristics of the retrospective cohort (N=873,598) and prospective cohort (N=836,659).
| Characteristic | Retrospective cohort, n (%) | Prospective cohort, n (%) | |
| <45 | 385,009 (44.1) | 366,752 (43.8) | |
| 45-54 | 116,655 (13.4) | 109,986 (13.1) | |
| 55-64 | 143,960 (16.5) | 139,219 (16.6) | |
| ≥65 | 227,974 (26.1) | 220,702 (26.4) | |
| Male | 386,251 (44.2) | 369,022 (44.1) | |
| Female | 487,347 (55.8) | 467,637 (55.9) | |
| Smokinga | 16,611 (1.9) | 14,248 (1.7) | |
| Other cancer history | 59,239 (6.8) | 72,039 (8.6) | |
| COPDb | 32,180 (3.7) | 36,221 (4.3) | |
| Pneumonia | 9,896 (1.1) | 12,179 (1.5) | |
| Other respiratory disordersc | 5131 (0.6) | 5738 (0.7) | |
| Diabetes | 73,854 (8.5) | 70,005 (8.4) | |
| CVDsd | 166,088 (19) | 161,685 (19.3) | |
| CKDe | 18,458 (2.1) | 18,912 (2.3) | |
| Cough | 26,574 (3) | 36,810 (4.4) | |
| Chest pain | 32,101 (3.7) | 35,057 (4.2) | |
| Hemoptysis | 770 (0.1) | 981 (0.1) | |
| Dyspnea | 4071 (0.5) | 3755 (0.5) | |
| Pleural effusion | 2024 (0.2) | 2356 (0.3) | |
| Abnormal weight loss | 6136 (0.7) | 5801 (0.7) | |
| C-reactive protein test | 11,613 (1.3) | 8,517 (1) | |
| Leukocytes count | 90,131 (10.3) | 71,694 (8.6) | |
| Platelets | 69,334 (7.9) | 51,477 (6.2) | |
| Glomerular filtration rate | 21,446 (2.5) | 18,695 (2.2) | |
| Glucose in serum or plasma | 137,575 (15.8) | 103,671 (12.4) | |
aSmoking was defined with a diagnosis code of Z72_2 from the International Classification of Diseases, 10th Revision, Clinical Modification.
bCOPD: chronic obstructive pulmonary disease (including chronic bronchitis and emphysema).
cOther respiratory disorders were defined with a diagnosis code of J98 from the from the International Classification of Diseases, 10th Revision, Clinical Modification.
dCVD: cardiovascular disease (including hypertension, coronary artery disease, peripheral vascular disease, arrhythmia, and abdominal aortic aneurysm).
eCKD: chronic kidney disease.
Figure 2The receiver operating characteristic curves derived from the prospective cohort, smoking subgroup, age≥65 years subgroup, and age<45 years subgroup. The 95% CI of each receiver operating characteristic curve is indicated by the blue shaded area and the AUC (with 95% CI) of each subgroup is listed under each receiver operating characteristic curve. AUC: area under the curve.
Figure 3(a) Stratification of patients in the prospective cohort. Positive predictive value was plotted as a function of the predictive score. Two thresholds of 0.0045 and 0.01 were applied to stratify the population into low-, medium- and high-risk categories. (b) Survival curves of the three risk categories. HR: Hazard Ratio.
Figure 4Time-to-diagnosis curves of the disease subgroup, smoking subgroup, and other cancer history subgroup for the low-risk (a) and high-risk (b) categories of the prospective cohort. Disease subgroups comprised patients who received diagnoses of COPD, pneumonia, other respiratory disorders, diabetes, CVDs, or CKD. CKD: chronic kidney disease; COPD: chronic obstructive pulmonary disease; CVD: cardiovascular disease.