| Literature DB >> 32181751 |
Abstract
BACKGROUND: Lung cancer is one of the most dangerous malignant tumors, with the fastest-growing morbidity and mortality, especially in the elderly. With a rapid growth of the elderly population in recent years, lung cancer prevention and control are increasingly of fundamental importance, but are complicated by the fact that the pathogenesis of lung cancer is a complex process involving a variety of risk factors.Entities:
Keywords: aged; deep learning; lung cancer; primary prevention; risk factors
Mesh:
Year: 2020 PMID: 32181751 PMCID: PMC7109611 DOI: 10.2196/17695
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Figure 1Data selection flowchart. BRFSS: Behavioral Risk Factor Surveillance System.
Lung cancer risk factors assessed by the Behavioral Risk Factor Surveillance System questionnaire.
| Risk factors | Description |
| Age | Age ≥65 years? (yes/no) |
| Body mass index | Level 1: <18.5 kg/m2; 2: 18.5-24.9 kg/m2; 3: 25.0-29.9 kg/m2; 4: ≥30.0 kg/m2 |
| Education | Level of education completed (level 1: Did not graduate from high school; 2: Graduated from high school; 3: Attended postsecondary or technical school; 4: Graduated from postsecondary or technical school) |
| Smoked at least 100 cigarettes | Smoked at least 100 cigarettes in your entire life (yes/no; 1 pack contains 20 cigarettes) |
| Smoking frequency | Level 1: Every day; 2: Some days; 3: Not at all |
| Smoking start age | How old were you when you first started to smoke cigarettes regularly? (Age in years) |
| Smoking intensity | How many cigarettes do you smoke each day? (Number of cigarettes/day) |
| Smoking quit attempts | During the past 12 months, have you stopped smoking for 1 day or longer? (yes/no) |
| Time since quitting | How long has it been since you last smoked a cigarette? (1: Within the past month; 2: Within the past 3 months; 3: Within the past 6 months; 4: Within the past year; 5: Within the past 5 years; 6: Within the past 10 years; 7: 10 years or more; 8: Never smoked regularly) |
| E-cigarette use | Have you ever used an e-cigarette or other electronic vaping product, even just one time? (yes/no) |
| E-cigarette use frequency | Do you now use e-cigarettes or other electronic vaping products every day, some days, or not at all? (1: Every day; 2: Some days; 3: Not at all) |
| Chronic obstructive pulmonary disease (COPD) history | History of COPD (yes/no) |
| Asthma history | History of asthma (yes/no) |
| Cancer history | Personal history of cancer (yes/no) |
| Family history of cancer | Family history of cancer (yes/no) |
| Computed tomography (CT) or computerized axial tomography (CAT) scan | In the last 12 months, did you have a CT or CAT scan? (yes/no) |
Figure 2Schematic diagram of lung cancer risk factor identification in the elderly. DNN: deep neural network.
Figure 3Deep learning model training process. DNN: deep neural network; HDF5: hierarchical data format version 5.
Figure 4Data analysis equations.
Figure 5Normalized weights of risk factors in the stratified groups. BMI: body mass index; CAT: computerized axial tomography; COPD: chronic obstructive pulmonary disease; CT: computed tomography; PM2.5: fine particulate matter with a diameter ≤2.5 μm.
Normalized weight values and odds ratios (95% CI) of the main risk factors in the 4 population groups.
| Risk factors | Population aged ≥65 years | Men aged ≥65 years | Women aged ≥65 years | All age groups | ||||||
| Weight | Odds ratio (95% CI) | Weight | Odds ratio (95% CI) | Weight | Odds ratio (95% CI) | Weight | Odds ratio (95% CI) | |||
|
| ||||||||||
| Time since quitting | 0.21 | 1.422 (0.806-1.095) | 0.13 | 1.587 (0.776-0.998) | 0.20 | 1.590 (0.927-1.358) | 0.009 | 1.109 (0.993-1.322) | ||
| Smoking frequency | 0.11 | 1.312 (0.796-0.998) | 0.18 | 1.625 (0.866-1.097) | 0.16 | 1.536 (1.106-1.427) | 0.14 | 1.370 (1.352-1.701) | ||
| Cancer history | 0.099 | 1.295 (0.876-1.027) | 0.09 | 1.387 (1.239-1.667) | 0.11 | 1.442 (0.951-1.356) | 0.09 | 1.271 (0.852-1.201) | ||
| Smoking quit attempts | 0.091 | 1.253 (0.933-1.201) | 0.06 | 1.273 (1.413-1.702) | 0.07 | 1.368 (1.127-1.406) | 0.20 | 1.405 (0.995-1.381) | ||
| Lifetime smoking of ≤100 cigarettes | 0.081 | 1.239 (1.336-1.587) | 0.11 | 1.506 (0.681-0.937) | 0.18 | 1.588 (1.237-1.601) | 0.16 | 1.387 (1.225-1.611) | ||
| Asthma history | 0.08 | 1.303 (1.029-1.403) | 0.005 | 1.095 (0.962-1.329) | 0.07 | 1.381 (0.953-1.317) | 0.007 | 1.112 (0.961-1.406) | ||
| Radiation | 0.08 | 1.224 (1.550-1.781) | 0.09 | 1.291 (0.983-1.307) | 0.12 | 1.453 (1.302-1.759) | 0.03 | 1.190 (0.952-1.357) | ||
| E-cigarette use | 0.023 | 1.025 (0.766-0.934) | 0.12 | 1.539 (1.112-1.406) | 0.005 | 1.135 (0.897-1.309) | 0.074 | 1.239 (0.851-1.307) | ||
| Physical activity | 0.023 | 1.132 (0.983-1.246) | 0.01 | 1.170 (0.851-1.209) | 0.03 | 1.280 (0.991-1.308) | 0.08 | 1.268 (1.131-1.670) | ||
Figure 6Relationship between smoking and lung cancer incidence, 1996-2015.
Performance of the 4 DNN models.
| Model | Accuracy (95% CI) | AUROCa (95% CI) | |
| ≥65 years | 0.962 (0.530-0.751) | 0.931(0.499-0.593) | .002 |
| Men ≥65 years | 0.943 (0.459-0.643) | 0.927 (0.506-0.681) | .015 |
| Women ≥65 years | 0.932 (0.437-0.689) | 0.926 (0.543-0.782) | .003 |
| All | 0.927 (0.223-0.525) | 0.913 (0.564-0.803) | .002 |
aAUROC: area under the receiver operating characteristic curve.
bP<.05 was considered to indicate statistical significance.
Comparison of our model with previous models for identifying lung cancer risk factors.
| Model | Population | Method | Risk factors | Accuracy | AUROCa |
| Our model | 235,673 | Deep neural network | As listed in the Results section | 0.927 | 0.913 |
| Panayiotis, 2016 [ | 25,486 | Dynamic Bayesian network | Demographics, smoking status, family history of cancer, cancer history, comorbidities related to lung cancer, occupational exposures, and low-dose computed tomography screening outcomes | 0.65 | 0.75 |
| Wang, 2019 [ | 961 | Conditional Gaussian Bayesian network | Age, sex, level of education, region, urbanization, diagnosis-based factors, prior utilization factors, prescription factors | 0.67 | N/Ab |
| Ankit, 2012 [ | 70,132 | Decision tree | Age, birthplace, cancer grade, diagnostic confirmation, farthest extension of tumor, type of surgery performed, reason for no surgery, order of surgery and radiation therapy, scope of regional lymph node surgery | 0.863 | 0.91 |
| Xie, 2014 [ | 1703 | Artificial neural network | 41 risk factors: age, education level, marital status, income status, smoking, alcohol drinking, coffee intake, etc | 0.838 | N/A |
| Kaviarasi, 2019 [ | 321 | Gaussian classifier | Age, sex, radiation sequence with surgery, first malignant primary indicator, radiation, etc | N/A | 0.881 |
aAUROC: area under the receiver operating characteristic curve.
bNot available.