| Literature DB >> 31156302 |
Wei Tang1, Qin Peng1, Yanzhang Lyu2, Xiaoshuang Feng2, Xin Li2, Luopei Wei2, Ni Li2, Hongda Chen2, Wanqing Chen2, Min Dai2, Ning Wu1,3, Jiang Li2, Yao Huang1.
Abstract
OBJECTIVE: The objective was to systematically assess lung cancer risk prediction models by critical evaluation of methodology, transparency and validation in order to provide a direction for future model development.Entities:
Keywords: Lung neoplasms; bronchogenic; carcinoma; models; risk assessment; theoretical
Year: 2019 PMID: 31156302 PMCID: PMC6513747 DOI: 10.21147/j.issn.1000-9604.2019.02.06
Source DB: PubMed Journal: Chin J Cancer Res ISSN: 1000-9604 Impact factor: 5.087
Framework for quality assessment of multiple-use models
| Term | Content |
| Transparency | 1. Variables include (Yes or No) |
| 2. Model expression (Yes or No) | |
| 3. Limitation | |
| 4. Financial support | |
| 5. Conflict of interest | |
| 6. Validation | |
| Risk of bias | 1. Blind evaluation of outcome (Yes or No) |
| 2. Blind evaluation of predictor (Yes or No) | |
| 3. Sensitivity analysis (Yes or No) | |
| 4. Calibration (Yes or No) | |
| 5. External validation (Yes or No) | |
| Validation
| 1. Internal validation |
| 2. Cross-validation | |
| 3. External validation |
Characteristics of multiple-use models
| Model | Year | Country or region | Research design | Statistical methods | Population | Modeling sample | AUC (95% CI) | C-index (95% CI) |
| AUC, area under receiver-operating characteristic curve; 95% CI, 95% confidence interval; C-index, concordance index. | ||||||||
| Bach ( | 2003 | US | Cohort study | Cox proportional hazards regression | Aged 50−69 years, current and former smokers | 18,172 | 0.72 | |
| Spitz ( | 2007 | US | Case-control study | Logistic | Never, former and current smokers | Cancer case 1,851/Control 2,001 Never smokers: cancer case 330/Control 379 Former smokers: cancer case 784/Control 884 Current smokers: cancer case 737/Control 738 | Never smokers,
| Never smokers:
|
| Spitz ( | 2008 | US | Case-control study | Logistic | Current and former smokers, White non-Hispanic cases | Current smokers: cancer case 350/Control 244; Former smokers: cancer case 375/Control 371 | Former smokers,
| |
| LLP ( | 2008 | United
| Case-control study | Logistic | Aged 20−80 years | Cancer case 579/
| 0.71 | |
| LLPi ( | 2015 | United
| Case-control study | Cox proportional hazards regression | Aged 45−79 years | 8,760: cancer case 237, control 8,523 | 0.852 (0.831−0.873) | |
| PLCO ( | 2009 | Canada | Cohort study | Logistic | Aged 55−74 years who were free of the cancers under study | 12,314 | 0.865 | |
| PLCO ( | 2011 | Canada | Cohort study | Logistic | Aged 55−74 years, Model 1: the PLCO control arms; Model 2: smokers only | Model 1: 70,962
| Model 1: 0.859
| |
| PLCOM ( | 2012 | Canada | Cohort study | Logistic | Aged 55−74 years, former smokers | 36,286 | 0.803 (0.782−0.813) | |
| Etzel ( | 2008 | US | Case-control study | Logistic | African-Americans | Cancer case 491/Control 497 | 0.75 | |
| Pittsburgh ( | 2016 | US | Case-control study | Logistic | Aged 55−74 years, current and former smokers | LDCT 25,929/CXR 25,648 | LDCT 0.679/CXR 0.687 | |
| Hoggart ( | 2012 | United
| Cohort study | Survival analysis | Aged 40−65 years, current, former and never smokers | 169,035 (90% of the data) | One year-current: 0.82; Former: 0.83; Never: 0.84. 5 year-current: 0.77; Former: 0.72; Never: 0.79 | |
Transparency assessment of multiple-use models
| Model | Variable
| Model expression | Limitation | Financial support | Conflict of
| Validation |
| LLP, Liverpool Lung Project; PLCO, the Prostate, Lung, Colorectal and Ovarian; Y, reported; N, no reported. | ||||||
| Bach | Y | N | It does not distinguish among the risks of different histologic types of lung cancer, and it is relevant only to one subset (albeit a large subset) of at-risk individuals — those aged 50 years or older who have a smoking history. | Research institution; National project fund | N | Internal validation/
|
| Spitz (2007) | Y | N | The models may not be sufficiently discriminatory to allow accurate risk assessment at the individual level. They are needed to be validated in independent populations. | Research institution; National project fund | N | External
|
| Spitz (2008) | Y | N | Without an independent validation. | Research institution; National project fund | Y | Cross-
|
| LLP | Y | Y | More work is needed to test the applicability of the model in diverse populations, including those from diverse geographic regions. | Research institution; Foundation | N | Cross-
|
| LLPi | Y | Y | More work is needed to test the applicability of the model in diverse populations, including those from diverse geographic regions. | Region project fund; Foundation | Y | Internal
|
| PLCO (2009) | Y | N | The study model was developed in asymptomatic individuals. It is unclear whether its performance will be substantially different in symptomatic individuals presenting to clinicians. | National project fund | N | Internal
|
| PLCO (2011) | Y | N | The models may not be generalizable to other populations. Data on exposure to radon, asbestos, second-hand smoke, occupational carcinogens, and history of adult pneumonia were not available for analysis. | National project fund | N | Internal validation/
|
| PLCOM2012 | Y | N | Excluded persons who had never smoked. | Research institution | N | External
|
| Etzel | Y | N | The study was hospital-based and the controls were drawn only from the metropolitan area of Houston, Texas; therefore, the results may vary in other geographic locations; the sample size of the study was small. | National project fund | Y | Internal validation/
|
| Pittsburgh | Y | N | The model is derived in preselected high-risk populations and not necessarily applicable to the general population of smokers, and it was derived and tested in the United States and applicability to other populations will need to be tested. | Research institution; National project fund | N | External
|
| Hoggart | Y | N | Measures of carcinogens are limited to occupational exposures. | European Union project fund | Y | External
|
Risk of bias assessment of multiple-use models
| Model | Blind evaluation
| Blind evaluation
| Sensitivity
| Calibration | External
| Risk of bias |
| Y, reported; N, no reported; H, high risk; M, middle risk. | ||||||
| Bach | N | N | Y | Y | N | M |
| Spitz (2007) | N | N | N | N | Y | M |
| Spitz (2008) | Y | Y | N | Y | N | M |
| LLP | N | N | N | N | N | H |
| LLPi | N | N | N | Y | N | M |
| PLCO (2009) | N | N | N | N | N | H |
| PLCO (2011) | N | N | N | N | Y | M |
| PLCOM2012 | N | N | N | Y | Y | M |
| Etzel | N | N | N | Y | Y | M |
| Pittsburgh | N | N | N | Y | Y | M |
| Hoggart | N | N | N | N | Y | M |
Validation and samples of multiple-use models
| Model | Internal validation | Cross-validation | External validation |
| LLP, Liverpool Lung Project; PLCO, the Prostate, Lung, Colorectal and Ovarian. | |||
| Bach | Operate the model from five related study sites 3 times | 10-fold cross-validation | |
| Spitz (2007) | 25% of the data | ||
| Spitz (2008) | 3-fold cross-validation | ||
| LLP | 10-fold cross-validation | ||
| LLPi | Bootstrap 200 times | ||
| PLCO (2009) | Bootstrap 1,000 times | ||
| PLCO (2011) | Bootstrap 200 times | 44,233 | |
| PLCOM2012 | 37,332 | ||
| Etzel | 156 | 325 | |
| Pittsburgh | 3,642 | ||
| Hoggart | 10% of the data | ||
Variables of multiple-use models
| Variables | Bach | Spitz
| Spitz
| LLP | LLPi | PLCO
| PLCO
| PLCOM2012 | Etzel | Pittsburgh | Hoggart |
| BMI, body mass index; COPD, chronic obstructive pulmonary disease; LLP, Liverpool Lung Project; PLCO, the Prostate, Lung, Colorectal and Ovarian; Y, the variable was included in the model. | |||||||||||
| Sociodemographic factors | |||||||||||
| Age | Y | Y | Y | Y | Y | Y | |||||
| Gender | Y | Y | Y | Y | |||||||
| Race or ethnic group | Y | ||||||||||
| Education | Y | Y | Y | Y | |||||||
| BMI | Y | Y | Y | Y | |||||||
| Exposure history | |||||||||||
| Dust exposures | Y | Y | Y | Y | |||||||
| Asbestos exposure | Y | Y | Y | Y | Y | ||||||
| Environmental tobacco | |||||||||||
| Smoke exposure | Y | ||||||||||
| Smoking history | |||||||||||
| Age stopped smoking | Y | Y | |||||||||
| Smoking duration | Y | Y | Y | Y | Y | Y | |||||
| Pack-years smoked | Y | Y | Y | Y | |||||||
| Smoking status | Y | Y | Y | ||||||||
| Smoking intensity | Y | Y | |||||||||
| Smoking quit time | Y | Y | |||||||||
| Cigarettes per day | Y | ||||||||||
| Time since smoking cessation | Y | ||||||||||
| Medical history | |||||||||||
| Emphysema | Y | Y | |||||||||
| Hay fever | Y | Y | Y | Y | |||||||
| Bleomycin sensitivity | Y | ||||||||||
| Prior diagnosis of pneumonia | Y | ||||||||||
| Prior diagnosis of malignant
| Y | Y | |||||||||
| COPD | Y | Y | Y | Y | |||||||
| Chest X-ray in past 3 years | Y | ||||||||||
| Personal history of cancer | Y | ||||||||||
| Asthma | Y | ||||||||||
| Family history | |||||||||||
| Family history of cancer | Y | Y | Y | ||||||||
| First-degree relatives with
| Y | ||||||||||
| Family history of lung cancer | Y | Y | Y | Y | Y | ||||||
| Nodule | Y | ||||||||||
| Family history of smoking-
| Y | ||||||||||
| Genetic risk factors | |||||||||||
| DNA repair capacity | Y | ||||||||||
| chr15q25 | Y | ||||||||||
| chr5p15 | Y | ||||||||||
Single-use models for lung cancer prediction
| Model | Year | Country
| Research
| Statistical
| Variable
| Modeling
| Validation | AUC
| C-index
|
| AUC, area under receiver-operating characteristic curve; 95% CI, 95% confidence interval; C-index, concordance index; ANN, artificial neural network; SVM, support vector machine; BMI, body mass index; CRP, C-reactive protein; HGF, hepatocyte growth factor; SNP, single nucleotide polymorphism; DNMT, DNA-methyltransferase; AFP, alpha-fetal protein; CEA, carcinoembryonic antigen; NSE, neuron specific enolase; CA, carbohydrate antigen; HGH, human growth hormone. | |||||||||
| Wozniak MB ( | 2015 | Germany | Case-control study | Logistic | Gender, age and smoking status, 24 microRNAs | 100 case;
| Internal
| 0.874 | N |
| Wang X ( | 2015 | China | Case-control study | Logistic | Gender, age, education, BMI, family history, medical history, exposure history, lifestyle | 705 case;
| Internal
| 0.8851 | N |
| Muller DC ( | 2017 | US | Cohort study | Flexible parametric survival | Gender, smoking history, medical history, family history | 502,321 | Internal
| N | 0.85
|
| Ma S ( | 2016 | China | Cohort study | Logistic | Gender, age, smoke, prolactin, CRP, NY-ESI-1, HGF | 543 | External
| 0.86 (95% CI:
| N |
| Wu X ( | 2016 | China | Cohort study | Cox regression analysis | Age, gender, smoking pack-years, BMI, family history, medical history, exposure history, biomarkers | 395,875 | Internal validation; External validation | 0.851, with never smokers
| N |
| Gu F ( | 2017 | US | Cohort study | Cox proportional hazard model | Age, gender, race/ethnicity, education, family history, BMI, smoking status, smoking history | 18,729 | N | Incidence model: 0.6941;
| N |
| Lin KF ( | 2017 | China | Cohort study | Logistic | Age, gender, and BMI, nodule number, family history of lung cancer, family history of other cancer | 784 | N | N | N |
| Sha R ( | 2017 | China | Case-control study | Logistic | Age, gender, BMI, family history | 227 case;
| N | Model 1: 0.827 (0.794−0.861);
| N |
| Lin H ( | 2011 | China | Case-control study | Logistic | Gender, age, smoking status, medical history, exposure history, family history | 633 case;
| N | N | 0.881 |
| Ni R ( | 2016 | China | Case-control study | ANN, SVM, Decision tree | Gender, age, medical history, smoking history, drinking history, family history | 214 | External validation | N | 0.972 |
| Li H ( | 2012 | China | Case-control study | Logistic | Gender, age, smoking status, SNPs | N | N | 0.637 | N |
| Feng YJ ( | 2013 | China | Case-control study | Logistic, Decision tree, ANN, SVM | Gender, age, smoking history, DNMT1, DNMT3a | 136 cancer;
| External validation | Logistic: 0.923; Decision tree: 0.946; ANN: 0.877; SVM: 0.851 | N |
| Wang N ( | 2012 | China | Case-control study | Fisher, Decision tree, ANN | Gender, age, smoking status, medical history, genetic factors | 251 case
| N | Fisher: 0.722; Decision tree: 0.929; ANN: 0.894 | N |
| Zhang HQ ( | 2012 | China | Case-control study | Decision tree, ANN, Logistic, Fisher | Ferritin, AFP, CEA, NSE, CA199, CA242, CA125, CA153, HGH9 | 150 case
| External validation | Decision tree: 0.923; ANN: 0.86; Logistic: 0.809; Fisher: 0.765 | N |
| Sun RL ( | 2013 | China | Case-control study | Logistic | Family history, smoking status, lifestyle, psychology | 563 case
| N | N | N |
| Nie GJ ( | 2009 | China | Case-control study | ANN, Logistic | Tumor marker | 53 case
| External validation | ANN: 0.88,
| N |
| Chang TT ( | 2011 | China | Case-control study | Fisher | Gender, age, smoking status, medical history, exposure history | 807 case
| External validation | Non-lung cancer: 0.823; Lung cancer: 0.745 | N |