| Literature DB >> 31462469 |
Xue Qin Yu1,2, Qingwei Luo3, Suzanne Hughes1, Stephen Wade1, Michael Caruana1, Karen Canfell1,2,4, Dianne L O'Connell1,2,5.
Abstract
OBJECTIVES: To identify and summarise all studies using statistical methods to project lung cancer incidence or mortality rates more than 5 years into the future. STUDY TYPE: Systematic review.Entities:
Keywords: lung cancer; statistical modelling; statistical projections; systematic review; tobacco smoking
Year: 2019 PMID: 31462469 PMCID: PMC6720154 DOI: 10.1136/bmjopen-2018-028497
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Inclusion and exclusion criteria employed
| Domain | Inclusion criteria | Exclusion criteria |
| Study type | Population-based original research studies | Any of: Editorial comment, literature review, case studies, clinical trials, case–control studies. |
| Study population | General population in any country | Restricted to selected groups, that is, selected patients with cancer or high-risk populations. |
| Outcomes | Reports projections of lung cancer incidence and/or mortality rates | No relevant outcomes are reported, that is, no lung cancer-specific outcomes. |
| Statistical method | Uses a statistical method for the projection, including studies, which used simulation methods to estimate confidence intervals, that is, Bayesian technique | Uses mathematical models, which generate outcomes based on a proposed theoretical model of the disease’s natural history. |
| No of years projected | Reports long-term projections, that is, greater than 5 years | Does not report projections of lung cancer rates, that is, only explains past trends, or reports projections less than or equal to 5 years. |
| Publication type | Full-text published | Conference proceedings, abstracts, posters. |
| Time of publication | Published from 1 January 1988 to 14 August 2018 | Published before 1988. |
| Language | English | Language other than English. |
Figure 1PRISMA flow chart of study selection process. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.
Prespecified criteria for assessing studies included in this review
| Criterion | Yes | No or not clear |
| Strengths | ||
| ≥10 years observed data | Observed data period reported ≥10 years. | Observed data period reported <10 years, or there is insufficient information to make an assessment. |
| Good quality data source | Data source reported, and the majority of observed data used are included in IARC Cancer in Five Continents, or with high population coverage as stated in WHO database. | Data source reported but the majority of observed data used are not included in IARC Cancer in Five Continents, or with low population coverage as stated in WHO database, or there is insufficient information to make an assessment. |
| Provided fitted values of observed data | Reports both model estimates and observed data for the period used for model fitting. | Does not report both model estimates and observed data for the period used for model fitting. |
| Validated projections using observed data | The model was validated by excluding data for the most recent years from the model fitting, and then comparing the projected rates for those years with the observed data. Provides both model projections beyond the period included in model fitting and a comparison with the observed data for the same period. | Does not provide validation using observed data. |
| Advantage | ||
| Provided software information | Software information was described or referenced. | Software information not provided. |
IARC, International Agency for Research on Cancer; WHO, World Health Organization.
Figure 2Organisational framework to categorise methods for lung cancer mortality projections. APC, age–period–cohort; GLM, generalised linear model; SAF, smoking attributable fraction.
Summary of included studies
| First author and year | Lung cancer outcome(s) | Country | Observed data | No of years projected | Incorporated smoking data | Model | Software | Good data quality* | Provides fitted values† | Validation‡ |
| AIHW 2012 | Incidence | Australia | 1982–2007 | 10–19 years | No | Joinpoint analysis/GLM | Joinpoint | Yes | Yes | No |
| Alonso 2018 | Incidence | Uruguay | 1990–2014 | 20+years | No | APC model | Stata, R, Nordpred | No | No | No |
| Arslanhan 2012 | Incidence/ | Turkey | 2002 | 20+years | Yes, smoking prevalence and smoking status | Relative risk | Not provided | No | No | No |
| Baade 2012 | Incidence | Australia | 1982–2007 | 10–19 years | No | Assume same rate | Not provided | Yes | No | No |
| Bashir 2001 | Incidence/ | Finland | 1955–1974 | 20+years | No | Bayesian APC model | WinBUGs | Yes | Yes | Yes |
| Bosetti 2012 | Mortality | 32 European countries | 1970–2009 | <10 years | No | Bayesian APC model /Joinpoint analysis | Joinpoint, GLIM | Yes | No | No |
| Bray 2012 | Incidence/ | 184 countries | 1988–2002 | 20+ years | No | GLM/annual percentage change | Not provided | Yes | No | No |
| Brenner 1992 | Incidence | Germany | 1968–1987 | 10–19 years | No | GLM | GLIM | Yes | No | No |
| Brown 1988 | Mortality | USA | 1958–1982 | 20+ years | Yes, smoking prevalence, consumption and tar content | GLM with smoking as a covariate | Not provided | Yes | Yes | No |
| Byers 2006 | Mortality | USA | 1990–2002 | 10–19 years | No | GLM/assume same rate | Not provided | Yes | No | No |
| Cancer Institute 2016 | Incidence/ | Australia | 1994–2008 | 10–19 years | No | GLM | SAS | Yes | No | No |
| Cancer Projections Network 2010 | Incidence/ | Canada | 1975–1994 | 10–19 years | No | Bayesian APC model/GAM | Nordpred, WinBUGs, GLIM | Yes | Yes | Yes |
| Carson 1993 | Mortality | USA | 1979–1989 | 10–19 years | No | GLM/assume same rate | BMDP | Yes | No | No |
| Castro 2016 | Incidence | Portugal | 1994–2009 | 10–19 years | No | Joinpoint analysis and GLM | Stata, Joinpoint | Yes | No | No |
| Cayuela 2011 | Mortality | Spain | 1979–2008 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Chen 2011 | Incidence | China | 1998–2007 | 10–19 years | No | Bayesian APC model | BAMP | No | No | No |
| Clements 2005 | Mortality | 5 countries | 1950–2001 | 10–19 years | No | Bayesian APC model | R, WinBUGs | Yes | Yes | Yes |
| Clèries 2016 | Mortality | Spain | 1998–2012 | 10–19 years | No | Bayesian APC model | INLA | Yes | No | No |
| Clèries 2018 | Incidence/ | Spain | 1994–2013 | 10–19 years | No | Bayesian APC model | Not provided | Yes | No | No |
| Coupland 2010 | Incidence | UK | 1985–2003 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Davis 2013 | Incidence/ | USA | 1990–2007 | 10–19 years | Yes, smoking prevalence | Annual percentage change and SAF | SAS | Yes | No | No |
| Didkowska 2009 | Mortality | Poland | 1998–2006 | 10–19 years | No | GLM | Stata | No | No | No |
| D'Souza 2013 | Mortality | India | 2001–2004 | 20+ years | No | Assume same rate | Not provided | No | No | No |
| D'Souza 2013b | Incidence | India | 2001–2004 | 20+ years | No | Assume same rate | Not provided | No | No | No |
| Dušek 2015 | Incidence | Czech Republic | 1978–2011 | <10 years | No | GLM | S | Yes | No | No |
| Dyba 1997 | Incidence | Sweden | 1960–1984 | 20+ years | No | GLM | GLIM | Yes | No | No |
| Dyba 2000 | Incidence | Finland | 1954–1978 | 10–19 years | No | GLM | GLIM | Yes | No | Yes |
| Eilstein 2008 | Mortality | France | 1978–2002 | 10–19 years | No | Bayesian APC model | WinBUGs | Yes | No | No |
| Eilstein 2012 | Mortality | France | 1977–2006 | 10–19 years | No | Bayesian APC model/GAM | R, WinBUGs | Yes | Yes | No |
| Engeland 1995 | Mortality | Nordic countries | 1958–1987 | 20+ years | No | GLM | Not provided | Yes | No | No |
| Ferlay 2010 | Incidence/ | European countries | 1978–2002 | <10 years | No | APC model | Nordpred | Yes | No | No |
| Ferlay 2013 | Incidence/ | European countries | 1978–2006 | <10 years | No | APC model | Nordpred | No | No | No |
| Ferlay 2013 | Incidence/ | Worldwide | 1989–2011 | 20+ years | No | Assume same rate | Not provided | Yes | No | No |
| French 2006 | Mortality | UK | 1984–2004 | 10–19 years | No | Joinpoint analysis/GLM | Stata, Joinpoint | Yes | No | No |
| Fukuda 2002 | Mortality | Japan | 1988–1997 | 10–19 years | No | GLM | Not provided | No | No | No |
| Galasso 2013 | Incidence/ | Italy | 1970–2002 | 10–19 years | No | APC model | MIAMOD | Yes | No | No |
| Godlewski 2012 | Incidence | Poland | 1999–2008 | 10–19 years | No | GLM | Stata | No | No | No |
| Hakulinen 1994 | Incidence | Sweden | 1960–1984 | 20+ years | No | GLM | GLIM | Yes | No | No |
| Heinävaara 2006 | Incidence/ | Finland | 1987–1997 | 10–19 years | No | GLM | Not provided | Yes | Yes | Yes |
| Hristova 1997 | Incidence | Bulgaria | 1968–1992 | 20+ years | No | APC model | GLIM | No | No | No |
| Jee 1998 | Mortality | Korea (South) | 1980–1994 | 10–19 years | No | GLM | Not provided | Yes | No | No |
| Jürgens 2014 | Mortality | Switzerland | 1974–2008 | 10–19 years | No | Bayesian APC model | R, WinBUGs | Yes | Yes | Yes |
| Kaneko 2003 | Mortality | Japan | 1952–2001 | 20+ years | No | Bayesian APC model | WinBUGs | Yes | Yes | No |
| Knorr-Held 2001 | Mortality | Germany | 1952–1996 | 10–19 years | Yes, smoking prevalence and consumption | Bayesian APC model and GLM with smoking as a covariate | BAMP | Yes | No | No |
| Kubík 1998 | Mortality | 4 European countries | 1960–1989 | 20+ years | No | APC model | GLIM | Yes | No | No |
| Kuroishi 1992 | Mortality | Japan | 1969–1989 | 20+ years | No | GLM | Not provided | Yes | No | No |
| Li 2017 | Mortality | China | 1974–2014 | 10–19 years | No | APC model | Nordpred | No | No | Yes |
| Malvezzi 2013 | Mortality | 33 European countries | 1970–2009 | <10 years | No | Joinpoint analysis/Bayesian APC model | R, Joinpoint, GLIM | Yes | No | No |
| Malvezzi 2015 | Mortality | 28 European countries | 1970–2009 | <10 years | No | Joinpoint analysis/GLM | R, Joinpoint | Yes | No | No |
| Malvezzi 2018 | Mortality | 6 countries | 1970–2012 | <10 years | No | Joinpoint analysis/GLM | Joinpoint | Yes | No | No |
| Martín-Sánchez 2016 | Mortality | Spain | 2007–2013 | <10 years | No | GLM | R, WinBUGs | Yes | No | No |
| Martín-Sánchez 2017 | Mortality | Spain | 1980–2013 | <10 years | Yes, smoking prevalence | GLM | Not provided | Yes | No | No |
| Martín-Sánchez 2018 | Mortality | 52 countries | 2008–2014 | 10–19 years | No | GLM | WinBUGs | No | No | No |
| Mistry 2011 | Incidence | UK | 1975–2007 | 20+ years | No | APC model | Stata, Nordpred | Yes | Yes | No |
| Møller 2002 | Incidence | Nordic countries | 1958–1997 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Møller 2005 | Incidence | Nordic countries | 1958–1987 | 10–19 years | No | APC model | R | Yes | No | Yes |
| Møller 2007 | Incidence | UK | 1974–2003 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Murray 1997 | Mortality | 47 countries | 1950–1990 | 20+ years | Yes, smoking intensity | GLM | Not provided | Yes | No | No |
| Negri 1990 | Mortality | Italy | 1955–1984 | 10–19 years | Yes, smoking prevalence | APC model involve smoking data | GLIM | Yes | Yes | No |
| Negri 1990 | Mortality | Switzerland | 1950–1984 | 10–19 years | Yes, smoking prevalence | APC model involve smoking data | GLIM | Yes | Yes | No |
| Ng 2009 | Mortality | Indonesia, Vietnam, Ethiopia | 2005–2006 | 10–19 years | Yes, smoking prevalence | GLM | SAS, Stata | No | No | No |
| Nowatzki 2011 | Incidence | Canada | 1976–2005 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Oberaigner 2014 | Incidence | Austria | 1990–2009 | 10–19 years | No | GLM | Stata | Yes | Yes | No |
| Olajide 2015 | Incidence | UK | 2002–2011 | <10 years | No | GLM | SAS, Stata | Yes | Yes | No |
| O'Lorcain 2004 | Mortality | Ireland | 1954–2000 | 10–19 years | No | GLM | Stata | Yes | No | No |
| Olsen 2008 | Mortality | UK | 1971–2005 | 20+ years | No | APC model | Nordpred | Yes | No | No |
| Parsons 2000 | Incidence | UK | 1981–1995 | 20+ years | No | GLM | S-PLUS | Yes | Yes | No |
| Pearce 2016 | Mortality | Ireland | 2007–2011 | 10–19 years | No | Assume same rate | SAS | Yes | No | No |
| Pierce 1992 | Mortality | 8 countries | 1975–1986 | 10–19 years | Yes, tobacco consumption | The simple tobacco consumption model | Not provided | No | No | No |
| Pisani 1993 | Mortality | 24 geographical global areas | 1985–1985 | 10–19 years | No | Assume same rate | Not provided | No | No | No |
| Pompe-Kirn 2000 | Incidence | Slovenia | 1965–1994 | 10–19 years | No | APC model | GLIM | Yes | No | No |
| Preston 2014 | Mortality | USA | 1940–2009 | 20+ years | Yes, smoking prevalence | GLM | Not provided | Yes | No | No |
| Quante 2016 | Incidence/ | Germany | 1998–2012 | 10–19 years | No | Joinpoint analysis/annual percentage change | SAS, Joinpoint | Yes | No | No |
| Rahib 2014 | Incidence/ | USA | 2006–2010 | 20+ years | No | Annual percentage change | Joinpoint | Yes | No | No |
| Rapiti 2014 | Incidence | Switzerland | 1985–2009 | 10–19 years | No | APC model | Nordpred | Yes | No | No |
| Reissigova 1994 | Mortality | Czech Republic | 1960–1989 | 10–19 years | No | APC model | GLIM | No | Yes | No |
| Ribes 2014 | Incidence/ | Spain | 1993–2007 | 10–19 years | No | Bayesian APC model | INLA | Yes | No | No |
| Riebler 2017 | Mortality | 5 countries | 1950–2011 | 10–19 years | No | Bayesian APC model | R, WinBUGs, INLA | Yes | Yes | Yes |
| Rutherford 2012 | Incidence | Finland | 1957–1987 | 20+ years | No | APC model | Stata | Yes | Yes | Yes |
| Sánchez 2010 | Incidence/ | Spain | 1981–2006 | <10 years | No | APC model | MIAMOD | Yes | Yes | No |
| Shamseddine 2014 | Incidence | Lebanon | 2003–2008 | 10–19 years | No | Joinpoint analysis/GLM | Joinpoint | No | No | No |
| Sharp 1996 | Incidence/ | UK | 1968–1992 | <10 years | No | APC model | GLIM | Yes | No | No |
| Shibuya 2005 | Mortality | Four countries | 1950–2000 | 20+ years | Yes, tobacco consumption and tar content | GLM with smoking as a covariate | Not provided | Yes | Yes | Yes |
| Smith 2009 | Incidence | USA | 2003–2005 | 20+years | No | Assume same rate | SAS | Yes | No | No |
| Smittenaar 2016 | Incidence/mortality | UK | 1979–2014 | 20+years | No | APC model | Stata | Yes | Yes | No |
| Son 2016 | Mortality | Korea (South) | 1983–2012 | 20+years | No | APC model | Nordpred | Yes | No | No |
| Stoeldraijer 2015 | Mortality | 4 European countries | 1950–2009 | 20+years | Yes, smoking prevalence | APC model / SAF | R | Yes | No | No |
| Stracci 2013 | Incidence/ | Italy | 1970–2002 | 10–19 years | No | APC model | MIAMOD | Yes | No | No |
| Strong 2008 | Mortality | 107 countries | 1950–2002 | 20+years | Yes, smoking intensity | GLM | Not provided | No | No | No |
| Swaminathan 2011 | Incidence | India | 1982–2006 | 10–19 years | No | APC model | Nordpred | Yes | No | No |
| Torres-Avilés 2015 | Mortality | Chile | 1990–2009 | <10 years | No | GLM | WinBUGs | Yes | Yes | Yes |
| Tsoi 2017 | Incidence | China | 1993–2007 | 20+years | No | GLM | R | Yes | Yes | No |
| Vardanjani 2017 | Incidence | Iran | 2003–2009 | <10 years | No | Joinpoint analysis/annual percentage change | Joinpoint | No | No | No |
| Virani 2017 | Incidence | Thailand | 1989–2012 | 10–19 years | No | Joinpoint analysis/APC model | R, Joinpoint, Nordpred | No | Yes | No |
| Vogt 2017 | Mortality | German | 1956–2013 | 20+years | Yes, years smoked | GLM | Not provided | Yes | No | No |
| Weir 2015 | Incidence | USA | 1975–2009 | 10–19 years | No | APC model | Nordpred | Yes | No | No |
| Wiklund 1992 | Mortality | Sweden | 1975–1984 | 20+years | No | GLM | CAN*TROL | Yes | No | No |
| Winkler 2015 | Mortality | South Africa | 2010 | 10–19 years | Yes, smoking prevalence | GLM/relative risk for smokers | Not provided | No | No | No |
| Yabroff 2008 | Mortality | USA | 1999–2003 | 10–19 years | No | Assume same rate | Not provided | Yes | No | No |
| Yang 2004 | Mortality | China | 1990–1999 | <10 years | No | GLM | Not provided | No | Yes | No |
| Yang 2005 | Incidence | China | 1993–1997 | <10 years | No | GLM | GLIM | No | No | No |
*The majority of observed data used are included in the Cancer Incidence in Five Continents series published by the International Agency for Research on Cancer, or have high population coverage as stated in WHO mortality database.
†Provides fitted values of observed data to allow appraisal of the model fit to the observed data.
‡Validation using observed data: Paper compared the projected values with the observed data beyond the period included in model fitting. The model was validated by excluding data for the most recent years from the model fitting, and then compared the projected rates for those years with the observed data.
APC, age–period–cohort; GLM, generalised linear model; SAF, smoking attributable fraction.
Summary of study characteristics grouped according to projection method used
| Method | Total studies* | Incidence | Mortality | ≥10 years observed data | Good data quality† | No of years projected | Provide fitted values‡ | Validation§ | ||
| 6–9 | 10–20 | >20 | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| ( | ( | ( | ( | ( | ( | ( | ( | ( | ( | |
| APC models, (%) | 44 | 26 | 29 | 44 | 37 | 6 | 31 | 7 | 15 | 8 |
| ( | (59.1) | (65.9) | (100.0) | (84.1) | (13.6) | (70.5) | (15.9) | (34.1) | (18.2) | |
| Other GLMs, (%) | 35 | 17 | 21 | 30 | 29 | 10 | 20 | 5 | 8 | 3 |
| ( | (48.6) | (60.0) | (85.7) | (82.9) | (28.6) | (57.1) | (14.3) | (22.9) | (8.6) | |
| Present state methods, (%) | 12 | 7 | 8 | 4 | 8 | 1 | 7 | 4 | 0 | 0 |
| ( | (58.3) | (66.7) | (33.3) | (66.7) | (8.3) | (58.3) | (33.3) | (0.0) | (0.0) | |
|
|
|
|
|
|
|
|
|
|
|
|
| ( | ( | ( | ( | ( | ( | ( | ( | ( | ( | |
| GLM with a smoking variable as one of the covariates, (%) | 8 | 0 | 8 | 8 | 7 | 1 | 1 | 6 | 3 | 1 |
| ( | (0.0) | (100.0) | (100.0) | (87.5) | (12.5) | (12.5) | (75.0) | (37.5) | (12.5) | |
| APC model including an effect for smoking, (%) | 2 | 0 | 2 | 2 | 2 | 0 | 2 | 0 | 2 | 0 |
| ( | (0.0) | (100.0) | (100.0) | (100.0) | (0.0) | (100.0) | (0.0) | (100.0) | (0.0) | |
| Projections adjusted for the SAF, (%) | 2 | 1 | 2 | 2 | 2 | 0 | 1 | 1 | 0 | 0 |
| ( | (50.0) | (100.0) | (100.0) | (100.0) | (0.0) | (50.0) | (50.0) | (0.0) | (0.0) | |
| Other methods, (%) | 4 | 1 | 4 | 1 | 0 | 0 | 3 | 1 | 0 | 0 |
| ( | (25.0) | (100.0) | (25.0) | (0.0) | (0.0) | (75.0) | (25.0) | (0.0) | (0.0) | |
|
|
|
|
|
|
|
|
|
|
|
|
| ( | ( | ( | ( | ( | ( | ( | ( | ( | ( | |
*Numbers of studies are not mutually exclusive, with six studies using more than one method.
†The majority of observed data used are included in the Cancer Incidence in Five Continents series published by the International Agency for Research on Cancer, or have high population coverage as stated in WHO mortality database.
‡Provides fitted values of observed data to allow appraisal of the model fit to the observed data.
§Validation using observed data: Paper compared the projected values with the observed data beyond the period included in model fitting. The model was validated by excluding data for the most recent years from the model fitting, and then compared the projected rates for those years with the observed data.
APC, age–period–cohort; GLM, generalised linear model; SAF, smoking attributable fraction.
Figure 3Studies included by year of publication, 1988–2018 and level of human development of the country providing the data, stratified by method. *Six studies used more than one method, and 22 studies used data from multiple countries. HDI, Human development index.
Figure 4Statistical software packages used by method and year of publication. *Six studies used more than one method, 20 studies used more than one software package. **Others include BMDP, BAMP, S-Plus, S and Can*Trol.
Summary of software packages commonly used in 2008–2018
| Methods group | Software/ | Free software | References | Descriptions | Programming requirement |
| APC model | Nordpred | Yes |
| Nordpred is an R package for projection up to 25 years, based on log-link or the power 5 model, and provide significance test for use of recent slope or average slope for the whole period. Requires specific data format by 5-year age group and 5-year period and cannot incorporate other covariates. | Requires a specific data format and basic R programming. Assumes that the last non-linear period component applies to all future periods, and the non-linear cohort component was projected for estimated cohorts. |
| Stata | No |
| User-written command, published packages include ‘apcfit’ | Apcfit requires some programming when projecting beyond the observed data. User defines the number of knots for age, period and cohort, therefore involves model selection and comparison. | |
| R-other | Yes |
| Other packages include ‘Epi’ | Requires R programming when projecting beyond the observed data. User defines the number of knots for age, period and cohort, and allows user to specify the centering of period and cohort. | |
| WinBUGS | Yes |
| Commonly used for Bayesian models, with Markov Chain Monte Carlo (MCMC) techniques. Trends for age, period and cohort effects are smoothed. MCMC is inherently less robust than analytic statistical methods. There is no in-built protection against misuse. | Requires knowledge of Bayesian methods including recognition of the importance of prior distributions. | |
| GLM | Stata | No |
| Stata’s glm fits GLMs by using either maximum quasi likelihood or Newton-Raphson (maximum likelihood) optimisation, which is the default. | Requires basic programming and user can define link functions, distributions, or perform analyses via a menu. |
| SAS | No |
| SAS’s genmod procedure fits a GLM to the data by maximum likelihood estimation of the parameter vector beta. | Requires some SAS programming experience. | |
| Joinpoint | Yes |
| Analyse Joinpoint models based on linear or log-linear regression, the tests of significance for change in trend use a Monte Carlo Permutation method. | No programming required. Can be easily learnt using the sample analyses provided on their website. |
APC, age-period-cohort; GLM, generalised linear model.