| Literature DB >> 20353578 |
Susan Mallett1, Patrick Royston, Susan Dutton, Rachel Waters, Douglas G Altman.
Abstract
BACKGROUND: Development of prognostic models enables identification of variables that are influential in predicting patient outcome and the use of these multiple risk factors in a systematic, reproducible way according to evidence based methods. The reliability of models depends on informed use of statistical methods, in combination with prior knowledge of disease. We reviewed published articles to assess reporting and methods used to develop new prognostic models in cancer.Entities:
Mesh:
Year: 2010 PMID: 20353578 PMCID: PMC2856521 DOI: 10.1186/1741-7015-8-20
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Figure 1Flowchart of articles.
Study design and size
| % (n = 47) articles | |
|---|---|
| Study design | |
| RCT* | 11 (5) |
| Prospective cohort study | 21 (10) |
| Database or other retrospective cohort | 68 (32) |
| Reason for sample size | |
| No reason given | 77 (36) |
| Justified time interval (clinical or technology) | 4 (2) |
| Trial size (RCT or cohort) | 19 (9) |
| Power calculation reported | 0 (0) |
| Eligible for study: | |
| Number of patients (n = 35) | 403 (148 to 833) |
| Number of events (n = 20) | 112 (62 to 289) |
| Included in analysis: | |
| Number of patients (n = 45) | 342 (130 to 684) |
| Number of events (n = 33) | 110 (61 to 230) |
| Number of events per candidate variable (n = 28) | 10 (5 to 33) [2 to 95] |
*RCT randomised controlled trial
Outcomes and definitions
| % (n = 47) articles | |
|---|---|
| Primary outcome | |
| Overall survival | 66 (31) |
| Disease free survival (DFS) | 34 (16) |
| Definition of outcomes | |
| Overall survival (n = 31) | |
| Explicitly death from any cause | 15 (7) |
| Cancer death only | 21 (10) |
| Type of death unclear | 30 (14) |
| Disease free survival (n = 16) | |
| DFS including death* | 10 (5) |
| DFS not including death | 13 (6) |
| DFS but unclear if includes death | 10 (5) |
| Multivariable outcome clearly defined‡ | 60 (28) |
* In one study it was not clear if DFS deaths included all deaths or only cancer specific deaths
‡ In two articles it was unclear how many outcomes were examined
Variables in multivariable analysis
| % (n = 47) articles | |
|---|---|
| Coding of variables in model | |
| Coding explicit for all candidate variables | 68 (32) |
| Coding explicit for all variables in final model‡ | 70 (33) |
| Coding of continuous candidate variables | |
| All continuous | 2 (1) |
| All categorised/dichotomised | 51 (24) |
| Combination of continuous and categorised variables | 19 (9) |
| No continuous candidate variables | 0 (0) |
| Unclear/not reported | 28 (13) |
| Number of variables | |
| Candidates used to develop model* (n = 46) | 11 (7 to 14) (2 to 27) |
| Included in final model** (n = 44) | 4 (3 to 6) (2 to 12) |
‡ Two papers could not be assessed as the final model was unclear.
* In four articles variables were excluded where it was unclear if these were candidate variables. In one article the number of candidate variables was completely unclear
** Three articles could not be included: two because the final model was not presented in the paper, one because it was not clear which of the alternative models in the paper was the final model.
Selection of variables in multivariable analysis
| % (n = 43*) | |
|---|---|
| All candidate variables used (no selection) | 26 (11) |
| All candidate variables apart from a few with contra indications** | 5 (2) |
| Without statistical analysis | |
| Previous literature | 5 (2) |
| Previous literature and few variables by investigator choice | 5 (2) |
| By statistical analysis | |
| Screening by univariable analysis - only significant variables | 37 (16) |
| Screening by univariable analysis - significant variables and investigator choice | 11 (5) |
| Unclear/Not reported | 11 (5) |
| A priori variables fixed, others added | 2 (1) |
| Backward elimination | 14 (6) |
| Forward selection | 5 (2) |
| Other (pairwise multiple testing for categories of variables) | 2 (1) |
| Unclear/Not reported | 77 (33) |
| No selection. All variables kept in model | 14 (6) |
| Retain only significant variables based on P-value | 65 (28) |
| Retain significant variables plus variables based on previous literature | 2 (1) |
| Retain all variables but alter prognostic score after model to include only significant variables and adjust for other variables | 5 (2) |
| Retain only significant variables but alter prognostic score after final model | 5 (2) |
| Retain based on model | 2 (1) |
| Unclear/Not reported | 7 (3) |
* Excluded four studies using recursive partitioning analysis and artificial neural network models
** Contra indications reported as reasons for exclusion of variables were missing data, collinearity and treatment indicator