| Literature DB >> 27193918 |
Gary S Collins1, Emmanuel O Ogundimu1, Jonathan A Cook1, Yannick Le Manach2, Douglas G Altman1.
Abstract
Continuous predictors are routinely encountered when developing a prognostic model. Investigators, who are often non-statisticians, must decide how to handle continuous predictors in their models. Categorising continuous measurements into two or more categories has been widely discredited, yet is still frequently done because of its simplicity, investigator ignorance of the potential impact and of suitable alternatives, or to facilitate model uptake. We examine three broad approaches for handling continuous predictors on the performance of a prognostic model, including various methods of categorising predictors, modelling a linear relationship between the predictor and outcome and modelling a nonlinear relationship using fractional polynomials or restricted cubic splines. We compare the performance (measured by the c-index, calibration and net benefit) of prognostic models built using each approach, evaluating them using separate data from that used to build them. We show that categorising continuous predictors produces models with poor predictive performance and poor clinical usefulness. Categorising continuous predictors is unnecessary, biologically implausible and inefficient and should not be used in prognostic model development.Entities:
Keywords: continuous predictors; dichotomisation; prognostic modelling
Mesh:
Year: 2016 PMID: 27193918 PMCID: PMC5026162 DOI: 10.1002/sim.6986
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Characteristics of the individuals in the THIN data set, used as predictors in the developed prognostic models; SD: standard deviation.
| Variable | Cardiovascular disease | Hip fracture | ||
|---|---|---|---|---|
| Development (England) ( | Validation (Scotland) ( | Development (England) ( | Validation (Scotland) ( | |
| Mean age in years (SD) | 48.6 (14.1) | 48.9 (14.0) | 50.8 (15.3) | 50.1 (14.9) |
| Men | 922,913 (51.2%) | 62,321 (56.2%) | — | — |
| Family history of cardiovascular disease | 74,668 (4.1%) | 4,268 (3.8%) | — | — |
| Mean serum cholesterol (SD) | 5.5 (1.2) | 5.5 (1.2) | — | — |
| Mean systolic blood pressure (SD) | 131.8 (20.3) | 131.6 (20.2) | — | — |
| Mean body mass index (SD) | 26.3 (4.4) | 26.3 (4.5) | 26.1 (4.9) | 26.5 (5.1) |
| Treated for hypertension | 96,634 (5.4%) | 6,223 (5.6%) | — | — |
| Diagnosis of asthma | — | — | 84,279 (8.6%) | 5,207 (8.5%) |
| History of falls | — | — | 26,124 (2.7%) | 1,016 (1.7%) |
| Prescription of tricyclic antidepressants | — | — | 51,849 (5.3%) | 3,528 (5.7%) |
Figure 1Continuous predictors from the THIN development data set. Models developed to predict cardiovascular disease using the (orange) fractional polynomial approach with four degrees of freedom, (black) restricted cubic spline approach using three knots, (blue) dichotomising at the median predictor value approach, (green) categorising into five equally sized groups approach and (red) linear approach.
c‐index of the hip fracture prognostic models varying the development sample size (200 simulations) [mean (SD)] df: degrees of freedom; SD: standard deviation.
| 25 events (development) | 50 events (development) | 100 events (development) | 2000 events (development) | |||||
|---|---|---|---|---|---|---|---|---|
| Apparent performance | Validation | Apparent performance | Validation | Apparent performance | Validation | Apparent performance | Validation | |
| Linear (all continuous) | 0.9017 (0.0289) | 0.8678 (0.0335) | 0.9027 (0.0209) | 0.8828 (0.0122) | 0.8993 (0.0148) | 0.8860 (0.0022) | 0.8979 0.0032) | 0.8880 (0.0002) |
| Age (only) dichotomised at the median | 0.8110 (0.0345) | 0.7791 (0.0290) | 0.8067 (0.0258) | 0.7922 (0.0097) | 0.8012 (0.0195) | 0.7951 (0.0059) | 0.7987 (0.0038) | 0.7983 (0.0007) |
| Dichotomised all continuous predictors at the median | 0.7991 (0.0318) | 0.7654 (0.0265) | 0.7938 (0.0223) | 0.7780 (0.0131) | 0.7872 (0.0184) | 0.7820 (0.0067) | 0.7846 (0.0038) | 0.7864 (0.0015) |
| Dichotomised all continuous predictors at the optimal | 0.8573 (0.0389) | 0.7631 (0.0626) | 0.8401 (0.0312) | 0.7798 (0.0296) | 0.8261 (0.0266) | 0.7818 (0.0249) | 0.7704 (0.0065) | 0.7722 (0.0055) |
| Categorised into 5‐year age categories | 0.9059 (0.0241) | 0.8554 (0.0201) | 0.9011 (0.0194) | 0.8705 (0.0100) | 0.8946 (0.0142) | 0.8780 (0.0048) | 0.8907 (0.0031) | 0.8847 (0.0004) |
| Categorised into 10‐year age categories | 0.8885 (0.0279) | 0.8520 (0.0230) | 0.8859 (0.0211) | 0.8659 (0.0103) | 0.8811 (0.0149) | 0.8708 (0.0036) | 0.8797 (0.0031) | 0.8741 (0.0003) |
| Categorised all continuous predictors into thirds | 0.8563 (0.0295) | 0.8218 (0.0210) | 0.8558 (0.0215) | 0.8349 (0.0103) | 0.8491 (0.0163) | 0.8403 (0.0050) | 0.8426 (0.0033) | 0.8394 (0.0009) |
| Categorised all continuous predictors into fourths | 0.8831 (0.0257) | 0.8408 (0.0216) | 0.8784 (0.0202) | 0.8541 (0.0088) | 0.8726 (0.0143) | 0.8588 (0.0045) | 0.8688 (0.0030) | 0.8634 (0.0005) |
| Categorised all continuous predictors into fifths | 0.8980 (0.0259) | 0.8485 (0.0198) | 0.8902 (0.0194) | 0.8639 (0.0091) | 0.8839 (0.0136) | 0.8699 (0.0039) | 0.8800 (0.0032) | 0.8752 (0.0004) |
| Age (only) categorised into thirds | 0.8584 (0.0302) | 0.8281 (0.0236) | 0.8586 (0.0223) | 0.8411 (0.0094) | 0.8529 (0.0161) | 0.8453 (0.0040) | 0.8474 (0.0032) | 0.8432 (0.0006) |
| Age (only) categorised into fourths | 0.8806 (0.0266) | 0.8462 (0.0221) | 0.8776 (0.0214) | 0.8583 (0.0079) | 0.8734 (0.0145) | 0.8615 (0.0036) | 0.8706 (0.0030) | 0.8640 (0.0004) |
| Age (only) categorised into fifths | 0.8917 (0.0272) | 0.8559 (0.0190) | 0.8874 (0.0201) | 0.8680 (0.0084) | 0.8830 (0.0139) | 0.8724 (0.0032) | 0.8810 (0.0031) | 0.8755 (0.0004) |
| Fractional polynomials [df = 4] | 0.9023 (0.0288) | 0.8666 (0.0344) | 0.9030 (0.0210) | 0.8827 (0.0123) | 0.8997 (0.0148) | 0.8854 (0.0050) | 0.8985 (0.0032) | 0.8885 (0.0003) |
| Fractional polynomials of age only [df = 4] | 0.9020 (0.0288) | 0.8677 (0.0335) | 0.9027 (0.0209) | 0.8828 (0.0122) | 0.8994 (0.0147) | 0.8860 (0.0022) | 0.8979 (0.0031) | 0.8879 (0.0003) |
| Restricted cubic splines [3 knots] | 0.9041 (0.0292) | 0.8656 (0.0324) | 0.9047 (0.0203) | 0.8821 (0.0121) | 0.9002 (0.0146) | 0.8859 (0.0024) | 0.8985 (0.0032) | 0.8882 (0.0002) |
| Restricted cubic splines of age only [3 knots] | 0.9023 (0.0288) | 0.8681 (0.0314) | 0.9031 (0.0208) | 0.8826 (0.0121) | 0.8994 (0.0147) | 0.8859 (0.0022) | 0.8979 (0.0032) | 0.8879 (0.0002) |
c‐index of the cardiovascular disease prognostic models varying the development sample size (200 simulations) [mean (SD)] df: degrees of freedom; SD: standard deviation.
| 25 events (development) | 50 events (development) | 100 events (development) | 2000 events (development) | |||||
|---|---|---|---|---|---|---|---|---|
| Apparent performance | Validation | Apparent performance | Validation | Apparent performance | Validation | Apparent performance | Validation | |
| Linear (all continuous) | 0.8361(0.0362) | 0.7920(0.0197) | 0.8211 (0.0263) | 0.8043 (0.0096) | 0.8183 (0.0187) | 0.8095 (0.0036) | 0.8153 (0.0039) | 0.8141 (0.0004) |
| Age (only) dichotomised at the median | 0.7957 (0.0371) | 0.7393 (0.0210) | 0.7805 (0.0291) | 0.7535 (0.0138) | 0.7732 (0.0194) | 0.7631 (0.0058) | 0.7690 (0.0047) | 0.7706 (0.0006) |
| Dichotomised all continuous predictors at the median | 0.7910 (0.0367) | 0.7302 (0.0193) | 0.7725 (0.0275) | 0.7447 (0.0130) | 0.7653 (0.0202) | 0.7525 (0.0071) | 0.7614 (0.0045) | 0.7604 (0.0006) |
| Dichotomised all continuous predictors at the optimal | 0.8379 (0.0348) | 0.7237 (0.0315) | 0.8103 (0.0300) | 0.7439 (0.0236) | 0.7948 (0.0212) | 0.7591 (0.0141) | 0.7745 (0.0043) | 0.7739 (0.0025) |
| Categorised into 5‐year age categories | 0.8501 (0.0336) | 0.7481 (0.0763) | 0.8286 (0.0253) | 0.7868 (0.0130) | 0.8193 (0.0185) | 0.8007 (0.0054) | 0.8125 (0.0040) | 0.8109 (0.0004) |
| Categorised into 10‐year age categories | 0.8327 (0.0359) | 0.7690 (0.0219) | 0.8160 (0.0272) | 0.7888 (0.0106) | 0.8101 (0.0192) | 0.7982 (0.0045) | 0.8063 (0.0042) | 0.8046 (0.0004) |
| Categorised all continuous predictors into thirds | 0.8281 (0.0353) | 0.7479 (0.0234) | 0.8080 (0.0261) | 0.7678 (0.0118) | 0.7990 (0.0184) | 0.7788 (0.0061) | 0.7908 (0.0043) | 0.7877 (0.0006) |
| Categorised all continuous predictors into fourths | 0.8486 (0.0331) | 0.7504 (0.0220) | 0.8240 (0.0254) | 0.7736 (0.0132) | 0.8121 (0.0187) | 0.7880 (0.0060) | 0.8018 (0.0040) | 0.7999 (0.0005) |
| Categorised all continuous predictors into fifths | 0.8636 (0.0295) | 0.7418 (0.0258) | 0.8347 (0.0251) | 0.7732 (0.0134) | 0.8205 (0.0182) | 0.7903 (0.0070) | 0.8079 (0.0039) | 0.8061 (0.0006) |
| Age (only) categorised into thirds | 0.8178 (0.0370) | 0.7611 (0.0206) | 0.8031 (0.0268) | 0.7763 (0.0116) | 0.7976 (0.0184) | 0.7850 (0.0046) | 0.7929 (0.0043) | 0.7906 (0.0005) |
| Age (only) categorised into fourths | 0.8296 (0.0355) | 0.7682 (0.0206) | 0.8129 (0.0260) | 0.7846 (0.0120) | 0.8065 (0.0183) | 0.7946 (0.0044) | 0.8019 (0.0040) | 0.8006 (0.0004) |
| Age (only) categorised into fifths | 0.8344 (0.0340) | 0.7698 (0.0207) | 0.8188 (0.0260) | 0.7886 (0.0105) | 0.8117 (0.0184) | 0.7989 (0.0043) | 0.8068 (0.0040) | 0.8058 (0.0004) |
| Fractional polynomials [df = 4] | 0.8349 (0.0364) | 0.7810 (0.0405) | 0.8230 (0.0256) | 0.7973 (0.0453) | 0.8184 (0.0191) | 0.8078 (0.0048) | 0.8168 (0.0039) | 0.8155 (0.0005) |
| Fractional polynomials of age only [df = 4] | 0.8336 (0.0363) | 0.7857 (0.0213) | 0.8218 (0.0261) | 0.8013 (0.0099) | 0.8179 (0.0187) | 0.8084 (0.0037) | 0.8149 (0.0039) | 0.8136 (0.0004) |
| Restricted cubic splines [3 knots] | 0.8455 (0.0334) | 0.7708 (0.0279) | 0.8286 (0.0267) | 0.7960 (0.0123) | 0.8217 (0.0187) | 0.8071 (0.0052) | 0.8167 (0.0039) | 0.8155 (0.0005) |
| Restricted cubic splines of age only [3 knots] | 0.8350 (0.0357) | 0.7812 (0.0250) | 0.8227 (0.0263) | 0.7999 (0.0103) | 0.8178 (0.0186) | 0.8077 (0.0042) | 0.8148 (0.0040) | 0.8134 (0.0004) |
Figure 2Calibration plots of cardiovascular disease risk in the validation cohort (2000 events).
Figure 3Boxplot of the predicted cardiovascular disease risks in the validation cohort (2000 events).
Range of additional net cases per 1000 found when using each approach for handling continuous predictors, compared with categorising all of the continuous predictors at the median. Models developed using 2000 outcome events. Range of thresholds 0.09 to 0.2 for cardiovascular disease and 0.01 to 0.05 for hip fracture.
| Model | Cardiovascular disease | Hip fracture | ||
|---|---|---|---|---|
| Development | Validation | Development | Validation | |
| Linear | 6 to 11 | 5 to 10 | 4 to 6 | 5 to 7 |
| Fractional polynomials | 7 to 11 | 6 to 10 | 4 to 6 | 5 to 7 |
| Restricted cubic splines | 7 to 11 | 6 to 10 | 4 to 6 | 5 to 7 |
| 5‐year age categories | 6 to 10 | 6 to 9 | 4 to 6 | 4 to 7 |
| 10‐year age categories | 6 to 10 | 5 to 9 | 2 to 5 | 3 to 6 |
| Thirds | 2 to 6 | 1 to 5 | 1 to 3 | 1 to 3 |
| Fourths | 4 to 8 | 3 to 7 | 1 to 4 | 2 to 5 |
| Fifths | 5 to 9 | 4 to 8 | 2 to 5 | 3 to 6 |