| Literature DB >> 28882171 |
Justine B Nasejje1, Henry Mwambi2.
Abstract
BACKGROUND: Uganda just like any other Sub-Saharan African country, has a high under-five child mortality rate. To inform policy on intervention strategies, sound statistical methods are required to critically identify factors strongly associated with under-five child mortality rates. The Cox proportional hazards model has been a common choice in analysing data to understand factors strongly associated with high child mortality rates taking age as the time-to-event variable. However, due to its restrictive proportional hazards (PH) assumption, some covariates of interest which do not satisfy the assumption are often excluded in the analysis to avoid mis-specifying the model. Otherwise using covariates that clearly violate the assumption would mean invalid results.Entities:
Keywords: Cox proportional hazards model; Random survival forests; Survival trees; proportional hazards assumption
Mesh:
Year: 2017 PMID: 28882171 PMCID: PMC5590231 DOI: 10.1186/s13104-017-2775-6
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
The distribution of births and deaths by survival determinants
| Characteristics | Dead N (%) | Alive N (%) | Total |
|---|---|---|---|
| Mother’s education level | |||
| Illiterate Mothers | 344 (7.7) | 4149 (92.3) | 4493 |
| Mother completed primary | 119 (6.4) | 1749 (93.6) | 1868 |
| Secondary and higher | 14 (4.2) | 317 (95.8) | 331 |
| Partner’s level of education | |||
| Illiterate Father | 266 (7.7) | 3180 (92.3) | 3446 |
| Father completed primary | 170 (6.9) | 2287 (93.1) | 2457 |
| Secondary and higher | 41 (5.2) | 748 (94.8) | 789 |
| Birth status | |||
| Singleton births | 431 (6.7) | 6048 (93.3) | 6479 |
| Multiple births (twins) | 46 (21.5) | 167 (78.5) | 213 |
| Sex of the child | |||
| Males | 258 (7.8) | 3067 (92.2) | 3325 |
| Females | 212 (6.3) | 3155 (93.7) | 3367 |
| Type of place of residence | |||
| Urban | 81 (5.8) | 1308 (94.2) | 1389 |
| Rural | 396 (7.5) | 4907 (92.5) | 5303 |
| Wealth index | |||
| Poorest | 131 (7.5) | 1623 (92.5) | 1754 |
| Poorer | 112 (8.5) | 1205 (91.5) | 1317 |
| Middle | 86 (7.2) | 1109 (92.8) | 1195 |
| Richer | 72 (6.9) | 969 (93.1) | 1041 |
| Richest | 76 (5.5) | 1309 (94.5) | 1385 |
| Children ever born | |||
| One child | 20 (3.3) | 581 (96.7) | 601 |
| Two children | 81 (7.1) | 1065 (92.9) | 1146 |
| Three children | 67 (6.6) | 953 (93.4) | 1020 |
| Four and more | 309 (7.9) | 3616 (92.1) | 3925 |
| Birth order number | |||
| First child | 95 (7.6) | 1154 (92.4) | 1249 |
| Second to third child | 117 (5.6) | 1974 (94.4) | 2091 |
| 4th–6th child | 149 (7.1) | 1949 (92.9) | 2098 |
| 6th+ child | 116 (9.2) | 1138 (90.8) | 1254 |
| Religion | |||
| Catholics | 217 (7.4) | 2722 (92.6) | 2939 |
| Muslims | 69 (7.5) | 852 (92.5) | 921 |
| Other Christians | 187 (6.8) | 2571 (93.2) | 2758 |
| Others | 4 (5.4) | 70 (94.6) | 74 |
| Type of toilet facility | |||
| Flush toilet | 5 (4.1) | 116 (95.9) | 121 |
| Pitlatrine | 376 (6.9) | 5031 (93.1) | 5407 |
| No-facility | 96 (8.2) | 1068 (91.8) | 1164 |
| Mother’s occupation | |||
| Not-working | 93 (6.9) | 1260 (93.1) | 1353 |
| Sales and services | 110 (6.5) | 1589 (93.5) | 1699 |
| Agriculture | 274 (7.5) | 3366 (92.5) | 3640 |
| Births in past 5 years | |||
| 1-Birth | 93 (4.5) | 1982 (95.5) | 2075 |
| 2-Birth | 227 (6.5) | 3288 (93.5) | 3515 |
| 3-Births | 140 (13.6) | 887 (86.4) | 1027 |
| 4-Births | 17 (22.7) | 58 (77.3) | 75 |
| Births in past 1 year | |||
| No-births | 309 (6.8) | 4212 (93.2) | 4521 |
| 1-Birth | 163 (7.6) | 1971 (92.4) | 2134 |
| 2-Births | 5 (13.5) | 32 (86.5) | 37 |
| Children under 5 in household | |||
| No-child | 101 (34.9) | 188 (65.1) | 289 |
| 1-Child | 178 (10.5) | 1511 (89.5) | 1689 |
| 2-Children | 146 (4.9) | 2831 (95.1) | 2977 |
| 3-Children | 35 (2.5) | 1349 (97.5) | 1384 |
| 4-Children | 17 (4.8) | 336 (95.2) | 353 |
| Mother’s age group | |||
| Less than 20 years | 29 (8.9) | 296 (91.1) | 325 |
| 20–29 years | 235 (6.5) | 3376 (93.5) | 3611 |
| 30–39 years | 164 (7.4) | 2054 (92.6) | 2218 |
| 40 years+ | 49 (7.9) | 489 (90.1) | 538 |
| Birth order number | |||
| First child | 95 (7.6) | 1154 (92.4) | 1249 |
| Second to third child | 117 (5.6) | 1974 (94.4) | 2091 |
| 4th–6th child | 149 (7.1) | 1949 (92.9) | 2098 |
| 6th+ child | 116 (9.3) | 1138 (90.7) | 1254 |
| Sex of household head | |||
| Male | 341 (6.7) | 4771 (93.3) | 5112 |
| Female | 136 (8.6) | 1444 (91.4) | 1580 |
| Source of drinking water | |||
| Piped water | 76 (5.9) | 1204 (94.1) | 1280 |
| Borehole | 216 (7.3) | 2731 (92.7) | 2947 |
| Well | 93 (6.9) | 1261 (93.1) | 1354 |
| Surface/rain/pond/lake/tank | 70 (8.5) | 756 (91.5) | 826 |
| Other | 22 (7.7) | 263 (92.3) | 285 |
| Age at first birth | |||
| Less than 20 years | 347 (7.5) | 4291 (92.5) | 4638 |
| 20–29 years | 127 (6.3) | 1899 (93.7) | 2026 |
| 30–39 years | 3 (12.0) | 22 (88.0) | 25 |
Testing the proportional hazard assumption using scaled Schoenfeld residuals
| Covariates | χ2 (df) | p-value |
|---|---|---|
| Mother’s education | ||
| Illiterate | 1 | |
| Primary | 4.83 | 0.03 |
| Secondary and higher | 7.52 | <0.01 |
| GLOBAL | 11.25 |
|
| Father’s education | ||
| Illiterate | 1 | |
| Primary | 0.51 | 0.48 |
| Secondary and higher | 0.86 | 0.35 |
| GLOBAL | 1.12 | 0.57 |
| Sex of the child | ||
| Male | 1 | |
| Female | 1.99 | 0.16 |
| Total number of children ever born | ||
| 1 child | 1 | |
| 2 child | 5.39 | 0.02 |
| 3 child | 0.44 | 0.51 |
| 4+ child | 0.26 | 0.61 |
| GLOBAL | 14.61 |
|
| Type of place of residence | ||
| Rural | 1 | |
| Urban | 8.43 |
|
| Wealth index | ||
| Poorest | 1 | |
| Poorer | 0.17 | 0.7 |
| Middle | 0.00 | 0.98 |
| Richer | 6.94 |
|
| Richest | 2.26 | 0.13 |
| GLOBAL | 9.29 | 0.05 |
| Birth order | ||
| 1st | 1 | |
| 2nd | 0.28 | 0.59 |
| 3rd | 6.69 |
|
| 4th+ | 2.64 | 0.10 |
| GLOBAL | 8.46 | 0.04 |
| Age at first birth | ||
| <20 | 1 | |
| 20–29 | 0.10 | 0.75 |
| 30+ | 0.41 | 0.52 |
| GLOBAL | 0.54 | 0.76 |
| Previous birth interval (years) | 1 | |
| <2 | Ref | |
| 2 | 1.83 | 0.18 |
| 3 | 0.97 | 0.32 |
| 4+ | 2.53 | 0.11 |
| GLOBAL | 8.69 | 0.03 |
| Number of births in the past 1 year | ||
| No birth | 1 | |
| 1 birth | 0.7 | 0.40 |
| 2 | 1.24 | 0.27 |
| GLOBAL | 1.81 | 0.40 |
| Number of births in the last 5 years | ||
| 1 births | 1 | |
| 2 births | 0.11 | 0.75 |
| 3 births | 0.03 | 0.86 |
| 4+ | 5.00 | 0.03 |
| GLOBAL | 5.85 | 0.12 |
| Mother’s age (years) | ||
| <20 | 1 | |
| 20–29 | 0.16 | 0.69 |
| 30–39 | 0.63 | 0.43 |
| 40+ | 0.08 | 0.78 |
| GLOBAL | 5.58 | 0.13 |
| Sex of household head | ||
| Male | 1 | |
| Female | 0.07 | 0.79 |
| Source of drinking water | ||
| Piped water | 1 | |
| Borehole | 0.17 | 0.68 |
| Well water | 0.12 | 0.73 |
| Surface/pond/lake/rain/etc | 2.58 | 0.11 |
| Others | 1.82 | 0.18 |
| GLOBAL | 6.55 | 0.16 |
| Mother’s occupation | ||
| Not working | 1 | |
| Sales and Services | 0.202 | 0.65 |
| Agriculture | 6.88 |
|
| GLOBAL | 14.41 |
|
| Type of birth | ||
| Single birth | 1 | |
| Multiple births | 13 |
|
| Religion | ||
| Catholic | 1 | |
| Muslim | 0.009 | 0.92 |
| Other Christians | 0.73 | 0.39 |
| Others | 1.59 | 0.21 |
| GLOBAL | 2.21 | 0.53 |
Fig. 1Survival curves for children under the age of five by wealth index
Fig. 2Survival curves for children under the age of five by Births in the past 5 years. Some of the survival curves diverge disproportionately from each other over time and some cross each other confirming a violation of the PH assumption (see Figs. 1 and 2)
The adjusted and unadjusted hazard ratios from fitting the Cox-proportional hazard model for only those covariates that satisfy the proportionality hazard assumption
| Variable | Unadjusted HR [95% CI] | Adjusted HR [95% CI] | p-values |
|---|---|---|---|
| Father’s education | |||
| Illiterate | 1 | 1 | |
| Primary | 0.89 | 0.92 | 0.43 |
| Secondary and higher | 0.67 | 0.72 | 0.06 |
| Sex of the child | |||
| Male | 1 | ||
| Female | 0.83 | 0.83 | 0.04 |
| Age at first birth | |||
| <20 | 1 | 1 | |
| 20–29 | 0.84 | 0.86 | 0.16 |
| 30+ | 1.52 | 1.59 | 0.42 |
| Sex of household head | |||
| Male | 1 | 1 | |
| Female | 1.30 | 1.33 | 0.01 |
| Number of births in the past 1 year | |||
| No birth | 1 | 1 | |
| 1 birth | 1.18 | 1.22 | 0.04 |
| 2 births | 2.34 | 2.57 | 0.04 |
| Mother’s age (years) | |||
| <20 | 1 | 1 | |
| 20–29 | 0.66 | 0.71 | 0.08 |
| 30–39 | 0.74 | 0.79 | 0.27 |
| 40+ | 0.90 | 0.99 | 0.98 |
| Source of drinking water | |||
| Piped water | 1 | 1 | |
| Borehole | 1.24 | 1.12 | 0.39 |
| Well water | 1.17 | 1.06 | 0.69 |
| Surface/pond/lake/rain/etc | 1.44 | 1.28 | 0.15 |
| Others | 1.32 | 1.21 | 0.44 |
| Religion | |||
| Catholic | 1 | 1 | |
| Muslim | 1.01 | 1.02 | 0.91 |
| Other Christians | 0.91 | 0.94 | 0.51 |
| Others | 0.717 | 0.67 | 0.43 |
The best fitting Cox proportional hazards model
| Variable |
| HR [95% CI] | p values |
|---|---|---|---|
| Father’s education | |||
| Illiterate | 1 | ||
| Primary | −0.09 (0.09) | 0.90 | 0.31 |
| Secondary and higher | −0.41 (0.17) | 0.66 | 0.014 |
| Sex of the child | |||
| Male | 1 | ||
| Female | −0.18 (0.09) | 0.83 | 0.04 |
| Number of births in the past 1 year | |||
| No birth | 1 | ||
| 1 birth | 0.20 (0.09) | 1.22 | 0.04 |
| 2 births | 0.922( 0.45) | 2.51 | 0.04 |
| Household head | |||
| Male | 1 | ||
| Female | 0.28 (0.10) | 1.33 | 0.01 |
| Mother’s age group | |||
| Less than 20 years | 1 | ||
| 20–29 | −0.38 (0.19) | 0.68 | 0.05 |
| 30–39 | −0.27 (0.20) | 0.77 | 0.17 |
| 40+ | −0.05 (0.24) | 0.95 | 0.83 |
Fig. 3Survival curves for children under the age of five by sex of the child
Fig. 4Survival curves for children under the age of five by sex of the household head
Characteristics of the two fitted forests
| First forest | |
| Number of deaths | 477 |
| Minimum terminal node size | 3 |
| Average no. of terminal nodes | 514.902 |
| No. of variables tried at each split | 3 |
| Total no. of variables | 8 |
| Splitting rule | Log-rank |
| Error rate | 47.32 |
| Second forest | |
| Number of deaths | 477 |
| Minimum terminal node size | 3 |
| Average no. of terminal nodes | 607.567 |
| No. of variables tried at each split | 3 |
| Total no. of variables | 8 |
| Splitting rule | Log-rank score |
| Error rate | 47.36 |
Fig. 5The prediction error rate (left panel) for random survival forest of 1000 trees together with the rank of covariates (right panel) based on how they influence under-five child mortality while considering covariates that satisfy the PH assumption. The trees in this forest are built using the log-rank split-rule
Fig. 6The prediction error rate (left panel) for random survival forest of 1000 trees together with the rank of covariates (right panel) based on how they influence under-five child mortality while considering covariates that satisfy the PH assumption. Survival trees in this forest are built using the log-rank score split-rule
Characteristics of the two fitted forests
| First forest | |
| Number of deaths | 477 |
| Minimum terminal node size | 3 |
| Average no. of terminal nodes | 480.167 |
| No. of variables tried at each split | 5 |
| Total no. of variables | 19 |
| Splitting rule | Log-rank |
| Error rate | 17.29 |
| Second forest | |
| Number of deaths | 477 |
| Minimum terminal node size | 3 |
| Average no. of terminal nodes | 910.187 |
| No. of variables tried at each split | 5 |
| Total no. of variables | 19 |
| Splitting rule | Log-rank score |
| Error rate | 19.69 |
Fig. 7The prediction error rate (left panel) for random survival forest of 1000 trees together with the rank of covariates (right panel) based on how they influence under-five child mortality while considering all covariates including those that violate the PH assumption. Survival trees in this forest are built using the log-rank split-rule
Fig. 8The prediction error rate curve (left panel) for random survival forest of 1000 trees together with the rank of covariates (right panel) based on how they influence under-five child mortality while considering all covariates including those that violate the PH assumption. Survival trees in this forest are built using the log-rank score split-rule
Fig. 9Predictive performance for random survival forests with both covariates that satisfy and violate the PH assumption, the Cox PH model and random survival forests with only covariates that satisfy the PH assumption