| Literature DB >> 31386230 |
Andreas Gleiss1, Michael Schemper1.
Abstract
We suggest measures to quantify the degrees of necessity and of sufficiency of prognostic factors for dichotomous and for survival outcomes. A cause, represented by certain values of prognostic factors, is considered necessary for an event if, without the cause, the event cannot develop. It is considered sufficient for an event if the event is unavoidable in the presence of the cause. Necessity and sufficiency can be seen as the two faces of causation, and this symmetry and equal relevance are reflected by the suggested measures. The measures provide an approximate, in some cases an exact, multiplicative decomposition of explained variation as defined by Schemper and Henderson for censored survival and for dichotomous outcomes. The measures, ranging from zero to one, are simple, intuitive functions of unconditional and conditional probabilities of an event such as disease or death. These probabilities often will be derived from logistic or Cox regression models; the measures, however, do not require any particular model. The measures of the degree of necessity implicitly generalize the established attributable fraction or risk for dichotomous prognostic factors and dichotomous outcomes to continuous prognostic factors and to survival outcomes. In a setting with multiple prognostic factors, they provide marginal and partial results akin to marginal and partial odds and hazard ratios from multiple logistic and Cox regression. Properties of the measures are explored by an extensive simulation study. Their application is demonstrated by three typical real data examples.Entities:
Keywords: Cox regression; attributable risk; explained variation; logistic regression; necessary condition; sufficient condition
Mesh:
Year: 2019 PMID: 31386230 PMCID: PMC6771968 DOI: 10.1002/sim.8331
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Data of 12 664 males in Sweden7
| Never smoked | Ever smoked | Total | |
|---|---|---|---|
| Died from lung cancer | 36 | 177 | 213 |
| Otherwise | 8120 | 4331 | 12 451 |
| Total | 8156 | 4508 | 12 664 |
Notation for a 2 x 2 setting
| Outcome | |||
|---|---|---|---|
|
|
|
| |
|
|
|
| |
| 0 | 1 |
|
Population values of DN, DS, and EV for dichotomous outcome and dichotomous prognostic factor
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.1 | 0.1 | 10 | 84.1 | 5.9 | 5.9 | 4.1 | 0.346 | 0.346 | 0.120 |
| 2 | 0.1 | 0.5 | 10 | 48.9 | 41.1 | 1.1 | 8.9 | 0.787 | 0.087 | 0.069 |
| 3 | 0.1 | 0.9 | 10 | 9.9 | 80.1 | 0.1 | 9.9 | 0.878 | 0.011 | 0.010 |
| 4 | 0.5 | 0.1 | 10 | 48.9 | 1.1 | 41.1 | 8.9 | 0.087 | 0.787 | 0.069 |
| 5 | 0.5 | 0.5 | 10 | 38.0 | 12.0 | 12.0 | 38.0 | 0.519 | 0.519 | 0.270 |
| 6 | 0.5 | 0.9 | 10 | 8.9 | 41.1 | 1.1 | 48.9 | 0.787 | 0.087 | 0.069 |
| 7 | 0.9 | 0.1 | 10 | 9.9 | 0.1 | 80.1 | 9.9 | 0.011 | 0.878 | 0.010 |
| 8 | 0.9 | 0.5 | 10 | 8.9 | 1.1 | 41.1 | 48.9 | 0.087 | 0.787 | 0.069 |
| 9 | 0.9 | 0.9 | 10 | 4.1 | 5.9 | 5.9 | 84.1 | 0.346 | 0.346 | 0.120 |
| 10 | 0.1 | 0.1 | 100 | 87.4 | 2.6 | 2.6 | 7.4 | 0.716 | 0.716 | 0.513 |
| 11 | 0.1 | 0.5 | 100 | 49.9 | 40.1 | 0.1 | 9.9 | 0.975 | 0.108 | 0.106 |
| 12 | 0.1 | 0.9 | 100 | 10.0 | 80.0 | 0.0 | 10.0 | 0.988 | 0.012 | 0.012 |
| 13 | 0.5 | 0.1 | 100 | 49.9 | 0.1 | 40.1 | 9.9 | 0.108 | 0.975 | 0.106 |
| 14 | 0.5 | 0.5 | 100 | 45.5 | 4.5 | 4.5 | 45.5 | 0.818 | 0.818 | 0.669 |
| 15 | 0.5 | 0.9 | 100 | 9.9 | 40.1 | 0.1 | 49.9 | 0.975 | 0.108 | 0.106 |
| 16 | 0.9 | 0.1 | 100 | 10.0 | 0.0 | 80.0 | 10.0 | 0.012 | 0.988 | 0.012 |
| 17 | 0.9 | 0.5 | 100 | 9.9 | 0.1 | 40.1 | 49.9 | 0.108 | 0.975 | 0.106 |
| 18 | 0.9 | 0.9 | 100 | 7.4 | 2.6 | 2.6 | 87.4 | 0.716 | 0.716 | 0.513 |
| 19 | 0.1 | 0.1 | 0.1 | 5.9 | 84.1 | 4.1 | 5.9 | 0.346 | 0.346 | 0.120 |
| 20 | 0.1 | 0.5 | 0.1 | 41.1 | 48.9 | 8.9 | 1.1 | 0.787 | 0.087 | 0.069 |
| 21 | 0.1 | 0.9 | 0.1 | 80.1 | 9.9 | 9.9 | 0.1 | 0.878 | 0.011 | 0.010 |
| 22 | 0.5 | 0.1 | 0.1 | 1.1 | 48.9 | 8.9 | 41.1 | 0.087 | 0.787 | 0.069 |
| 23 | 0.5 | 0.5 | 0.1 | 12.0 | 38.0 | 38.0 | 12.0 | 0.519 | 0.519 | 0.270 |
| 24 | 0.5 | 0.9 | 0.1 | 41.1 | 8.9 | 48.9 | 1.1 | 0.787 | 0.087 | 0.069 |
| 25 | 0.9 | 0.1 | 0.1 | 0.1 | 9.9 | 9.9 | 80.1 | 0.011 | 0.878 | 0.010 |
| 26 | 0.9 | 0.5 | 0.1 | 1.1 | 8.9 | 48.9 | 41.1 | 0.087 | 0.787 | 0.069 |
| 27 | 0.9 | 0.9 | 0.1 | 5.9 | 4.1 | 84.1 | 5.9 | 0.346 | 0.346 | 0.120 |
| 28 | 0.1 | 0.1 | 1 | 81.0 | 9.0 | 9.0 | 1.0 | 0.000 | 0.000 | 0.000 |
| 29 | 0.5 | 0.5 | 1 | 25.0 | 25.0 | 25.0 | 25.0 | 0.000 | 0.000 | 0.000 |
| 30 | 0.5 | 0.5 | 0 | 0.0 | 50.0 | 50.0 | 0.0 | 1.000 | 1.000 | 1.000 |
| 31 | 0.1 | 0.5 | ∞ | 50.0 | 40.0 | 0.0 | 10.0 | 1.000 | 0.111 | 0.111 |
| 32 | 0.5 | 0.5 | ∞ | 50.0 | 0.0 | 0.0 | 50.0 | 1.000 | 1.000 | 1.000 |
| 33 | 0.9 | 0.5 | ∞ | 10.0 | 0.0 | 40.0 | 50.0 | 0.111 | 1.000 | 0.111 |
| 34 | 0.01 | 0.5 | 100 | 50.0 | 49.0 | 0.01 | 1.0 | 0.980 | 0.010 | 0.010 |
| 35 | 0.5 | 0.01 | 100 | 50.0 | 0.01 | 49.0 | 1.0 | 0.010 | 0.980 | 0.010 |
Abbreviations: P(D), unconditional disease probability; OR, odds ratio; α, probability of harmful level of X; a, b, c, d, as by Table 2 but with probabilities (%).
Figure 1Population values of DN, DS, and EV for four scenarios with OR = 10, 100, P(D) = 0.5, 0.9, and standard normally distributed prognostic factor X. Below and above each of the corresponding panels, the distributions of X for Y = 0 and Y = 1 are shown, the areas of the smoothed histograms being proportional to 1 ‐ P(D) and P(D), respectively. Black solid line: logistic regression curve P(D | X); dashed horizontal line at P(D); dashed vertical line at X where P(D | X) = P(D), resulting in α = 0.50, 0.50, 0.75, 0.82 for upper left to lower right panel
The DN, DS, and EV for survival outcome and balanced dichotomous prognostic factor
| Type 1 censoring | Administrative censoring | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| HR |
| % cens. |
|
|
| % cens. |
|
|
|
| 1 | ‐ | 0 | 0.00 | 0.00 | 0.00 | ||||
| 1.64 | 20 | 0.00 | 0.00 | 0.00 | 50 | 0.00 | 0.00 | 0.00 | |
| 0.22 | 80 | 0.00 | 0.00 | 0.00 | 90 | 0.00 | 0.00 | 0.00 | |
| 2 | ‐ | 0 | 0.22 | 0.31 | 0.05 | ||||
| 2.26 | 23 | 0.25 | 0.20 | 0.05 | 50 | 0.25 | 0.20 | 0.05 | |
| 0.30 | 80 | 0.32 | 0.04 | 0.02 | 90 | 0.33 | 0.04 | 0.02 | |
| 10 | ‐ | 0 | 0.55 | 0.62 | 0.32 | ||||
| 4.90 | 31 | 0.71 | 0.44 | 0.34 | 50 | 0.71 | 0.44 | 0.34 | |
| 0.43 | 81 | 0.80 | 0.09 | 0.09 | 90 | 0.80 | 0.09 | 0.10 | |
| 100 | ‐ | 0 | 0.67 | 0.69 | 0.48 | ||||
| 15.30 | 43 | 0.96 | 0.45 | 0.53 | 50 | 0.96 | 0.45 | 0.53 | |
| 0.47 | 81 | 0.98 | 0.11 | 0.14 | 90 | 0.98 | 0.10 | 0.13 | |
Abbreviations: HR, hazard ratio; τ, time of maximum follow‐up; % cens., percentage of censored observations. For Type 1 censoring, the entries are population values, while under administrative censoring, they are based on simulated samples of 100 000 observations.
Note that both variants of DN and of DS are identical for dichotomous prognostic factors.
Population values of DN, DS, and EV for uncensored survival outcome and dichotomous prognostic factor
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| HR |
|
|
|
|
|
|
|
|
|
| 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 2 | 0.05 | 0.47 | 0.02 | 0.22 | 0.31 | 0.05 | 0.36 | 0.08 | 0.02 |
| 10 | 0.16 | 0.85 | 0.12 | 0.55 | 0.61 | 0.32 | 0.80 | 0.21 | 0.12 |
| 100 | 0.24 | 0.94 | 0.17 | 0.68 | 0.68 | 0.48 | 0.93 | 0.25 | 0.17 |
Abbreviations: HR, hazard ratio; α, probability of harmful level of X.
Marginal degrees of necessity and sufficiency and explained variation of prognostic factors for tumor penetration of prostatic capsule
| Prognostic factors |
|
|
|
|
|
|---|---|---|---|---|---|
| Age | <0.01 | 0.05 | 0.04 | 0.04 | 0.03 |
| Digital rectal exam | 0.09 | 0.49 | 0.20 | 0.49 | 0.12 |
| PSA | 0.12 | 0.23 | 0.46 | 0.21 | 0.34 |
| Tumor volume | 0.01 | 0.16 | 0.08 | 0.12 | 0.07 |
| Gleason score | 0.23 | 0.51 | 0.44 | 0.47 | 0.39 |
| Race | <0.01 | 0.03 | <0.01 | 0.03 | <0.01 |
| Capsular involvement | 0.06 | 0.10 | 0.58 | 0.10 | 0.58 |
|
| 0.30 | 0.56 | 0.52 | 0.50 | 0.43 |
Marginal degrees of sufficiency and necessity and explained variation of prognostic factors for survival with primary biliary cirrhosis
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| log (bilirubin) | 0.31 | 0.56 | 0.51 | 0.52 | 0.43 |
| log (prothrombin time) | 0.09 | 0.28 | 0.24 | 0.25 | 0.17 |
| Edema | 0.10 | 0.20 | 0.53 | 0.20 | 0.53 |
| Albumin | 0.17 | 0.41 | 0.36 | 0.35 | 0.28 |
| Age | 0.06 | 0.29 | 0.20 | 0.25 | 0.16 |
|
| 0.40 | 0.63 | 0.59 | 0.59 | 0.50 |
Figure 2The effect of edema (left panel) and of log bilirubin (right panel) on survival with primary biliary cirrhosis. Log bilirubin grouped according to quartiles