| Literature DB >> 26464564 |
Abraham D Flaxman1, Peter T Serina1, Bernardo Hernandez1, Christopher J L Murray1, Ian Riley2, Alan D Lopez3.
Abstract
BACKGROUND: Verbal autopsy is gaining increasing acceptance as a method for determining the underlying cause of death when the cause of death given on death certificates is unavailable or unreliable, and there are now a number of alternative approaches for mapping from verbal autopsy interviews to the underlying cause of death. For public health applications, the population-level aggregates of the underlying causes are of primary interest, expressed as the cause-specific mortality fractions (CSMFs) for a mutually exclusive, collectively exhaustive cause list. Until now, CSMF Accuracy is the primary metric that has been used for measuring the quality of CSMF estimation methods. Although it allows for relative comparisons of alternative methods, CSMF Accuracy provides misleading numbers in absolute terms, because even random allocation of underlying causes yields relatively high CSMF accuracy. Therefore, the objective of this study was to develop and test a measure of CSMF that corrects this problem.Entities:
Year: 2015 PMID: 26464564 PMCID: PMC4603634 DOI: 10.1186/s12963-015-0061-1
Source DB: PubMed Journal: Popul Health Metr ISSN: 1478-7954
Random allocation algorithm
| Training Data: mutually exclusive, collectively exhaustive cause list ( |
| Input: VAI results |
| Output: Cause C, selected uniformly at random from ( |
Confusion matrices for physician-certified verbal autopsy and random-allocation verbal autopsy.
| a) Physician Certified VA Confusion Matrix | b) Random Allocation Confusion Matrix | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Predicted | Predicted | ||||||||
| Stroke | Diabetes | Other | Stroke | Diabetes | Other | ||||
| True | Stroke | 123 | 18 | 125 | True | Stroke | 87 | 84 | 95 |
| Diabetes | 5 | 86 | 55 | Diabetes | 32 | 61 | 53 | ||
| Other | 83 | 95 | 2112 | Other | 746 | 780 | 764 | ||
Panel A shows the confusion matrix for physician certified verbal autopsy (with a length-three cause list for clarity). The entry in each cell counts the number of deaths truly due to the row cause that were predicted to be due to the column cause. For example, the value 83 in the “other” row, “stroke” column indicates that 83 deaths truly due to causes other than stroke or diabetes were (incorrectly) attributed to stroke by physicians. This table demonstrates that (for this dataset) physicians are right more often than they are wrong when they predict stroke as the cause of death, but wrong more than they are right when they predict diabetes. Panel B shows the confusion matrix for Random Allocation with the same dataset, where random chance predicts stroke and diabetes incorrectly for a vast majority of the cases. True and PCVA data from Lozano et al. [18, 22], where physicians were presented with VAI data where the underlying cause was known to meet stringent clinical diagnostic criteria, and their results compared to the truth
Random-From-Train Algorithm
| Training Data: VAI results ( |
| Input: VAI result |
| Output: |
Confusion matrix for Random-From-Train verbal autopsy.
| a) Random-From-Train Confusion Matrix | ||||
|---|---|---|---|---|
| Predicted | ||||
| Stroke | Diabetes | Other | ||
| True | Stroke | 28 | 14 | 224 |
| Diabetes | 11 | 13 | 122 | |
| Other | 223 | 106 | 1961 | |
The confusion matrix for Random-From-Train (with a length-three cause-list for clarity). As in Table 2, the entry in each cell counts the number of deaths truly due to the row cause that were predicted to be due to the column cause. This table demonstrates that while Random-From-Train is inaccurate at the individual level, at the population level the prediction distribution can closely match the truth
Fig. 1CSMF Accuracy of random allocation as a function of CoD list length. The mean CSMF accuracy of random allocation was calculated with 10,000 Monte Carlo replicates for cause-list length ranging from 3 to 50. The CSMF accuracy decreases monotonically as a function of J and appears to stay above 1 − 1/e ≈ 0.632, which we selected for our chance-correction parameter
Fig. 2Comparison of concordance from Monte Carlo calculation and analytic calculation. The analogous chance-correction value for concordance was calculated analytically in Murray et al. [13], and we confirmed its accuracy in our simulation environment. The absolute relative difference was always less than 1 %
Fig. 3Comparison of individual-level and population-level prediction quality for three commonly used methods: InterVA, Tariff, physician-certified verbal autopsy (PCVA). Questions that rely on the deceased having health care experience (HCE) are necessary for population-level PCVA quality to surpass random guessing. Data from Murray et al. [12]
CCCSMF accuracy of Random Allocation and Random-From-Train with and without resampling the test CSMF distribution.
|
| Random-From-Train (Same CSMF) | Random Allocation | Random-From-Train (Resampled CSMF) |
|---|---|---|---|
| 5 | 0.980 | 0.075 | 0.092 |
| 15 | 0.964 | 0.028 | 0.027 |
| 25 | 0.953 | 0.016 | 0.016 |
| 35 | 0.945 | 0.010 | 0.007 |
| 50 | 0.933 | 0.006 | −0.005 |
This table demonstrates the importance of resampling the CSMF distribution in the test set; if the test and train sets have the same CSMF distribution, then simple approaches like Random-From-Train, as well as state-of-the-art approaches like King-Lu [23], can appear to have better performance than is justified, due to “overfitting”