| Literature DB >> 31242925 |
Prabhat Jha1, Dinesh Kumar2, Rajesh Dikshit3, Atul Budukh3, Rehana Begum4, Prabha Sati4, Patrycja Kolpak4, Richard Wen4, Shyamsundar J Raithatha2, Utkarsh Shah2, Zehang Richard Li5, Lukasz Aleksandrowicz6, Prakash Shah4, Kapila Piyasena4, Tyler H McCormick7,8, Hellen Gelband4, Samuel J Clark6,9.
Abstract
BACKGROUND: Verbal autopsies with physician assignment of cause of death (COD) are commonly used in settings where medical certification of deaths is uncommon. It remains unanswered if automated algorithms can replace physician assignment.Entities:
Keywords: Algorithms; COD classification; Physician coding; Verbal autopsies
Mesh:
Year: 2019 PMID: 31242925 PMCID: PMC6595581 DOI: 10.1186/s12916-019-1353-2
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Fig. 1Flow diagram for the 9529 deaths in 117 mainly rural villages randomly allocated to either physician or computer COD assignment of verbal autopsies and analytic design. ϮThe following deaths were excluded for the physician and automated assignment arms, respectively: 9 and 5 refused consent after the randomization 83 and 39 were unable to provide consent (as the respondent was determined to be < 18 years), and 7 and 8 were test records from field training by surveyors. As well, 4 stillborn deaths were excluded in the physician assignment arm
Baseline characteristics of deaths by study group
| Overall | Standard (physician assignment) | Automated assignment | |
|---|---|---|---|
| Study sites | |||
| Gujarat | 5174 (55%) | 2562 (55%) | 2612 (55%) |
| Punjab | 4200 (45%) | 2089 (45%) | 2111 (45%) |
| Age groups | |||
| Adult (12–69 years) | 8704 (93%) | 4311 (93%) | 4393 (93%) |
| Child (28 days to 11 years) | 403 (4%) | 190 (4%) | 213 (5%) |
| Neonate (0–27 days) | 267 (3%) | 150 (3%) | 117 (2%) |
| Sex of the deceased* | |||
| Male | 6229 (66%) | 3086 (66%) | 3143 (67%) |
| Female | 3143 (34%) | 1564 (34%) | 1579 (33%) |
| Deceased’s education level* | |||
| No formal education | 4623 (49%) | 2317 (50%) | 2306 (49%) |
| 1–9 years | 2591 (28%) | 1252 (27%) | 1339 (28%) |
| 10+ years | 1189 (13%) | 596 (13%) | 593 (13%) |
| Not applicable as ≤ 5 years | 570 (6%) | 292 (6%) | 278 (6%) |
| Deceased’s type of house* | |||
| Semi-solid/thatched | 7442 (79%) | 3702 (79%) | 3740 (79%) |
| Solid | 1855 (20%) | 915 (20%) | 940 (20%) |
| Location of death* | |||
| Home | 6558 (70%) | 3233 (70%) | 3325 (70%) |
| Facility | 1610 (17%) | 812 (17%) | 798 (17%) |
| Other | 1190 (13%) | 599 (13%) | 591 (13%) |
| Adult key symptoms | |||
| Fever | 2834 (33%) | 1400 (32%) | 1434 (33%) |
| Breathlessness | 2171 (25%) | 998 (23%) | 1173 (27%) |
| Chest pain | 1896 (22%) | 848 (20%) | 1048 (24%) |
| Cough | 1798 (21%) | 847 (20%) | 951 (22%) |
| Weight loss | 1688 (19%) | 710 (16%) | 978 (22%) |
| Injury | 1554 (18%) | 832 (19%) | 722 (16%) |
| Paralysis/stroke | 685 (8%) | 294 (7%) | 391 (9%) |
| Diarrhea | 677 (8%) | 299 (7%) | 378 (9%) |
| Jaundice | 412 (5%) | 191 (4%) | 221 (5%) |
| Child key symptoms | |||
| Fever | 179 (44%) | 95 (50%) | 84 (39%) |
| Diarrhea | 76 (19%) | 36 (19%) | 40 (19%) |
| Jaundice | 58 (14%) | 40 (21%) | 18 (8%) |
| Injury | 61 (15%) | 24 (13%) | 37 (17%) |
| Cough | 57 (14%) | 30 (16%) | 27 (13%) |
| Neonate key symptoms | |||
| Breathing problems | 60 (23%) | 31 (21%) | 29 (25%) |
| Fever | 36 (13%) | 15 (10%) | 21 (18%) |
| Jaundice | 36 (13%) | 29 (19%) | 7 (6%) |
| Injury | 3 (1%) | 1 (1%) | 2 (2%) |
Data are in (%). Key symptoms refer to a subset of symptoms of each age group that are essential to distinguish various CODs
*Every effort was made to collect data; however, deaths with missing data for sex were 1 (0%) and 1 (0%), for education level were 194 (4%) and 207 (4%), for type of house were 34 (1%) and 43 (1%), and for location of death were 7 (0%) and 9 (0%), for physician and computer assignment study groups, respectively
Percent population-level concordance in cause of death distribution between automated assignment and standard (physician assignment) verbal autopsies, by algorithms and age groups
| Require training data | Do not require training data | ||||||
|---|---|---|---|---|---|---|---|
| Age group | Average (SD) | NBC | King-Lu | SmartVA | InSilicoVA | InSilicoVA-NT | InterVA-4 |
| Adult | 62 (15) | 50 | 44 | 57 | 66 | 77 | 80 |
| Child | 56 (11) | 51 | 58 | 36 | 60 | 66 | 66 |
| Neonate | 59 (18) | 57 | 68 | 27 | 80 | 54 | 65 |
Average and standard deviation (SD) of the population-level concordance attained for the automated algorithms when using data from all PHMRC sites as the training data. The concordance compares the cause of death distributions generated by each algorithm on the 4723 deaths in the automated arm (4393 adult, 213 child, and 117 neonatal deaths) to the distribution on the 4651 standard physician-coded deaths (4311 adult, 190 child, and 150 neonatal deaths). When only the Indian sites were used as the training data, the concordance for NBC, King-Lu, and InSilicoVA was 37, 57, and 68 for adult, 48, 59, and 66 for child, and 23, 76, and 80 for neonatal deaths, respectively. The results were similar if we excluded “ill-defined” deaths (see Additional file 1). InSilicoVA-NT and InterVA-4 do not require training data, whereas SmartVA was pre-trained on the PHMRC data; hence, the percent concordance generated by these algorithms is unchanged when changing the training dataset. Dual physician review of the automated assignment arm generated the population-level concordance of 84, 82, and 91 for adults, child, and neonate age groups, respectively (see Additional file 10)
Fig. 2Average population-level concordance (%) of algorithms with standard (physician-assigned) in a randomized trial and the average population-level concordance in earlier non-randomized studies. 100% concordance would indicate complete agreement with the standard. The horizontal bars indicate the range of the mean concordance estimates (weighted by sample size) in each study
Cause of death counts, proportions, and rankings for adults
| Rank | Cause of death | No. of deaths | Standard (physician assigned), 7% | Proportion, % (rank) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Standard (physician-assigned) | Mean algorithm estimated deaths | 2 or more algorithms agreed | Require training data | Do not require training data | ||||||||
| Total | Both physicians initially agreed† | NBC | King-Lu | SmartVA | InSilicoVA | InSilicoVA-NT | InterVA-4 | |||||
| Adult (12–69 years) | ||||||||||||
| 1 | Ischemic heart disease | 737 | 661 | 368 | 506 | 17.1 | 5.1 (5) | 8.3 (5) | 4.0 (8) | 13.0 (2) | 12.8 (2) | 8.0 (5) |
| 2 | Cancers* | 592 | 517 | 287 | 393 | 13.7 | 2.0 (12) | 1.6 (10) | 8.4 (3) | 4.4 (10) | 10.6 (4) | 12.9 (1) |
| 3 | Other noncommunicable diseases | 376 | 273 | 314 | 365 | 8.7 | 6.1 (4) | 0.2 (16) | 6.1 (5) | 6.6 (6) | 17.2 (1) | 7.5 (6) |
| 4 | Unspecified infections | 363 | 265 | 242 | 278 | 8.4 | 1.8 (13) | 0.7 (15) | 1.3 (14) | 8.5 (4) | 12.4 (3) | 9.0 (4) |
| 5 | Falls, bites, and other injuries* | 328 | 288 | 811 | 401 | 7.6 | 50.4 (1) | 32.7 (1) | 9.3 (2) | 4.6 (9) | 10.1 (6) | 5.8 (9) |
| 6 | Tuberculosis | 303 | 252 | 211 | 283 | 7.0 | 3.9 (7) | 1.9 (8) | 2.3 (10) | 0.2 (16) | 10.1 (5) | 10.9 (3) |
| 7 | Chronic respiratory diseases | 296 | 231 | 175 | 235 | 6.9 | 1.0 (15) | 0.9 (14) | 8.1 (4) | 3.8 (11) | 4.3 (9) | 6.2 (8) |
| 8 | Road and transport injuries* | 274 | 246 | 491 | 288 | 6.4 | 6.6 (2) | 14.9 (2) | 5.6 (6) | 27.8 (1) | 7.0 (7) | 6.4 (7) |
| 9 | Stroke | 232 | 196 | 160 | 197 | 5.4 | 2.2 (11) | 3.9 (7) | 5.2 (7) | 4.8 (8) | 0.7 (15) | 5.5 (10) |
| 10 | Suicide* | 208 | 185 | 200 | 109 | 4.8 | 3.9 (6) | 11.7 (3) | 1.8 (11) | 5.5 (7) | 1.9 (11) | 3.1 (13) |
| 11 | Liver and alcohol related diseases | 137 | 105 | 63 | 67 | 3.2 | 0.9 (16) | 1.5 (11) | 0.5 (16) | 2.8 (12) | 1.6 (12) | 1.4 (14) |
| 12 | Other cardiovascular diseases | 108 | 64 | 165 | 166 | 2.5 | 3.1 (8) | 1.2 (12) | 1.6 (12) | 8.5 (3) | 3.7 (10) | 4.8 (11) |
| 13 | Acute respiratory infections | 100 | 72 | 127 | 114 | 2.3 | 6.1 (3) | 1.1 (13) | 0.5 (17) | 0.5 (14) | 4.8 (8) | 4.7 (12) |
| 14 | Diarrheal diseases | 96 | 68 | 139 | 64 | 2.2 | 2.8 (10) | 6.3 (6) | 1.4 (13) | 7.9 (5) | 0.4 (17) | 0.5 (17) |
| 15 | Ill-defined | 69 | 59 | 366 | 194 | 1.6 | 0.0 (18) | 0.0 (18) | 39.7 (1) | 0.0 (18) | 0.0 (18) | 11.3 (2) |
| 16 | Diabetes mellitus | 63 | 49 | 64 | 35 | 1.5 | 1.3 (14) | 1.7 (9) | 3.6 (9) | 0.3 (15) | 0.9 (13) | 1.1 (15) |
| 17 | Maternal conditions | 25 | 21 | 121 | 29 | 0.6 | 3.0 (9) | 11.4 (4) | 0.8 (15) | 0.6 (13) | 0.6 (16) | 0.4 (18) |
| 18 | Nutritional deficiencies | 4 | 3 | 10 | 4 | 0.1 | 0.0 (17) | 0.0 (17) | 0.0 (18) | 0.0 (17) | 0.8 (14) | 0.6 (16) |
| Concordance | 50 | 44 | 57 | 66 | 77 | 80 | ||||||
*More obvious diagnoses. The order of injuries in this category, from highest to lowest number of deaths, is falls, other injuries, and bites
†Percentage of agreement between both physicians at the initial stage of ICD coding, where both physicians assigned the same cause of death for the deceased record. The overall physician initial agreement for adult was 83%; while 18% required a third physician to arbitrate (adjudication)
Individual-level sensitivity in the cause of death assignment predicted by different algorithms for adult deaths (12–69 years) in the computer assignment arm (n = 4393)
| Comparator listed below | SmartVA | InSilicoVA | ISilicoVA-NT | InterVA-4 |
|---|---|---|---|---|
| NBC | 22 | 23 | 19 | 18 |
| SmartVA | * | 21 | 38 | 40 |
| InSilicoVA | * | 22 | 25 | |
| InSilicoVA-NT | * | 67 |
Average for the five algorithms: 30 (SD 16). King-Lu only produces population-level results and, thus, was not included. Individual-level sensitivity calculates each algorithm combination (i.e., NBC against SmartVA and SmartVA against NBC). Dual physician review of computer assignment arm produced the following individual-level sensitivity for each computer algorithm: NBC 22; SmartVA 47; InSilicoVA 31; InSilicoVA-NT 51; and InterVA-4 53
*Not applicable