| Literature DB >> 28350807 |
Yannick Fortin1,2, James A G Crispo1,2,3, Deborah Cohen2,4,5, Douglas S McNair6, Donald R Mattison1,7, Daniel Krewski1,2,7.
Abstract
Assessing prevalent comorbidities is a common approach in health research for identifying clinical differences between individuals. The objective of this study was to validate and compare the predictive performance of two variants of the Elixhauser comorbidity measures (ECM) for inhospital mortality at index and at 1-year in the Cerner Health Facts® (HF) U.S. DATABASE: We estimated the prevalence of select comorbidities for individuals 18 to 89 years of age who received care at Cerner contributing health facilities between 2002 and 2011 using the AHRQ (version 3.7) and the Quan Enhanced ICD-9-CM ECMs. External validation of the ECMs was assessed with measures of discrimination [c-statistics], calibration [Hosmer-Lemeshow goodness-of-fit test, Brier Score, calibration curves], added predictive ability [Net Reclassification Improvement], and overall model performance [R2]. Of 3,273,298 patients with a mean age of 43.9 years and a female composition of 53.8%, 1.0% died during their index encounter and 1.5% were deceased at 1-year. Calibration measures were equivalent between the two ECMs. Calibration performance was acceptable when predicting inhospital mortality at index, although recalibration is recommended for predicting inhospital mortality at 1 year. Discrimination was marginally better with the Quan ECM compared the AHRQ ECM when predicting inhospital mortality at index (cQuan = 0.887, 95% CI: 0.885-0.889 vs. cAHRQ = 0.880, 95% CI: 0.878-0.882; p < .0001) and at 1-year (cQuan = 0.884, 95% CI: 0.883-0.886 vs. cAHRQ = 0.880, 95% CI: 0.878-0.881, p < .0001). Both the Quan and the AHRQ ECMs demonstrated excellent discrimination for inhospital mortality of all-causes in Cerner Health Facts®, a HIPAA compliant observational research and privacy-protected data warehouse. While differences in discrimination performance between the ECMs were statistically significant, they are not likely clinically meaningful.Entities:
Mesh:
Year: 2017 PMID: 28350807 PMCID: PMC5369776 DOI: 10.1371/journal.pone.0174379
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient demographic and index encounter characteristics, N = 3,273,298.
| Characteristic | N (%) |
|---|---|
| Female | 1,761,525 (53.8) |
| Male | 1,511,773 (46.2) |
| Mean ± SE | 43.9 ± 0.01 |
| Caucasian | 2,366,665 (72.3) |
| African American | 711,051 (21.7) |
| Hispanic | 146,877 (4.5) |
| Asian | 48,705 (1.5) |
| Private | 795,449 (24.3) |
| Medicare | 370,701 (11.3) |
| Medicaid | 248,009 (7.6) |
| Uninsured | 378,536 (11.6) |
| Other | 139,745 (4.3) |
| Missing | 1,340,858 (41.0) |
| Low | 3,010,916 (92.0) |
| High | 262,382 (8.0) |
| Deaths | 31,298 (1.0) |
| Deaths | 50,215 (1.5) |
| ED Visit | 2,204,680 (67.4) |
| Inpatient Stay | 1,068,618 (32.6) |
| Northeast | 1,180,270 (36.1) |
| Midwest | 648,644 (19.8) |
| South | 1,077,965 (32.9) |
| West | 366,419 (11.2) |
Abbreviations: ED, emergency department; SE, Standard error.
Prevalence of comorbid conditions by ECM variant, N = 3,273,298.
| Condition | Quan, N (%) | AHRQ, N (%) | McNamar’s Test | Cohen’s h |
|---|---|---|---|---|
| Hypertension | 572,139 (17.48) | 573,457 (17.52) | < .0001 | 0.001 |
| Chronic Pulmonary Disease | 256,170 (7.83) | 248,676 (7.60) | < .0001 | 0.009 |
| Diabetes Uncomplicated | 226,807 (6.93) | 227,815 (6.96) | < .0001 | 0.001 |
| Fluid and Electrolyte Disorders | 187,321 (5.72) | 186,282 (5.69) | < .0001 | 0.001 |
| Cardiac Arrhythmia | 174,656 (5.34) | na | na | na |
| Depression | 113,659 (3.47) | 89,410 (2.73) | < .0001 | 0.043 |
| Congestive Heart Failure | 99,280 (3.03) | 90,775 (2.77) | < .0001 | 0.015 |
| Alcohol Abuse | 89,460 (2.73) | 86,307 (2.64) | < .0001 | 0.006 |
| Hypothyroidism | 85,493 (2.61) | 83,926 (2.56) | < .0001 | 0.003 |
| Obesity | 79,680 (2.43) | 80,996 (2.47) | < .0001 | 0.003 |
| Other Neurological Disorders | 76,237 (2.33) | 97,360 (2.97) | < .0001 | 0.040 |
| Drug Abuse | 61,260 (1.87) | 59,029 (1.80) | < .0001 | 0.005 |
| Renal Failure | 60,238 (1.84) | 57,231 (1.75) | < .0001 | 0.007 |
| Solid Tumor without Metastasis | 56,376 (1.72) | 56,519 (1.73) | < .0001 | 0.000 |
| Valvular Disease | 51,250 (1.57) | 21,907 (0.67) | < .0001 | 0.087 |
| Peripheral Vascular Disorders | 41,651 (1.27) | 43,410 (1.33) | < .0001 | 0.005 |
| Diabetes Complicated | 34,016 (1.04) | 34,111 (1.04) | < .0001 | 0.000 |
| Liver Disease | 31,307 (0.96) | 21,910 (0.67) | < .0001 | 0.032 |
| Psychoses | 29,656 (0.91) | 69,664 (2.13) | < .0001 | 0.102 |
| Coagulopathy | 27,019 (0.83) | 27,330 (0.83) | < .0001 | 0.001 |
| Rheumatoid Arthritis/collagen | 24,143 (0.74) | 22,407 (0.68) | < .0001 | 0.006 |
| Metastatic Cancer | 22,836 (0.70) | 23,004 (0.70) | < .0001 | 0.001 |
| Weight Loss | 21,219 (0.65) | 19,429 (0.59) | < .0001 | 0.007 |
| Pulmonary Circulation Disorders | 21,084 (0.64) | 20,894 (0.64) | < .0001 | 0.001 |
| Deficiency Anemia | 19,745 (0.60) | 104,246 (3.18) | < .0001 | 0.203 |
| Paralysis | 13,707 (0.42) | 19,001 (0.58) | < .0001 | 0.023 |
| Blood Loss Anemia | 10,884 (0.33) | 24,487 (0.75) | < .0001 | 0.058 |
| Peptic Ulcer Disease excl. bleeding | 8,251 (0.25) | 507 (0.02) | < .0001 | 0.076 |
| Lymphoma | 6,778 (0.21) | 6,920 (0.21) | < .0001 | 0.001 |
| AIDS/HIV | 4,246 (0.13) | 4,246 (0.13) | 1.000 | 0.000 |
* Described as a small effect size according to the interpretation criteria suggested by Cohen (1988).
Measures of discrimination and calibration performance by ECM and mortality outcome.
| External Validation | |||||
|---|---|---|---|---|---|
| Inhospital Mortality at Index | Inhospital Mortality at 1 Year | ||||
| Quan | AHRQ | Quan | AHRQ | ||
| AUROC | 0.887 (0.885,0.889) | 0.880 (0.878,0.882) | 0.884 (0.883,0.886) | 0.880 (0.878,0.881) | |
| HL Test | 485.5 | 459.8 | 890.1 | 879.1 | |
| Brier Score | 0.009 | 0.009 | 0.014 | 0.014 | |
| R2
| 24.9 | 23.1 | 25.1 | 24.0 | |
| NRI>0 | 0.6115 (0.6006,0.6224) | 0.5234 (0.5149,0.5318) | |||
| Reclassification, E—NE | -18%–79% | -27%–80% | |||
| Deaths (%) | 31,298 (1.0) | 50,215 (1.5) | |||
| AUROC | 0.870 (0.865,0.874) | 0.861 (0.857,0.866) | 0.834 (0.830,0.837) | 0.830 (0.826,0.833) | |
| HL Test | 106.6 | 96.7 | 271.5 | 261.1 | |
| Brier Score | 0.018 | 0.018 | 0.037 | 0.037 | |
| R2
| 22.7 | 21.4 | 20.7 | 20.0 | |
| NRI>0 | 0.4725 (0.4450,0.5000) | 0.3566 (0.3384,0.3747) | |||
| Reclassification, E—NE | -15%–62% | -28%–64% | |||
| Deaths (%) | 5,035 (1.9) | 11,043 (4.2) | |||
| AUROC | 0.886 (0.884,0.889) | 0.879 (0.877,0.881) | 0.885 (0.883,0.886) | 0.880 (0.878,0.882) | |
| HL Test | 378.0 | 410.6 | 619.4 | 616.2 | |
| Brier Score | 0.008 | 0.008 | 0.012 | 0.012 | |
| R2
| 24.8 | 22.9 | 25.0 | 23.8 | |
| NRI>0 | 0.6183 (0.6064,0.6302) | 0.5384 (0.5288,0.548) | |||
| Reclassification, E—NE | -18%–80% | -27%–80% | |||
| Deaths (%) | 26,263 (0.9) | 39,172 (1.3) | |||
* P-value < 0.001. E = Events. NE = Non-events.
a Area under the Receiver Operating Characteristic (ROC) curve (AUROC). AUROC is a measure of discrimination ranging from 0.5 (zero discrimination) to 1.0 (perfect discrimination).
b Pearson chi-square value derived from the Hosmer–Lemeshow goodness-of-fit test [32].
c Measure of predictive accuracy, greater accuracy is reflected by lower score.
d R-squared, explained variation, displayed in percentage.
e Category-free net reclassification improvement using the AHRQ ECM as the reference model.
f E–NE, percentage of events (E) and non-events (NE) correctly reclassified by the Quan ECM compared to the AHRQ ECM.
g AUROC curve differed significantly from the baseline model limited to age and sex (p < 0.0001), and from the competing ECM (p < 0.0001). Differences between AUROC curves were evaluated with the Mann-Whitney U test approach developed by DeLong et al. [35]. In the unstratified sample, the baseline model had an AUROC of 0.820 (95% CI 0.818–0.822) for inhospital mortality at index, and 0.826 (95% CI 0.824–0.827) for inhospital mortality at 1 year. For high risk patients, the baseline model had an AUROC of 0.770 (95% CI 0.764–0.775) for inhospital mortality at index, and 0.755 (95% CI 0.751–0.759) for inhospital mortality at 1 year. For low risk patients, the baseline model had an AUROC of 0.821 (95% CI 0.819–0.823) for inhospital mortality at index, and 0.828 (95% CI 0.826–0.830) for inhospital mortality at 1 year.
h High risk patients had one or more inpatient stay in the 12 months preceding the index encounter or three or more emergency department visits in the 3 months preceding the index encounter. Patients that did not satisfy the high-risk criteria were assigned to the low risk group.
Fig 1AUROC comparisons by ECM for predicting inhospital mortality at index [A] and at 1 year [B]. ROC = receiver operating characteristic, AUROC = area under the receiver operating characteristic.
Fig 2AUROC comparisons by ECM for predicting inhospital mortality at index [A] and at 1 year [B] in high risk patients, and inhospital mortality at index [C] and at 1 year [D] in low risk patients. High risk patients had one or more inpatient stay in the 12 months preceding the index encounter or three or more emergency department visits in the 3 months preceding the index encounter. Patients that did not satisfy the high-risk criteria were assigned to the low risk group. ROC = receiver operating characteristic, AUROC = area under the receiver operating characteristic.
Fig 3Observed versus predicted risk of inhospital mortality [A] at index and [B] at 1 Year. Perfect calibration is represented by the full line with a slope of 1 starting at the origin.
Fig 4Observed versus predicted risk of inhospital mortality at index [A] and at 1 year [B] for high risk patients, and inhospital mortality at index [C] and at 1 year [D] for low risk patients. Perfect calibration is represented by the full line with a slope of 1 starting at the origin. High risk patients had one or more inpatient stay in the 12 months preceding the index encounter or three or more emergency department visits in the 3 months preceding the index encounter. Patients that did not satisfy the high-risk criteria were assigned to the low risk group.