| Literature DB >> 31315852 |
Gregory D Berg1, Virginia F Gurley2.
Abstract
OBJECTIVE: The objective is to develop and validate a predictive model for 15-month mortality using a random sample of community-dwelling Medicare beneficiaries. DATA SOURCE: The Centres for Medicare & Medicaid Services' Limited Data Set files containing the five per cent samples for 2014 and 2015. PARTICIPANTS: The data analysed contains de-identified administrative claims information at the beneficiary level, including diagnoses, procedures and demographics for 2.7 million beneficiaries.Entities:
Keywords: achine learning; classification; hospice care; palliative care; terminal care
Year: 2019 PMID: 31315852 PMCID: PMC6661632 DOI: 10.1136/bmjopen-2018-022935
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Explanatory variables
| Variable category | Number of variables | Explanation of variables |
| Charlson condition indicators | 17 | 0/1 indicator for the presence of the Charlson condition (ICD-9 and ICD-10 diagnoses) |
| Charlson score | 1 | |
| AHRQ’s clinical classification software condition indicators | 286 | 0/1 indicator for the presence of each of the condition categories identified by the AHRQ software (ICD-9 and ICD-10 diagnoses) |
| AHRQ’s clinical classification software procedure indicators | 231 | 0/1 indicator for the presence of each of the procedure categories identified by the AHRQ software (ICD-9 and ICD-10 procedures) |
| Age and gender categories | 20 | 10 age categories for each gender |
| Medical service utilisation groups | 8 | |
| Long-term care indicator | 1 | |
| Hospice indicator | 1 | This was determined as the presence of a hospice claim in the last 120 days before 31 December, 2014 |
| Palliative care indicator | 1 | |
| HMO indicator. | 1 | This was determined as six or more months of HMO coverage from the CMS LDS data |
CMS, Centres for Medicare & Medicaid Services; HMO, health maintenance organisation; ICD, International Classification of Diseases; ICD-9, ICD – ninth revision; ICD-10, ICD - tenth revision; LDS, Limited Data Set.
Updated Charlson weight rules
| Weight | Rule |
| 0 | 0<=ln(OR)<=0.15 |
| 1 | 0.15<ln(OR)<=0.45 |
| 2 | 0.45<ln(OR)<=0.75 |
| 3 | 0.75<ln(OR)<=1.05 |
| 4 | 1.05<ln(OR)<=1.35 |
| 5 | 1.35<ln(OR)<=1.65 |
| 6 | 1.65<ln(OR)<=1.95 |
Descriptive statistics from the training and validation data sets
| Training data set | Validation data set | Variable description |
| 1 357 989 | 1 356 245 | Sample size |
| 4.9% | 4.9% | Per cent with a death date listed (2015 through first quarter of 2016) |
| 29.5% | 29.5% | Per cent with six or more months of HMO in 2014 |
| 16.7% | 16.7% | Per cent with six or more months of state buy in for 2014 (duals) |
| 1.56 | 1.55 | Average Charlson score per person |
| 0.6% | 0.6% | Per cent of people with a hospice claim in the last 120 days of 2014 |
| Annualised utilisation rates per 1000 | ||
| 250.3 | 252.0 | Ambulatory/surgery visits |
| 249.9 | 251.6 | Ancillary visits |
| 1456.5 | 1452.3 | Diagnostic services visits |
| 1318.3 | 1316.0 | Emergency department visits |
| 318.0 | 315.9 | Inpatient admissions |
| 221.0 | 220.4 | Non-acute visits |
| 7749.5 | 7746.4 | Outpatient visits |
| Demographics | ||
| 54.3% | 54.5% | Per cent Female |
| 45.7% | 45.5% | Per cent Male |
| 87.1% | 87.0% | Per cent age 60 and over |
| 69.8 | 69.9 | Average age |
| Charlson conditions (percentage of population in data set) | ||
| 2.9% | 2.9% | Myocardial infarction |
| 7.3% | 7.3% | CHF |
| 9.4% | 9.4% | PVD |
| 8.5% | 8.5% | Cerebrovascular disease |
| 2.6% | 2.6% | Dementia |
| 14.1% | 14.1% | Chronic pulmonary disease |
| 2.7% | 2.7% | Connective tissue disease-rheumatic disease |
| 0.9% | 0.9% | PUD |
| 3.1% | 3.1% | Mild liver disease |
| 18.4% | 18.4% | Diabetes without complications |
| 6.1% | 6.1% | Diabetes with complications |
| 1.0% | 1.1% | Paraplegia and hemiplegia |
| 7.7% | 7.6% | Renal disease |
| 7.8% | 7.7% | Cancer |
| 0.3% | 0.3% | Moderate or severe liver disease |
| 0.9% | 0.9% | Metastatic carcinoma |
| 0.3% | 0.3% | AIDS/HIV |
CHF, congestive heart failure; HMO, health maintenance organisation; PUD, peptic ulcer disease; PVD, peripheral vascular disease.
Figure 1Mortality rate per 1000 and 95% CI by number of hospice claims. The red line is the mortality rate for all people at various levels of hospice claims. The black vertical lines represent the 95% CI for the mortality rate.
C-statistic (area under the curve) for estimated models
| Model description | C-statistic | C-statistic |
| Logistic regression: age+gender only | 0.547 | 0.549 |
| Naïve Bayes | 0.696 | 0.675 |
| Logistic regression: Charlson score | 0.713 | 0.701 |
| Logistic regression: Charlson conditions | 0.726 | 0.714 |
| Logistic regression: Elixhauser conditions | 0.734 | 0.724 |
| Decision tree with adaptive boosting | 0.762 | 0.744 |
| Support vector machine | 0.788 | 0.773 |
| Neural network | 0.795 | 0.780 |
| Logistic regression: stepwise regression | 0.797 | 0.783 |
| Logistic regression: LASSO | 0.798 | 0.784 |
| Logistic regression: all variables | 0.798 | 0.784 |
LASSO, least absolute shrinkage and selection operator.
Figure 2Actual mortality rate per 1000 by predicted risk band. Risk bands are deciles of the predicted mortality. Both predicted and actual mortality were calculated using the validation data set. The dashed line is the mortality rate for all people whereas the solid line is the mortality rate for all people excluding those with hospice.
Charlson conditions and scoring weights
| Conditions from the Romano adaptation of the Charlson index | Original Charlson | Schneeweiss | CMS national sample OR estimates | Assigned CMS national sample |
| Myocardial infarction | 1 | 1 | 1.11 | 0 |
| Congestive heart failure | 1 | 2 | 2.58 | 3 |
| Peripheral vascular disease | 1 | 1 | 1.48 | 1 |
| Cerebrovascular disease | 1 | 1 | 1.19 | 1 |
| Dementia | 1 | 3 | 5.07 | 5 |
| Chronic pulmonary disease | 1 | 2 | 1.47 | 1 |
| Connective tissue disease-rheumatic disease | 1 | 0 | 0.95 | 0 |
| Peptic ulcer disease | 1 | 0 | 1.09 | 0 |
| Mild liver disease | 1 | 2 | 1.16 | 1 |
| Diabetes without complications | 1 | 1 | 0.89 | 0 |
| Diabetes with complications | 2 | 2 | 1.03 | 0 |
| Paraplegia and hemiplegia | 2 | 1 | 1.48 | 1 |
| Renal disease | 2 | 3 | 1.69 | 2 |
| Cancer | 2 | 2 | 1.32 | 1 |
| Moderate or severe liver disease | 3 | 4 | 2.74 | 3 |
| Metastatic carcinoma | 6 | 6 | 5.72 | 6 |
| AIDS/HIV | 6 | 4 | 0.94 | 0 |
CMS, Centres for Medicare & Medicaid Services.
Figure 3Average Charlson scores and actual mortality rate by predicted risk band. Risk bands are deciles of the predicted mortality. Both Charlson scores and actual mortality were calculated using the validation data set. The dashed line is the average Charlson score using the Schneeweiss weights whereas the solid line is the average Charlson score using the updated weights. The bars represent the mortality rate for all people.