| Literature DB >> 26606986 |
Eleazar Gil-Herrera1, Garrick Aden-Buie2, Ali Yalcin3, Athanasios Tsalatsanis1, Laura E Barnes4, Benjamin Djulbegovic1,5.
Abstract
BACKGROUND: This paper explores and evaluates the application of classical and dominance-based rough set theory (RST) for the development of data-driven prognostic classification models for hospice referral. In this work, rough set based models are compared with other data-driven methods with respect to two factors related to clinical credibility: accuracy and accessibility. Accessibility refers to the ability of the model to provide traceable, interpretable results and use data that is relevant and simple to collect.Entities:
Mesh:
Year: 2015 PMID: 26606986 PMCID: PMC4659220 DOI: 10.1186/s12911-015-0216-9
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Example decision table
| Condition attributea | Decision attribute | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| |
| Patient | Gender | Age | SystBP | HDL | Diabetic | Smoker | Coronary disease |
|
| F | H | M | L | No | No | No |
|
| M | L | L | L | No | Yes | No |
|
| F | M | M | H | No | No | No |
|
| F | M | M | H | No | No | Yes |
|
| M | H | H | L | Yes | Yes | Yes |
|
| M | H | H | L | Yes | Yes | Yes |
|
| F | M | M | H | No | No | Yes |
aGender: Female/Male; Age: L = [54,59), M = [59,69), H = [69,74]; SystBP: L =<129, M = [ 129−139], H =(139−159]; HDL: L = <40 M = [ 40−60], H =>60
Description of attributes from SUPPORT dataset
| Variable name | Description | Patient distribution | ||
|---|---|---|---|---|
| Numerical condition attributes | Range | Mean | Std. Dev. | |
|
| Age of the patient | 18–101 | 62.65 | 15.59 |
|
| Serum albumin | 0.4–29 | 2.95 | 0.87 |
|
| Bilirubin | 0.1–63 | 2.55 | 5.32 |
|
| Serum creatinine | 0.09–21.5 | 1.77 | 1.69 |
|
| Number of days in hospital at study entry | 1–148 | 1.00 | 9.13 |
|
| Heart rate | 0–300 | 97.16 | 31.56 |
|
| Mean arterial blood pressure | 0–195 | 84.55 | 27.70 |
|
| Blood gasses, | 12–890.4 | 239.50 | 109.70 |
|
| Respiration rate | 0–90 | 23.33 | 9.57 |
|
| SUPPORT coma score, based on Glasgow coma scale | 0–100 | 12.06 | 24.63 |
|
| Sodium | 110–181 | 137.60 | 6.03 |
|
| Temperature in °C | 31.7–41.7 | 37.10 | 1.25 |
|
| White blood cell count | 0.05–200 | 12.35 | 9.27 |
| Categorical condition attributes | Patients | Percentage (%) | ||
|
| Diagnosis group: | |||
|
| 3,513 | 38.59 | ||
| CHF | 1,387 | 15.23 | ||
| Cirrhosis | 508 | 5.56 | ||
| Colon cancer | 512 | 5.62 | ||
| Coma | 596 | 6.54 | ||
| COPD | 967 | 10.60 | ||
| Lung cancer | 908 | 9.97 | ||
| MOSF w. malignancy | 712 | 7.81 | ||
|
| Presence of cancer: | |||
|
| 1,252 | 13.75 | ||
|
| 5,993 | 65.84 | ||
|
| 1,858 | 20.40 | ||
| Decision attribute | Patients | Percentage (%) | ||
|
| Death occurred within 6 months: | |||
|
| 4,263 | 46.83 | ||
|
| 4,840 | 53.17 | ||
Values of 0 for hrt, meanbp and resp correspond to cardiac arrests during the day when the measurements were taken
Fig. 1Kaplan-Meier survivability of patients with respect to number of days until death
Fig. 2Kaplan-Meier survival curve by dzgroup
Discretized attributes not in APACHE III
| Attribute | Description | Categorization |
|---|---|---|
|
| Minor | (∗,9] |
| Moderate | (9,44] | |
| Severe | (44,∗) | |
|
| Normal | [ 300,∗) |
| Severe defect in gas exchange | [ 200,300) | |
| Acute respiratory distress syndrome | [ 0,200) | |
|
| Short | (∗,44] |
| Long | (44,∗] |
AUC and coverage for MODLEM and VC-DomLEM algorithms with l and m-consistent rules
| MODLEM | VC-DomLEM | |||
|---|---|---|---|---|
|
| AUC | Coverage (%) | AUC | Coverage (%) |
| 0.1 | 0.6646 | 100.00 | 0.7280 | 99.88 |
| 0.2 | 0.6646 | 100.00 | 0.7279 | 99.87 |
| 0.4 | 0.6888 | 100.00 | 0.7277 | 99.65 |
| 0.6 | 0.6974 | 97.41 | 0.7173 | 98.72 |
| 0.8 | 0.6419 | 86.72 | 0.7093 | 76.85 |
| 1.0 | 0.6158 | 80.08 | 0.6559 | 35.89 |
Fig. 3Number of rules fired in each test case for m-consistent MODLEM classifiers
Fig. 4Number of rules fired in each test case for l-consistent VC-DomLEM classifiers
Number of descriptors and rules in MODLEM and VC-DomLEM induced decision rule sets, for m=l=0.6 consistent rules, across the five cross validation folds
| Descriptors in rules | ||||
|---|---|---|---|---|
| Method | Mean number of rules | Min. | Max. | Mean |
| MODLEM | 773 | 1 | 8 | 3.65 |
| VC-DomLEM | 1095 | 2 | 13 | 6.85 |
Summary of performance evaluation results of the classification models, averaged over the 5 cross validation folds, with standard deviations
| Method | AUC | Kappa | Sensitivity | Specificity | Threshold ( |
|---|---|---|---|---|---|
| VC-DomLEM | 0.7173 (0.014) | 0.35 (0.03) | 0.6391 (0.042) | 0.7175 (0.033) | 0.4234 (0.045) |
| MODLEM | 0.6974 (0.015) | 0.32 (0.03) | 0.6447 (0.038) | 0.6862 (0.037) | 0.4597 (0.042) |
| C4.5 | 0.7088 (0.018) | 0.31 (0.04) | 0.6078 (0.055) | 0.7254 (0.070) | 0.4531 (0.095) |
| RF | 0.7459 (0.014) | 0.37 (0.02) | 0.6384 (0.044) | 0.7388 (0.039) | 0.4872 (0.022) |
| Log. Reg. | 0.7421 (0.009) | 0.35 (0.01) | 0.6374 (0.055) | 0.7282 (0.058) | 0.4715 (0.050) |
| SVM | 0.7352 (0.009) | 0.35 (0.02) | 0.6526 (0.050) | 0.7132 (0.040) | 0.4056 (0.034) |
Selected decision rules from the CRSA using MODLEM and the VC-DRSA using VC-DomLEM
| RHS | |||
|---|---|---|---|
| CRSA rules using MODLEM | LHS | ||
| 1. If | 969 | 593 (61 %) | 376 (39 %) |
| 2. If | 1016 | 399 (39 %) | 617 (61 %) |
| 3. If | 465 | 119 (26 %) | 346 (74 %) |
| 4. If | 47 | 11 (23 %) | 36 (77 %) |
| VC-DRSA rules using VC-DomLEM | |||
| 5. If | 51 | 4 (8 %) | 47 (92 %) |
| 6. If | 8 | 8 (100 %) | 0 (0 %) |
aage_score: 0=(age≤44)
bhrt_score: 0=(50≤hrt≤99)
cresp_score: 6=(25≤resp≤34)
dwbc_score: 5=((1≤wbc≤2.9) or (wbc≥25))
ecrea_score: ≥4=(crea≥1.5)
fsod_score: ≥2=((sod≤134) or (sod≥155))