Literature DB >> 35212634

Automatically Explaining Machine Learning Predictions on Severe Chronic Obstructive Pulmonary Disease Exacerbations: Retrospective Cohort Study.

Siyang Zeng1, Mehrdad Arjomandi2,3, Gang Luo1.   

Abstract

BACKGROUND: Chronic obstructive pulmonary disease (COPD) is a major cause of death and places a heavy burden on health care. To optimize the allocation of precious preventive care management resources and improve the outcomes for high-risk patients with COPD, we recently built the most accurate model to date to predict severe COPD exacerbations, which need inpatient stays or emergency department visits, in the following 12 months. Our model is a machine learning model. As is the case with most machine learning models, our model does not explain its predictions, forming a barrier for clinical use. Previously, we designed a method to automatically provide rule-type explanations for machine learning predictions and suggest tailored interventions with no loss of model performance. This method has been tested before for asthma outcome prediction but not for COPD outcome prediction.
OBJECTIVE: This study aims to assess the generalizability of our automatic explanation method for predicting severe COPD exacerbations.
METHODS: The patient cohort included all patients with COPD who visited the University of Washington Medicine facilities between 2011 and 2019. In a secondary analysis of 43,576 data instances, we used our formerly developed automatic explanation method to automatically explain our model's predictions and suggest tailored interventions.
RESULTS: Our method explained the predictions for 97.1% (100/103) of the patients with COPD whom our model correctly predicted to have severe COPD exacerbations in the following 12 months and the predictions for 73.6% (134/182) of the patients with COPD who had ≥1 severe COPD exacerbation in the following 12 months.
CONCLUSIONS: Our automatic explanation method worked well for predicting severe COPD exacerbations. After further improving our method, we hope to use it to facilitate future clinical use of our model. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.2196/13783. ©Siyang Zeng, Mehrdad Arjomandi, Gang Luo. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 25.02.2022.

Entities:  

Keywords:  chronic obstructive pulmonary disease; forecasting; machine learning; patient care management

Year:  2022        PMID: 35212634      PMCID: PMC8917430          DOI: 10.2196/33043

Source DB:  PubMed          Journal:  JMIR Med Inform


Introduction

Background

Chronic obstructive pulmonary disease (COPD) is a leading cause of death [1] and affects 6.5% of American adults [2]. In the United States, COPD leads to 0.7 million inpatient stays and 1.5 million emergency department (ED) visits every year [2]. Severe COPD exacerbations are exacerbations that need inpatient stays or ED visits [3]. These exacerbations often result in irreversible deterioration in health status and lung function [4-9] and account for 90.3% of the US $32.1 billion total annual medical costs of the United States associated with COPD [2,10]. Many of these exacerbations, which include 47% of inpatient stays and many ED visits because of COPD, are regarded as preventable with suitable outpatient care [3,11]. To reduce severe COPD exacerbations, many health care systems and health plans use predictive models to identify high-risk patients [12] for preventive care management [13]. Once a patient is enrolled in the care management program, care managers will regularly follow up with the patient on the phone to assess the patient’s health status and help schedule health and related services. For patients with COPD, successful care management can cut up to 40% of their inpatient stays [14] and 27% of their ED visits [15]. As a care management program can take ≤3% of patients because of resource limits [16], the effectiveness of the program depends critically on the performance of the predictive model that is used. To optimize the allocation of precious care management resources and improve the outcomes for high-risk patients with COPD, we recently built the most accurate model to date to predict severe COPD exacerbations in the following 12 months [17]. Our model achieved an area under the receiver operating characteristic curve of 0.866, a sensitivity of 56.6% (103/182), and a specificity of 91.17% (6698/7347). In comparison, to the best of our knowledge, each published prior model for this prediction target [18-51] had an area under the receiver operating characteristic curve ≤0.809 and a sensitivity <50% when the specificity was set at approximately 91%. Our model is based on the machine learning algorithm of extreme gradient boosting (XGBoost) [52]. As is the case with most machine learning models, our model does not explain its predictions, forming a barrier for clinical use [53]. Offering explanations is essential for care managers to make sense of and trust the model’s predictions to make care management enrollment decisions and identify suitable interventions. Currently, there is no consensus on what explanation means for machine learning predictions. In this paper, by explaining the prediction that a machine learning model makes on a patient, we mean to find ≥1 rule whose left-hand side is fulfilled by the patient and whose right-hand side is consistent with the prediction. Previously, we developed a method to automatically provide rule-type explanations for any machine learning model’s predictions on tabular data and suggest tailored interventions with no loss of model performance [54-58]. This method has been tested before for asthma outcome prediction but not for COPD outcome prediction.

Objective

The goal of this particular study is to assess the generalizability of our automatic explanation method for predicting severe COPD exacerbations. After further improving our method in the future, our eventual goal is that care managers can use our method to make COPD care management enrollment and intervention decisions more quickly and reliably.

Methods

Ethics Approval and Study Design

The institutional review board of the University of Washington Medicine (UWM) approved this retrospective cohort study (STUDY00000118) using administrative and clinical data.

Patient Population

In Washington state, the UWM is the largest academic health care system. The enterprise data warehouse of the UWM contains administrative and clinical data from 12 clinics and 3 hospitals. This study used the same patient cohort as our previous predictive modeling study [17]. The patient cohort included all patients with COPD who visited the UWM facilities between 2011 and 2019. As adapted from the literature [59-62], a patient was deemed to have COPD if the patient was aged at least 40 years and met at least one of the following criteria: The patient had “an outpatient visit diagnosis code of COPD (International Classification of Diseases, Ninth Revision (ICD-9): 491.22, 491.21, 491.9, 491.8, 493.2x, 492.8, 496; International Classification of Diseases, Tenth Revision (ICD-10): J42, J41.8, J44.*, J43.*) followed by ≥1 prescription of long-acting muscarinic antagonist (aclidinium, glycopyrrolate, tiotropium, and umeclidinium) within 6 months” The patient had “≥1 ED or ≥2 outpatient visit diagnosis codes of COPD (International Classification of Diseases, Ninth Revision: 491.22, 491.21, 491.9, 491.8, 493.2x, 492.8, 496; International Classification of Diseases, Tenth Revision: J42, J41.8, J44.*, J43.*)” The patient had “≥1 inpatient stay discharge having a principal diagnosis code of COPD (International Classification of Diseases, Ninth Revision: 491.22, 491.21, 491.9, 491.8, 493.2x, 492.8, 496; International Classification of Diseases, Tenth Revision: J42, J41.8, J44.*, J43.*)” The patient had “≥1 inpatient stay discharge having a principal diagnosis code of respiratory failure (International Classification of Diseases, Ninth Revision: 518.82, 518.81, 799.1, 518.84; International Classification of Diseases, Tenth Revision: J96.0*, J80, J96.9*, J96.2*, R09.2) and a secondary diagnosis code of acute COPD exacerbation (International Classification of Diseases, Ninth Revision: 491.22, 491.21, 493.22, 493.21; International Classification of Diseases, Tenth Revision: J44.1, J44.0)” [17]. We used one exclusion criterion: when calculating the data instances in a given year, the patients who died or had no encounter at the UWM during that year were excluded.

Data Set

This study used the same structured data set as our previous predictive model paper [17]. The data set contained the administrative and clinical data of the patient cohort’s encounters at the 12 UWM clinics and 3 UWM hospitals between 2011 and 2020.

Prediction Target (Dependent or Outcome Variable)

This study used the same prediction target as our previous predictive model [17]. For a patient with COPD and ≥1 encounter at the UWM in a particular year (index year), we used patient data up to the end of the year to predict the outcome—whether the patient would have ≥1 severe COPD exacerbation in the following 12 months. A severe COPD exacerbation is defined as an inpatient stay or an ED visit with a principal diagnosis of COPD (International Classification of Diseases, Ninth Revision: 491.22, 491.21, 491.9, 491.8, 493.2x, 492.8, 496; International Classification of Diseases, Tenth Revision: J42, J41.8, J44.*, J43.*).

Data Preprocessing, Predictive Model, and Features (Independent Variables)

We applied the same methods as in our previous predictive model paper [17] to perform data preprocessing. Using the upper and lower bounds provided by a clinical expert in our team, as well as the upper and lower bounds from the Guinness World Records, we pinpointed the biologically implausible values, marked them missing, and normalized each numerical feature. Our model used 229 features and the XGBoost classification algorithm [52] to make predictions. As listed in the second table in the web-based multimedia appendix of our previous paper [17], these features were calculated on the attributes in our structured data set and covered various aspects such as vital signs, diagnoses, visits, procedures, medications, laboratory tests, and patient demographics. An example feature is the number of days since the patient had the last diagnosis of acute COPD exacerbation. Each input data instance to the predictive model contained these 229 features, corresponded to a distinct patient and index year pair, and was used to predict the outcome of the patient in the following 12 months. As in our previous predictive model paper [17], the cutoff threshold for binary classification was set at the top 10% of patients with the largest predicted risk. A care management program can take ≤3% of patients because of resource limits [16]. After using our model to identify the top 10% of patients with the largest predicted risk and using our automatic explanation method to explain the predictions, care managers could review patient charts, consider factors such as social dimensions, and choose ≤3% of patients for care management enrollment. A value of 10% was chosen to strike a balance between covering a large percentage of patients who would have ≥1 severe COPD exacerbation in the following 12 months and keeping the care managers’ workload manageable.

Review of Our Automatic Explanation Method

Overview

Previously, we developed a method to automatically provide rule-type explanations for any machine learning model’s predictions on tabular data and suggest tailored interventions with no loss of model performance [54-58]. When creating the automatic explanation function before the prediction time, our method requires ≥1 expert in the function’s design team to manually provide some information, such as marking the feature–value pairs that could have a positive correlation with the bad outcome value and compiling interventions for these feature–value pair items. This can typically be performed in a few man-hours. Once this information is obtained and stored in the function’s knowledge base, our method can automatically explain the machine learning model’s predictions and suggest tailored interventions at the prediction time.

Main Idea

Our automatic explanation method [54-58] uses 2 models at the same time to separate making predictions and providing explanations. Each model plays a different role. The first model is used to predict the outcome. This model can be any model that takes continuous and categorical features as its inputs and is typically chosen to be the model that performs the best at making predictions. The second model comprises class-based association rules [63,64] mined from the training set. We use the second model to explain the first model’s predictions rather than to make predictions. After we convert each continuous feature into ≥1 categorical feature via automatic discretization [63,65], the association rules are mined using the Apriori algorithm, whereas other standard methods such as frequent pattern growth can also be used [64]. Every rule shows that a feature pattern links to a value z of the outcome variable in the form of: Here, each item p (1≤i≤k) is a feature-value pair (x, c), indicating that feature x has a value c if c is a value or a value within c if c is a range. The values of k and z can vary by rules. For the binary classification of good versus bad outcomes, z is usually the bad outcome value. The rule indicates that a patient’s outcome tends to take the value z if the patient satisfies all of p, p,..., and p. The following is an example of a rule: The patient’s last diagnosis of acute COPD exacerbation was from the past 81.4 days AND the patient’s COPD reliever prescriptions in the past year included >10 distinct medications → The patient will probably have at least one severe COPD exacerbation in the following 12 months.

Mining and Pruning Rules

Each rule has two quality measures: commonality and confidence. For a rule: its commonality is defined as the percentage of data instances satisfying p, p,..., and p among all the data instances linked to z. Its confidence is defined as the percentage of data instances linked to z among all the data instances satisfying p, p,..., and p. Commonality measures the coverage of a rule within the context of z. Confidence measures the precision of a rule. The process of mining and pruning rules is controlled by five parameters: the number of top features that are used to form rules, upper limit of the number of items on the left-hand side of a rule, lower limit of confidence, lower limit of commonality, and upper limit of the confidence difference. Our method uses rules that each contains at most the upper limit number of items on its left-hand side, has a commonality that is greater than or equal to the lower limit of commonality, and has a confidence that is greater than or equal to the lower limit of confidence. Our automatic explanation method is intended to be used for real-time clinical decision support. Once the first model provides its predicted outcome of a patient, we need to use the second model to provide automatic explanations for the prediction quickly, ideally within a subsecond. For this purpose, we need to control the number of association rules in the second model to help reduce the overhead of retrieving and ranking the relevant rules at the prediction time. We used the following three techniques to cut the number of rules: Some machine learning algorithms, such as XGBoost [52], automatically calculate the importance value of each feature. When the data set included many features, we used only the top few features in the first model with the highest importance values to form rules. Usually, we set the number of top features to be used to the maximum possible number without making the association rule mining process run out of memory. A rule r was dropped if there exists another rule r satisfying three conditions: r and r have the same value on their right-hand sides; the items on the left-hand side of r are a proper subset of the items on the left-hand side of r (ie, r is more general than r); and the confidence of r is greater than or equal to the confidence of r− the upper limit of the confidence difference. All distinct feature–value pairs were examined and labeled by a clinical expert in the automatic explanation function’s design team. When forming rules, we used only those feature–value pairs that the clinical expert deemed could have a positive correlation with the bad outcome value. For every feature-value pair item used to form association rules, a clinical expert in the automatic explanation function’s design team compiled ≥0 intervention. An item is termed actionable if it is associated with ≥1 intervention. These interventions are automatically attached to the rules whose left-hand sides contain this item. A rule is termed actionable if its left-hand side contains ≥1 actionable item and, in turn, is associated with ≥1 intervention. In theory, for each combination of feature–value pair items that appears on the left-hand side of ≥1 mined rule, the clinical expert could compile additional interventions to be automatically attached to the rules whose left-hand sides contain this combination if these interventions have not already been compiled for any individual feature–value pair item in the combination. In practice, we have not needed to do this for predicting severe COPD exacerbations, whereas such a need could occur in some other clinical prediction tasks in the future.

Explaining the Predictions

For each patient predicted by the first model to have a bad outcome, we explained the prediction by presenting the association rules in the second model whose left-hand sides are fulfilled by the patient and whose right-hand sides have the bad outcome value. The rules were sorted using the method given in our paper [57]. This method incorporates 5 factors into a rule-scoring function, striking a balance among them. These factors include confidence, commonality, number of items on the left-hand side of the rule, whether the rule is actionable, and the degree of information redundancy with the higher-ranked rules. The rules are ranked based on the computed scores in an iterative fashion. Every rule offers an explanation for why the patient is predicted to have a bad outcome. For each actionable rule that is presented, the associated interventions are shown next to it. This helps the user of the automatic explanation function pinpoint suitable interventions for the patient. Typically, the rules in the second model provide common reasons for a patient to have a bad outcome. Although some patients could have bad outcomes because of rare reasons not covered by these rules, the second model usually explains most, although not all, of the bad outcomes correctly predicted by the first model.

Parameter Setting

Our model [17] used 229 features to predict patient outcomes. In this study, we used the top 80 features that our model ranked with the highest importance values to form association rules. Regardless of whether all 229 features or only the top 80 features were used, our model had the same area under the receiver operating characteristic curve of 0.866. As in our prior study on automatically explaining predictions of asthma outcomes on the UWM data [55], we set the upper limit of the number of items on the left-hand side of a rule to 5, the lower limit of commonality to 1%, and the lower limit of confidence to 50%. The last 2 values were commonly used to mine association rules [63], whereas commonality was essentially support computed on all the data instances linked to the bad outcome [54]. The first value struck a balance between the explanation power of our automatic explanation method and not making the rules too complex to understand. To set the upper limit value of the confidence difference, we plotted the number of association rules remaining from the rule pruning process versus the upper limit of the confidence difference. Our prior automatic explanation papers [54-56,58] showed that the number of remaining rules first decreased rapidly as the upper limit of the confidence difference increased and then slowly decreased after the upper limit of the confidence difference became large enough. The upper limit value of the confidence difference was set at a point where a further increase in the confidence difference had a minor impact on reducing the number of remaining rules.

Data Analysis

Split of the Training and Test Sets

We adopted the method from our previous predictive model paper [17] to split the entire data set into the training and test sets. As the outcomes were from the following year, the data set contained 9 years of effective data (2011-2019) over the 10-year period of 2011 to 2020. To reflect how our predictive model and our automatic explanation method will be used in clinical practice in the future, we used the 2011 to 2018 data as the training set to train our model and compute the association rules used by our automatic explanation method and the 2019 data as the test set to assess the performance of our model and our automatic explanation method.

Providing Examples of Automatic Explanations

To give the reader a concrete feeling of the results produced by our automatic explanation method, we randomly selected 3 example patients from the patients who were correctly predicted by our model to have ≥1 severe COPD exacerbation in the following 12 months and for whom our automatic explanation method could offer ≥1 explanation. For each example patient, we listed the top 3 explanations given by our automatic explanation method.

Performance Metrics

We examined the performance of our automatic explanation method using the following performance metrics from our prior automatic explanation papers [54-56,58]. Regarding the explanation power of our automatic explanation method, a performance metric is the percentage of patients for whom our method could provide explanations among the patients with COPD who were correctly predicted by our model to have ≥1 severe COPD exacerbation in the following 12 months. We assessed both the average and median number of (actionable) rules matching such a patient. A rule matches a patient if the patient satisfies all items on its left-hand side. As shown by our prior automatic explanation papers [54-56,58], many rules matching a patient often differ from each other by only 1 item on their left-hand sides. In this case, the number of rules greatly exceeded the amount of nonrepeated information contained in these rules. To provide a comprehensive overview of the amount of information provided by the automatic explanations, we examined the distributions of (1) the number of (actionable) rules and (2) the number of unique actionable items in the rules matching a patient who was correctly predicted by our model to have ≥1 severe COPD exacerbation in the following 12 months.

Results

Characteristics of Our Patient Cohort

Each data instance corresponds to a distinct patient and index year pair. Tables 1 and 2 summarize the patient demographic and clinical characteristics of the data instances in the training and test sets, respectively. These 2 sets of characteristics were relatively similar to each other. In the training set, 5.66% (2040/36,047) of the data instances were related to severe COPD exacerbations in the following 12 months. In the test set, 2.42% (182/7529) of the data instances were related to severe COPD exacerbations in the following 12 months. A detailed comparison of these 2 sets of characteristics was provided in our previous predictive model paper [17].
Table 1

The patient demographic and clinical characteristics of the data instances in the training set.

Patient characteristicsData instances related to no severe COPDa exacerbation in the following 12 months (n=34,007), n (%)Data instances related to severe COPD exacerbations in the following 12 months (n=2040), n (%)Data instances (n=36,047), n (%)
Sex
Female14,665 (43.12)749 (36.72)15,414 (42.76)
Male19,342 (56.88)1291 (63.28)20,633 (57.24)
Age (years)
40-6517,574 (51.68)1219 (59.75)18,793 (52.13)
>6516,433 (48.32)821 (40.25)17,254 (47.87)
Race
White26,117 (76.8)1330 (65.2)27,447 (76.14)
Black or African American4271 (12.56)524 (25.69)4795 (13.3)
Asian1948 (5.73)144 (7.06)2092 (5.8)
American Indian or Alaska Native687 (2.02)26 (1.27)713 (1.98)
Native Hawaiian or other Pacific Islander176 (0.52)8 (0.39)184 (0.51)
Other, unknown, or not reported808 (2.37)8 (0.39)816 (2.27)
Ethnicity
Hispanic804 (2.36)53 (2.6)857 (2.38)
Non-Hispanic30,644 (90.11)1941 (95.15)32,585 (90.39)
Unknown or not reported2559 (7.53)46 (2.25)2605 (7.23)
Insurance
Public27,831 (81.84)1767 (86.62)29,598 (82.11)
Private16,679 (49.05)834 (40.88)17,513 (48.58)
Self-paid or charity1765 (5.19)229 (11.23)1994 (5.53)
Number of years since the first encounter related to COPD in the data set
≤328,749 (84.54)1566 (76.76)30,315 (84.1)
>35258 (15.46)474 (23.24)5732 (15.90)
Smoking status
Current smoker15,863 (46.65)1089 (53.38)16,952 (47.03)
Former smoker7022 (20.65)345 (16.91)7367 (20.44)
Never smoker or unknown11,122 (32.7)606 (29.71)11,728 (32.53)
COPD medication prescription
SABAb20,865 (61.36)1684 (82.55)22,549 (62.55)
SAMAc8566 (25.19)1042 (51.08)9608 (26.65)
SABA and SAMA combination6364 (18.71)810 (39.71)7174 (19.9)
LABAd8062 (23.71)842 (41.27)8904 (24.7)
LAMAe9242 (27.18)1001 (49.07)10,243 (28.42)
LABA and LAMA combination386 (1.14)40 (1.96)426 (1.18)
ICSf12,208 (35.9)1119 (54.85)13,327 (36.97)
ICS and LABA combination7544 (22.18)782 (38.33)8326 (23.1)
ICS, LABA, and LAMA combination16 (0.05)0 (0)16 (0.04)
Systemic corticosteroid10,149 (29.84)1144 (56.08)11,293 (31.33)
Phosphodiesterase-4 inhibitor84 (0.25)10 (0.49)94 (0.26)
Comorbidity
Anxiety or depression10,061 (29.59)725 (35.54)10,786 (29.92)
Allergic rhinitis2271 (6.68)174 (8.53)2445 (6.78)
Asthma4377 (12.87)417 (20.44)4794 (13.3)
Diabetes7177 (21.1)446 (21.86)7623 (21.15)
Congestive heart failure5568 (16.37)495 (24.26)6063 (16.82)
Eczema1460 (4.29)98 (4.8)1558 (4.32)
Hypertension17,211 (50.61)1150 (56.37)18,361 (50.94)
Gastroesophageal reflux6655 (19.57)507 (24.85)7162 (19.87)
Ischemic heart disease6934 (20.39)486 (23.82)7420 (20.58)
Obesity3232 (9.5)255 (12.5)3487 (9.67)
Lung cancer742 (2.18)52 (2.55)794 (2.2)
Sleep apnea2926 (8.6)253 (12.4)3179 (8.82)
Sinusitis1299 (3.82)83 (4.07)1382 (3.83)

aCOPD: chronic obstructive pulmonary disease.

bSABA: short-acting beta-2 agonist.

cSAMA: short-acting muscarinic antagonist.

dLABA: long-acting beta-2 agonist.

eLAMA: long-acting muscarinic antagonist.

fICS: inhaled corticosteroid.

Table 2

The patient demographic and clinical characteristics of the data instances in the test set.

Patient characteristicsData instances related to no severe COPDa exacerbation in the following 12 months (n=7347), n (%)Data instances related to severe COPD exacerbations in the following 12 months (n=182), n (%)Data instances (n=7529), n (%)
Sex
Female3242 (44.13)47 (25.8)3289 (43.68)
Male4105 (55.87)135 (74.2)4240 (56.32)
Age (years)
40-653324 (45.24)118 (64.8)3442 (45.72)
>654023 (54.76)64 (35.2)4087 (54.28)
Race
White5682 (77.34)111 (61.0)5793 (76.94)
Black or African American839 (11.42)57 (31.3)896 (11.9)
Asian432 (5.88)7 (3.9)439 (5.83)
American Indian or Alaska Native151 (2.06)5 (2.7)156 (2.07)
Native Hawaiian or other Pacific Islander51 (0.69)2 (1.1)53 (0.71)
Other, unknown, or not reported192 (2.61)0 (0.0)192 (2.55)
Ethnicity
Hispanic185 (2.52)3 (1.6)188 (2.5)
Non-Hispanic6909 (94.04)179 (98.4)7088 (94.14)
Unknown or not reported253 (3.44)0 (0)253 (3.36)
Insurance
Public6722 (91.49)179 (98.4)6901 (91.66)
Private4532 (61.69)110 (60.4)4642 (61.65)
Self-paid or charity499 (6.79)41 (22.5)540 (7.17)
Number of years since the first encounter related to COPD in the data set
≤35073 (69.05)81 (44.5)5154 (68.46)
>32274 (30.95)101 (55.5)2375 (31.54)
Smoking status
Current smoker3781 (51.46)112 (61.5)3893 (51.71)
Former smoker1242 (16.91)25 (13.7)1267 (16.83)
Never smoker or unknown2324 (31.63)45 (24.7)2369 (31.47)
COPD medication prescription
SABAb4083 (55.57)158 (86.8)4241 (56.33)
SAMAc1134 (15.43)68 (37.4)1202 (15.96)
SABA and SAMA combination1694 (23.06)115 (63.2)1809 (24.03)
LABAd1683 (22.91)77 (42.3)1760 (23.38)
LAMAe1951 (26.56)110 (60.4)2061 (27.37)
LABA and LAMA combination388 (5.28)12 (6.6)400 (5.31)
ICSf2537 (34.53)98 (53.8)2635 (35)
ICS and LABA combination1729 (23.53)75 (41.2)1804 (23.96)
ICS, LABA, and LAMA combination68 (0.93)1 (0.5)69 (0.92)
Systemic corticosteroid2282 (31.06)103 (56.6)2385 (31.68)
Phosphodiesterase-4 inhibitor24 (0.33)2 (1.1)26 (0.35)
Comorbidity
Anxiety or depression2090 (28.45)63 (34.6)2153 (28.6)
Allergic rhinitis396 (5.39)14 (7.7)410 (5.45)
Asthma1053 (14.33)43 (23.6)1096 (14.56)
Diabetes1649 (22.44)40 (22)1689 (22.43)
Congestive heart failure1369 (18.63)43 (23.6)1412 (18.75)
Eczema247 (3.36)11 (6)258 (3.43)
Hypertension3686 (50.17)105 (57.7)3791 (50.35)
Gastroesophageal reflux1396 (19)47 (25.8)1443 (19.17)
Ischemic heart disease1604 (21.83)54 (29.7)1658 (22.02)
Obesity648 (8.82)21 (11.5)669 (8.89)
Lung cancer200 (2.72)3 (1.6)203 (2.7)
Sleep apnea887 (12.07)28 (15.4)915 (12.15)
Sinusitis272 (3.7)7 (3.8)279 (3.71)

aCOPD: chronic obstructive pulmonary disease.

bSABA: short-acting beta-2 agonist.

cSAMA: short-acting muscarinic antagonist.

dLABA: long-acting beta-2 agonist.

eLAMA: long-acting muscarinic antagonist.

fICS: inhaled corticosteroid.

The patient demographic and clinical characteristics of the data instances in the training set. aCOPD: chronic obstructive pulmonary disease. bSABA: short-acting beta-2 agonist. cSAMA: short-acting muscarinic antagonist. dLABA: long-acting beta-2 agonist. eLAMA: long-acting muscarinic antagonist. fICS: inhaled corticosteroid. The patient demographic and clinical characteristics of the data instances in the test set. aCOPD: chronic obstructive pulmonary disease. bSABA: short-acting beta-2 agonist. cSAMA: short-acting muscarinic antagonist. dLABA: long-acting beta-2 agonist. eLAMA: long-acting muscarinic antagonist. fICS: inhaled corticosteroid.

The Number of Association Rules

Using the top 80 features ranked with the highest importance values in our predictive model, 7,729,134 association rules were mined from the training set. Figure 1 shows the number of remaining rules versus the upper limit of the confidence difference. The number of remaining rules first rapidly decreased as the upper limit of the confidence difference increased and then slowly decreased after the upper limit of the confidence difference became ≥0.15. We set the upper limit of the confidence difference to the value of 0.15, resulting in 492,803 remaining rules.
Figure 1

The number of remaining association rules versus the upper limit of the confidence difference.

The number of remaining association rules versus the upper limit of the confidence difference. The top 80 features totally had 219 distinct feature–value pairs, 141 (64.4%) of which were actionable. A clinical expert on COPD (MA) in our team reviewed all distinct feature–value pairs of the top 80 features and labeled those that could have a positive correlation with severe COPD exacerbations in the following 12 months. After dropping the rules containing any other feature–value pair items, 460,592 rules were left. These rules were all actionable.

Examples of the Produced Automatic Explanations

To give the reader a concrete feeling of the results produced by our automatic explanation method, we randomly selected 3 example patients from the patients who were correctly predicted by our model to have ≥1 severe COPD exacerbation in the following 12 months and for whom our automatic explanation method could offer ≥1 explanation. Tables 3-5 show the top 3 explanations that our automatic explanation method provided for every example patient.
Table 3

The top 3 association rules generated for the first example patient.

Rank, rule, and item on the rule’s left-hand sideInterpretation of the itemInterventions linked to the item
Rank 1: The patient’s last diagnosis of acute COPDa exacerbation was from the past 81.4 days AND the patient’s COPD reliever prescriptions in the past year included >10 distinct medications the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient’s last diagnosis of acute COPD exacerbation was from the past 81.4 daysHaving a recent acute COPD exacerbation shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient’s COPD reliever prescriptions in the past year included >10 distinct medicationsUsing many rescue medications for COPD indicates ineffective regimen, poor treatment adherence, or poor control of the disease.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

Rank 2: The patient had between 8 and 19 diagnoses of acute COPD exacerbation in the past year AND the patient’s last COPD diagnosis was from the past 25.6 days AND the patient’s nebulizer medication prescriptions in the past year included >11 medications the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient had between 8 and 19 diagnoses of acute COPD exacerbation in the past yearFrequently having acute COPD exacerbations shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

The patient’s last COPD diagnosis was from the past 25.6 daysHaving a recent COPD diagnosis associated with an EDb visit or an inpatient stay indicates poor control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

The patient’s nebulizer medication prescriptions in the past year included >11 medicationsUsing many medications for COPD with a nebulizer indicates an ineffective regimen, poor treatment adherence, or poor control of the disease. Using nebulizer medications could be a sign of having a mild exacerbation or more severe COPD.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

Rank 3: The patient’s average length of an inpatient stay in the past year was between 0.61 and 7.66 days AND the patient’s last outpatient visit on COPD occurred in the past 82.4 days AND the patient’s nebulizer medication prescriptions in the past year included >11 medications AND the patient’s maximum percentage of neutrophils in the past year was >76.5% the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient’s average length of an inpatient stay in the past year was between 0.61 and 7.66 daysHaving a long inpatient stay can indicate that the patient has a more severe disease or comorbidities.

Ensure that the patient has a primary care provider

Assess the need for home care or referral to a skilled nursing facility

Provide education on managing COPD and resources for care

Ensure use of appropriate COPD medications

The patient’s last outpatient visit on COPD occurred in the past 82.4 daysIf the patient’s last outpatient visit on COPD was for acute problems with COPD, it could indicate poor control of the disease and a need for additional support to control COPD.

Provide education on managing COPD and resources for care

Ensure use of appropriate COPD medications

Assess the need for home care or pulmonary rehabilitation

The patient’s nebulizer medication prescriptions in the past year included >11 medicationsUsing many medications for COPD with a nebulizer indicates an ineffective regimen, poor treatment adherence, or poor control of the disease. Using nebulizer medications could be a sign of having a mild exacerbation or more severe COPD.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

The patient’s maximum percentage of neutrophils in the past year was >76.5%Having a large percentage of neutrophils can indicate infections or distress.

Evaluate the respiratory system, for example, using radiographic imaging

Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin

Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer

aCOPD: chronic obstructive pulmonary disease.

bED: emergency department.

Table 5

The top 3 association rules generated for the third example patient.

Rank, rule, and item on the rule’s left-hand sideInterpretation of the itemInterventions linked to the item
Rank 1: The patient had between 24 and 49 COPDa diagnoses in the past year AND the patient had >11 nebulizer medication prescriptions in the past year AND the patient is Black or an African American→ the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient had between 24 and 49 COPD diagnoses in the past yearFrequently receiving COPD diagnoses indicates poor control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

The patient had >11 nebulizer medication prescriptions in the past yearUsing many medications for COPD with a nebulizer indicates an ineffective regimen, poor treatment adherence, or poor control of the disease. Using nebulizer medications could be a sign of having a mild exacerbation or more severe COPD.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

The patient is a Black or an African AmericanPoor respiratory outcomes and low quality of life are more prevalent in Black and African American patients.

Ensure that the patient has needed resources and access to care

Assess the need for social work or home care

Rank 2: The patient’s last EDb visit related to COPD occurred no less than 27.2 days ago and no more than 94.3 days ago AND the patient’s COPD medication prescriptions in the past year included between 13 and 16 distinct medications AND the patient’s last outpatient visit on COPD occurred no less than 82.4 days ago and no more than 327.6 days ago AND the patient’s maximum percentage of neutrophils in the past year was >76.5% the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient’s last ED visit related to COPD occurred no less than 27.2 days ago and no more than 94.3 days agoHaving a recent ED visit related to COPD shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient’s COPD medication prescriptions in the past year included between 13 and 16 distinct medicationsUsing many COPD medications can indicate an ineffective regimen, poor treatment adherence, or poor control of the disease.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as using a pill organizer or providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

The patient’s last outpatient visit on COPD occurred no less than 82.4 days ago and no more than 327.6 days agoIf the patient’s last outpatient visit on COPD was for acute problems with COPD, it could indicate poor control of the disease and a need for additional support to control COPD.

Provide education on managing COPD and resources for care

Ensure use of appropriate COPD medications

Assess the need for home care

The patient’s maximum percentage of neutrophils in the past year was >76.5%Having a large percentage of neutrophils can indicate infections or distress.

Evaluate the respiratory system, for example, using radiographic imaging

Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin

Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer

Rank 3: The patient had between 8 and 19 diagnoses of acute COPD exacerbation in the past year AND the relative decline of the patient’s BMI in the past year was >0.44% AND the patient’s total length of inpatient stays in the past year was >0.6 day the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient had between 8 and 19 diagnoses of acute COPD exacerbation in the past yearFrequently having acute COPD exacerbations shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The relative decline of the patient’s BMI in the past year was >0.44%Having an unintentional weight loss can indicate comorbidities or other complications, such as malnutrition or metabolic syndrome.

Optimize nutritional status to address low BMI

Provide dietary education and advise appropriate exercise

The patient’s total length of inpatient stays in the past year was >0.6 dayHaving a long inpatient stay can indicate that the patient has a more severe disease or comorbidities. Having frequent inpatient stays shows a need for better control of the disease.

Ensure that the patient has a primary care provider

Assess the need for home care or referral to a skilled nursing facility

Provide education on managing COPD and resources for care

Ensure use of appropriate COPD medications

aCOPD: chronic obstructive pulmonary disease.

bED: emergency department.

The top 3 association rules generated for the first example patient. Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Ensure that the patient has a primary care provider Assess the need for home care or referral to a skilled nursing facility Provide education on managing COPD and resources for care Ensure use of appropriate COPD medications Provide education on managing COPD and resources for care Ensure use of appropriate COPD medications Assess the need for home care or pulmonary rehabilitation Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Evaluate the respiratory system, for example, using radiographic imaging Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer aCOPD: chronic obstructive pulmonary disease. bED: emergency department. The top 3 association rules generated for the second example patient. Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Optimize nutritional status to address low BMI Provide dietary education and advise appropriate exercise Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Evaluate for cardiopulmonary causes of hypoxemia Consider nighttime oximetry or sleep study to evaluate for nighttime hypoxemia or sleep apnea Assess the need for home oxygen or nighttime noninvasive ventilation Evaluate the respiratory system, for example, using radiographic imaging Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer Provide education on the health risks of smoking Suggest and provide support for smoking cessation aCOPD: chronic obstructive pulmonary disease. bED: emergency department. cSPO2: peripheral capillary oxygen saturation. The top 3 association rules generated for the third example patient. Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Ensure that the patient has needed resources and access to care Assess the need for social work or home care Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Simplify COPD medications to once-a-day formulations or combination medications Address concerns for adverse interactions between medications Provide education on the correct use of COPD medications or inhalers Consider strategies to improve medication adherence such as using a pill organizer or providing reminders for taking medications in time Medication reconciliation review by a physician or a pharmacist Provide education on managing COPD and resources for care Ensure use of appropriate COPD medications Assess the need for home care Evaluate the respiratory system, for example, using radiographic imaging Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer Provide education on managing COPD and more frequent follow-ups Ensure use of appropriate COPD medications Consider influenza shot, pneumonia vaccination, or smoking cessation Assess the need for pulmonary rehabilitation or home care Ensure that the patient has a primary care provider or is referred to a specialist Optimize nutritional status to address low BMI Provide dietary education and advise appropriate exercise Ensure that the patient has a primary care provider Assess the need for home care or referral to a skilled nursing facility Provide education on managing COPD and resources for care Ensure use of appropriate COPD medications aCOPD: chronic obstructive pulmonary disease. bED: emergency department.

Performance of the Automatic Explanation Method

The automatic explanation method was evaluated using the test set. Our method explained the predictions for 97.1% (100/103) of the patients with COPD who were correctly predicted by our model to have severe COPD exacerbations in the following 12 months. For each such patient, our method gave an average of 13,880.19 (SD 18,700.60) explanations covering 39.80 (SD 11.98) distinct actionable items, a median of 4474 explanations, and a median of 41 distinct actionable items covered by the explanations. Each explanation corresponds to an association rule. For the patients with COPD who were correctly predicted by our model to have severe COPD exacerbations in the following 12 months, Figure 2 shows the distribution of the number of actionable rules matching a patient. This distribution is highly skewed toward the left with a long tail. As the number of actionable rules matching a patient increases, the frequency of cases in the corresponding equal-width bucket tends to rapidly decrease in a nonmonotonic way. The largest number of actionable rules matching a patient is rather large (111,062). Nevertheless, only 1 patient matches so many rules.
Figure 2

The distribution of the number of actionable rules matching a patient who was correctly predicted by our model to have ≥1 severe chronic obstructive pulmonary disease exacerbation in the following 12 months.

The distribution of the number of actionable rules matching a patient who was correctly predicted by our model to have ≥1 severe chronic obstructive pulmonary disease exacerbation in the following 12 months. For the patients with COPD who were correctly predicted by our model to have severe COPD exacerbations in the following 12 months, Figure 3 shows the distribution of the number of unique actionable items in the rules matching a patient. The largest number of unique actionable items in the rules matching a patient is 57, which is much smaller than the largest number of actionable rules matching a patient. As shown in Tables 3-5, the same intervention could be linked to ≥1 distinct actionable item in the rules matching a patient.
Figure 3

The distribution of the number of unique actionable items in the rules matching a patient who was correctly predicted by our model to have ≥1 severe chronic obstructive pulmonary disease exacerbation in the following 12 months.

The distribution of the number of unique actionable items in the rules matching a patient who was correctly predicted by our model to have ≥1 severe chronic obstructive pulmonary disease exacerbation in the following 12 months. Our automatic explanation method explained the predictions for 73.6% (134/182) of the patients with COPD who had ≥1 severe COPD exacerbation in the following 12 months.

Discussion

Principal Findings

Our automatic explanation method generalizes well in predicting severe COPD exacerbations. Our method explained the predictions for 97.1% (100/103) of the patients with COPD who were correctly predicted by our model to have severe COPD exacerbations in the following 12 months. This percentage is comparable with the corresponding percentages of 87.6% to 97.6% that we previously obtained to explain the predictions of asthma outcomes [54-56]. This percentage is sufficiently large to apply our automatic explanation method to routine clinical use for COPD management. After further improving the performance of our model for predicting severe COPD exacerbations and our automatic explanation method, we hope our model can be used in conjunction with our automatic explanation method to provide decision support for allocating COPD care management resources and improve outcomes. Our automatic explanation method explained the predictions for 73.6% (134/182) of the patients with COPD who had ≥1 severe COPD exacerbation in the following 12 months. This percentage is <97.1% (100/103), the success rate at which our method explained the predictions for the patients with COPD whom our model correctly predicted to have severe COPD exacerbations in the following 12 months. This seems likely to be because of the correlation between the prediction results of our model and the association rules. Among the patients whom our model correctly predicted to have severe COPD exacerbations in the following 12 months, many seem to be easy cases for using association rules to explain the outcomes. Among the patients who had severe COPD exacerbations but were incorrectly predicted by our model to have no severe COPD exacerbation in the following 12 months, many seem to be difficult cases for any model to correctly predict or explain the outcomes.

Related Work

Several years ago, we designed our automatic explanation method to handle relatively balanced data and demonstrated our method for predicting the diagnosis of type 2 diabetes [58]. Later, other researchers demonstrated our method on several other clinical predictive modeling tasks, such as predicting lung transplantation or mortality in patients with cystic fibrosis [66] and predicting cardiac mortality in patients with cancer [67]. Recently, we extended our automatic explanation method so it can also handle imbalanced data, where one value of the outcome variable appears much less often than another. We demonstrated our extended method for predicting hospital encounters for asthma in patients with asthma in 3 health care systems separately [54-56]. Imbalanced data also appear in the case of predicting severe COPD exacerbations, which is the use case of this paper. As discussed in the reviews [68,69], other researchers have developed a variety of methods to automatically explain the predictions made by machine learning models. Many of these methods lower the model performance or work only for a specific machine learning algorithm. Most of these methods provide explanations that are not of rule types. More importantly, none of these methods can automatically suggest tailored interventions, which is desired in many clinical applications. In comparison, our automatic explanation method has four properties that make it particularly suitable for providing clinical decision support: (1) it provides rule-type explanations, which are easier to understand than other kinds of explanations; (2) it works for any machine learning model on tabular data; (3) it does not lower model performance; and (4) it is the only automatic explanation method that can automatically suggest tailored interventions. Rudin et al [70], Ribeiro et al [71], Rasouli et al [72], Pastor and Baralis [73], Guidotti et al [74], and Panigutti et al [75] used rules to automatically explain machine learning predictions. These rules are not known before the time of prediction, making it impossible to use them to automatically suggest tailored interventions at the time of prediction. Except for the case of Pastor and Baralis [73], these rules are not association rules. In comparison, our automatic explanation method mines association rules before the time of prediction and uses them to automatically suggest tailored interventions at the time of prediction.

Limitations

This study has 5 limitations that are worth addressing in future work. First, this study used data from a single health care system. It is worth assessing our automatic explanation method’s performance in explaining the predictions of severe COPD exacerbations in other health care systems. Second, this study focuses on the prediction of one outcome—whether a patient with COPD will have ≥1 severe COPD exacerbation in the following 12 months. It is worth assessing our automatic explanation method’s performance in explaining the predictions of other outcomes. Third, our automatic explanation method currently works for explaining the predictions that traditional non–deep-learning machine learning algorithms make on tabular data. It is worth investigating the extension of our method to handle the predictions made by deep learning models on longitudinal data [76,77]. Fourth, we currently know no optimal way to present automatic explanations and automatically suggested interventions. It is worth investigating an optimal way to present this information based on a user-centered design. Finally, researchers have assessed the impact of automatic explanations on decision-making for several other applications [78-82] before but not for care management. For the automatic explanation function for predicting severe COPD exacerbations presented in this paper, it is worth assessing the impact of showing automatic explanations and automatically suggested interventions on care management enrollment and intervention decisions.

Conclusions

Our automatic explanation method generalizes well in predicting severe COPD exacerbations. After further improving the performance of our model for predicting severe COPD exacerbations and our automatic explanation method, we hope our model can be used in conjunction with our automatic explanation method to provide decision support for allocating COPD care management resources and improve outcomes.
Table 4

The top 3 association rules generated for the second example patient.

Rank, rule, and item on the rule’s left-hand sideInterpretation of the itemInterventions linked to the item
Rank 1: The patient’s last diagnosis of acute COPDa exacerbation was from the past 81.4 days AND the patient had >2 EDb visits in the past 6 months AND the patient’s nebulizer medication prescriptions in the past year included >11 medications the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient’s last diagnosis of acute COPD exacerbation was from the past 81.4 daysHaving a recent acute COPD exacerbation shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient had >2 ED visits in the past 6 monthsUsing the ED indicates poor control of conditions or a lack of access to primary, specialty, or home care.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient’s nebulizer medication prescriptions in the past year included >11 medicationsUsing many medications for COPD with a nebulizer indicates an ineffective regimen, poor treatment adherence, or poor control of the disease. Using nebulizer medications could be a sign of having a mild exacerbation or more severe COPD.

Simplify COPD medications to once-a-day formulations or combination medications

Address concerns for adverse interactions between medications

Provide education on the correct use of COPD medications or inhalers

Consider strategies to improve medication adherence such as providing reminders for taking medications in time

Medication reconciliation review by a physician or a pharmacist

Rank 2: The patient’s maximum BMI in the past year was <22.81 AND the patient’s last ED visit related to COPD occurred no less than 27.2 days ago and no more than 94.3 days ago AND the patient’s average length of stay of an ED visit in the past year was between 0.03 and 0.29 day AND the patient had between 2 and 4 encounters related to acute COPD exacerbation or respiratory failure in the past year→ the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient’s maximum BMI in the past year was <22.81Having an unintentional weight loss can indicate comorbidities or other complications, such as malnutrition or metabolic syndrome.

Optimize nutritional status to address low BMI

Provide dietary education and advise appropriate exercise

The patient’s last ED visit related to COPD occurred no less than 27.2 days ago and no more than 94.3 days agoHaving a recent ED visit related to COPD shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient’s average length of stay of an ED visit in the past year was between 0.03 and 0.29 dayUsing the ED indicates poor control of conditions or a lack of access to primary, specialty, or home care.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient had between 2 and 4 encounters related to acute COPD exacerbation or respiratory failure in the past yearFrequently having acute COPD exacerbations or respiratory failures shows a need for better control of the disease.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

Rank 3: The patient had between 3 and 5 ED visits in the past year AND the patient’s minimum SpO2 c in the past year was between 17% and 89.5% AND the patient’s maximum percentage of neutrophils in the past year was >76.5% AND the patient smoked >0.48 pack of cigarettes per day in the past year the patient will probably have at least one severe COPD exacerbation in the following 12 months
The patient had between 3 and 5 ED visits in the past yearUsing the ED indicates poor control of conditions or a lack of access to primary, specialty, or home care.

Provide education on managing COPD and more frequent follow-ups

Ensure use of appropriate COPD medications

Consider influenza shot, pneumonia vaccination, or smoking cessation

Assess the need for pulmonary rehabilitation or home care

Ensure that the patient has a primary care provider or is referred to a specialist

The patient’s minimum SpO2 in the past year was between 17% and 89.5%Having a low SpO2 indicates worsening of symptoms or other complications such as hypoxemia.

Evaluate for cardiopulmonary causes of hypoxemia

Consider nighttime oximetry or sleep study to evaluate for nighttime hypoxemia or sleep apnea

Assess the need for home oxygen or nighttime noninvasive ventilation

The patient’s maximum percentage of neutrophils in the past year was >76.5%Having a large percentage of neutrophils can indicate infections or distress.

Evaluate the respiratory system, for example, using radiographic imaging

Consider doing diagnostic tests such as viral panel, sputum culture, or procalcitonin

Evaluate other potential morbidities such as cardiovascular disease with an electrocardiogram, echocardiography, or laboratory tests such as brain natriuretic peptide or D-dimer

The patient smoked >0.48 pack of cigarettes per day in the past yearSmoking is a key risk factor for COPD complications.

Provide education on the health risks of smoking

Suggest and provide support for smoking cessation

aCOPD: chronic obstructive pulmonary disease.

bED: emergency department.

cSPO2: peripheral capillary oxygen saturation.

  60 in total

1.  Comparison of clinico-physiologic and CT imaging risk factors for COPD exacerbation.

Authors:  Jung-Wan Yoo; Yoonki Hong; Joon Beom Seo; Eun Jin Chae; Seung Won Ra; Ji-Hyun Lee; Eun Kyung Kim; Seunghee Baek; Tae-Hyung Kim; Woo Jin Kim; Jin Hwa Lee; Sang-Min Lee; Sangyeub Lee; Seong Yong Lim; Tae Rim Shin; Ho Il Yoon; Seung Soo Sheen; Jae Seung Lee; Jin Won Huh; Yeon-Mok Oh; Sang-Do Lee
Journal:  J Korean Med Sci       Date:  2011-11-29       Impact factor: 2.153

2.  Predicting Severe Chronic Obstructive Pulmonary Disease Exacerbations. Developing a Population Surveillance Approach with Administrative Data.

Authors:  Hamid Tavakoli; Wenjia Chen; Don D Sin; J Mark FitzGerald; Mohsen Sadatsafavi
Journal:  Ann Am Thorac Soc       Date:  2020-09

3.  Lower respiratory illnesses promote FEV(1) decline in current smokers but not ex-smokers with mild chronic obstructive pulmonary disease: results from the lung health study.

Authors:  R E Kanner; N R Anthonisen; J E Connett
Journal:  Am J Respir Crit Care Med       Date:  2001-08-01       Impact factor: 21.405

4.  Disease management program for chronic obstructive pulmonary disease: a randomized controlled trial.

Authors:  Kathryn L Rice; Naresh Dewan; Hanna E Bloomfield; Joseph Grill; Tamara M Schult; David B Nelson; Sarita Kumari; Mel Thomas; Lois J Geist; Caroline Beaner; Michael Caldwell; Dennis E Niewoehner
Journal:  Am J Respir Crit Care Med       Date:  2010-01-14       Impact factor: 21.405

5.  Validation of a New Risk Measure for Chronic Obstructive Pulmonary Disease Exacerbation Using Health Insurance Claims Data.

Authors:  Richard H Stanford; Arpita Nag; Douglas W Mapel; Todd A Lee; Richard Rosiello; Francis Vekeman; Marjolaine Gauthier-Loiselle; Mei Sheng Duh; J F Philip Merrigan; Michael Schatz
Journal:  Ann Am Thorac Soc       Date:  2016-07

6.  Predicting Hospitalization Due to COPD Exacerbations in Swedish Primary Care Patients Using Machine Learning - Based on the ARCTIC Study.

Authors:  Björn Ställberg; Karin Lisspers; Kjell Larsson; Christer Janson; Mario Müller; Mateusz Łuczko; Bine Kjøller Bjerregaard; Gerald Bacher; Björn Holzhauer; Pankaj Goyal; Gunnar Johansson
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2021-03-16

7.  A clinical prediction model for hospitalized COPD exacerbations based on "treatable traits".

Authors:  Anthony C A Yii; C H Loh; P Y Tiew; Huiying Xu; Aza A M Taha; Jansen Koh; Jessica Tan; Therese S Lapperre; Antonio Anzueto; Augustine K H Tee
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2019-03-27

8.  Generalizability of an Automatic Explanation Method for Machine Learning Prediction Results on Asthma-Related Hospital Visits in Patients With Asthma: Quantitative Analysis.

Authors:  Gang Luo; Claudia L Nau; William W Crawford; Michael Schatz; Robert S Zeiger; Corinna Koebnick
Journal:  J Med Internet Res       Date:  2021-04-15       Impact factor: 5.428

9.  Predicting Acute Exacerbations in Chronic Obstructive Pulmonary Disease.

Authors:  Jennifer C Samp; Min J Joo; Glen T Schumock; Gregory S Calip; A Simon Pickard; Todd A Lee
Journal:  J Manag Care Spec Pharm       Date:  2018-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.