| Literature DB >> 35162261 |
Yao Tong1,2,3,4, Beilei Lin1, Gang Chen3,4, Zhenxiang Zhang1.
Abstract
Continuity of care (COC) has been shown to possess numerous health benefits for chronic diseases. Specifically, the establishment of its level can facilitate clinical decision-making and enhanced allocation of healthcare resources. However, the use of a generalizable predictive methodology to determine the COC in patients has been underinvestigated. To fill this research gap, this study aimed to develop a machine learning model to predict the future COC of asthma patients and explore the associated factors. We included 31,724 adult outpatients with asthma who received care from the University of Washington Medicine between 2011 and 2018, and examined 138 features to build the machine learning model. Following the 10-fold cross-validations, the proposed model yielded an accuracy of 88.20%, an average area under the receiver operating characteristic curve of 0.96, and an average F1 score of 0.86. Further analysis revealed that the severity of asthma, comorbidities, insurance, and age were highly correlated with the COC of patients with asthma. This study used predictive methods to obtain the COC of patients, and our excellent modeling strategy achieved high performance. After further optimization, the model could facilitate future clinical decisions, hospital management, and improve outcomes.Entities:
Keywords: asthma; continuity of care; feature engineering; machine learning; predicting; retrospective study
Mesh:
Year: 2022 PMID: 35162261 PMCID: PMC8835449 DOI: 10.3390/ijerph19031237
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
The distributions of the patients and the data instances between 2014 to 2018.
| The Category of Outpatient with Asthma Who Received Care from UWM 1 | Number in 2014 (N = 9635), | Number in 2015 (N = 10,192), | Number in 2016 (N = 11,017), | Number in 2017 (N = 12,151), | Number in 2018 (N = 12,894), |
|---|---|---|---|---|---|
| Returned patients | 4926 (51.13) | 6010 (58.97) | 6453 (58.57) | 7549 (62.13) | 8186 (63.49) |
| New patients | 4709 (48.87) | 4182 (41.03) | 4564 (41.43) | 4602 (37.87) | 4708 (36.51) |
| Same COCI 2 (accuracy) | 4708 (48.86) | 5667 (55.60) | 6041 (54.83) | 6965 (57.32) | 7471 (57.94) |
1 UWM: university of Washington medicine. 2 COCI: continuity of care index.
Figure 1The patient cohort in the study.
The list of candidate features considered in the final model.
| Feature Category | Features |
|---|---|
| Features on patient demographics | Race; age; ethnicity (Hispanic/non-Hispanic); marital status (married, divorced, separated, single, widowed, or partnered); gender; and language. |
| Features that are concerning diagnoses and calculated based on ICD-10 and ICD-9 diagnosis codes | No. of diagnoses of asthma; No. of diagnosis codes concerns ICD-10 and ICD-9; no. of primary asthma diagnoses; no. of years since first diagnosed with asthma in the data set; no. of diagnoses of status asthmaticus; whether the latest diagnoses of asthma is a primary one; the severity of the latest asthma diagnoses; the utmost exacerbation severity among all of the asthma diagnoses; no. of diagnoses of acute asthma; no. of days since the latest asthma diagnosis; the severity of the utmost severity of asthma diagnosis; no. of diagnosis codes of noncompliance with the medication regimen; no. of days since the latest diagnoses with acute asthma or status asthmaticus; the latest diagnoses of asthma that indicate the exacerbation severity (uncomplicated, exacerbation, or asthmaticus); allergic rhinitis; sleep apnea; gastrostomy tube; immunoglobulin A (IgA) deficiency; cystic fibrosis; cirrhosis; chronic obstructive pulmonary disease (COPD); no. of years since first diagnosed with COPD in the data set; vitamin D deficiency; upper respiratory tract infection; congestive heart failure; esophagitis; anxiety or depression; ischemic heart disease; eczema; obesity; paraplegia or hemiplegia; decreased tone; metastatic solid tumor; increased tone; pneumonia; vocal cord dysfunction; psoriasis; anaphylaxis; vasculitis; gastrointestinal obstruction; inflammatory bowel disease; dementia; mental disorder; breathing abnormality like dyspnea; mild liver disease; Alzheimer’s or Parkinson’s disease; pregnancy; myocardial infarction; folate deficiency; gastrointestinal bleeding; malignancy; moderate or severe liver disease; peripheral vascular disease; acquired immunodeficiency syndrome; peptic ulcer disease; cerebrovascular disease; gastroesophageal reflux; substance use; rheumatic disease; renal disease; diabetes without chronic complication; cataract; bronchopulmonary dysplasia; tracheostomy; sinusitis; and family history of asthma. |
| Features concerning medications | The sum of medications ordered; the sum of various medications ordered; no. of medication orders; the sum of medication refills authorized; the sum of asthma medications ordered; the sum of units of medications ordered; no. of medication orders concerning asthma; the sum of asthma medication refills authorized; the sum of various asthma medications ordered; the sum of units of medications ordered concerning asthma; no. of medication prescribers; no. of medication prescribers concerning asthma; the sum of short-acting beta-2 agonists (SABAs) ordered; the sum of units of SABAs ordered; the sum of refills authorized for SABAs; the sum of systemic corticosteroids ordered; the sum of units of systemic corticosteroids ordered; the sum of refills authorized for systemic corticosteroids; no. of reliever orders concerning asthma; the sum of refills authorized for asthma relievers; the sum of relievers ordered concerning asthma; the sum of diverse asthma relievers ordered; the sum of units of relievers ordered concerning asthma that are neither SABAs nor systemic corticosteroids; the sum of units of relievers ordered concerning asthma; the sum of relievers ordered concerning asthma that are neither SABAs nor systemic corticosteroids; the sum of controllers ordered concerning asthma; no. of controller orders concerning asthma; the sum of various asthma controllers ordered; the sum of units of controllers ordered concerning asthma; the sum of refills authorized for asthma controllers; the sum of refills authorized for inhaled corticosteroids; the sum of inhaled corticosteroids ordered; the sum of units of inhaled corticosteroids ordered; the sum of refills authorized for mast cell stabilizers; the sum of ordered for mast cell stabilizers; the sum of units of ordered for mast cell stabilizers; the sum of nebulizer medications ordered; no. of nebulizer medication orders; the sum of various nebulizer medications ordered; the sum of units of ordered concerning nebulizer medications; the sum of refills authorized for nebulizer medications; whether spacer was used; and whether nebulizer was used. |
| Features concerning insurances | Whether the patient enrolled in any public insurance; whether the patient was paid by charity or self-paid; and whether the patient enrolled in any private insurance. We calculate the features related to insurances on the last day of the specific year. |
| Features concerning the visit types of the patient | No. of ED visits; the latest length of stay of ED visit; no. of ED visits concerning asthma; the average ED visit’s length of stay; no. of outpatient visits; no. of all type (ED visit, hospital stay, and outpatient visit) of visits; no. of outpatient visits who diagnosed with asthma as the primary diagnosis; the total length of hospital stay; no. of hospitalizations; the average a hospitalization’s length of stay; the latest visit’s admission type (trauma, urgent, elective, or emergency); the most emergent hospital admission type among all of the visits; no. of prime asthma visits; and the latest visit’s type (ED visit, hospital stay, or outpatient visit). According to our prior paper [ |
| Features concerning appointment and visit status | No. of no shows; and no. of canceled appointments. |
| Features concerning the family location of the patient | Whether the distance from the patient’s home to UMW is less than 5-miles. |
The confusion matrix.
| Prediction Class | Predicted COCI-Class = 1 2 | Predicted COCI-Class = 2 3 | Predicted COCI-Class = 3 4 |
|---|---|---|---|
| COCI 1-class = 1 |
|
|
|
| COCI-class = 2 |
|
|
|
| COCI-class = 3 |
|
|
|
1 COCI: the continuity of care index. 2 class = 1: the low level of COCI. 3 class = 2: the moderate level of COCI. 4 class = 3: the high level of COCI.
The list of candidate features considered in model_2.
| Feature Category | Features |
|---|---|
| Features on patient demographics | Race; age; ethnicity (Hispanic/non-Hispanic); marital status (married, divorced, separated, single, widowed, or partnered); gender; and language. |
The list of candidate features considered in model_3.
| Feature Category | Features |
|---|---|
| Features on patient demographics | Race; age; ethnicity (Hispanic/non-Hispanic); marital status (married, divorced, separated, single, widowed, or partnered); gender; and language. |
| Features that are concerning diagnoses and calculated based on ICD-10 and ICD-9 diagnosis codes (Comorbidity features) | Allergic rhinitis; sleep apnea; cystic fibrosis; COPD; anxiety or depression; eczema; obesity; gastroesophageal reflux; bronchopulmonary dysplasia; and sinusitis. |
The distributions of the COCI category and the data instances between 2011 to 2018.
| Data Category | Data Instances Connecting to Asthma COCI (N = 31,724), |
|---|---|
| Number of class = 1 | 12,905 (40.68%) |
| Number of class = 2 | 1804 (5.69%) |
| Number of class = 3 | 17,015 (53.63%) |
The characteristics of the patients at the UWM from 2011 to 2018.
| Characteristics of Patients | Data Instances ( | Data Instances Connecting to Asthma COCI Class = 1 ( | Data Instances Connecting to Asthma COCI Class = 2 ( | Data Instances Connecting to Asthma COCI Class = 3 ( | |
|---|---|---|---|---|---|
|
| |||||
| <40 | 11,611 (36.60) | 5738 (44.46) | 759 (42.07) | 5114 (30.06) | <0.001 |
| 40 to 65 | 14,839 (46.78) | 5524 (42.81) | 834 (46.23) | 8481 (49.84) | |
| 65+ | 5274 (16.62) | 1643 (12.73) | 211 (11.70) | 3420 (20.10) | |
|
| |||||
| Male | 11,200 (35.30) | 4720 (36.57) | 643 (35.64) | 5837 (34.31) | <0.001 |
| Female | 20,521 (64.69) | 8182 (63.40) | 1161 (64.36) | 11,178 (65.69) | |
| Unknown or not reported | 3 (0.01) | 3(0.02) | 0 (0.00) | 0 (0.00) | |
|
| |||||
| American Indian or Alaska native | 500 (1.58) | 174 (1.35) | 28 (1.55) | 298 (1.75) | <0.001 |
| Asian | 2909 (9.17) | 1150 (8.91) | 174 (9.65) | 158 (0.93) | |
| Black or African American | 2911 (9.18) | 890 (6.90) | 230 (12.75) | 1791 (10.53) | |
| Native Hawaiian or other Pacific islander | 302 (0.95) | 114 (0.88) | 24 (1.33) | 164 (0.96) | |
| Other | 82 (0.26) | 49 (0.38) | 3 (0.17) | 30 (0.18) | |
| White | 22,361 (70.49) | 9058 (70.19) | 1232 (68.29) | 12,071 (70.94) | |
| Unknown or not reported | 2659 (8.38) | 1470 (11.39) | 113 (6.26) | 1076 (6.32) | |
|
| |||||
| Hispanic | 1625 (5.12) | 614 (4.76) | 100 (5.54) | 911 (5.35) | <0.001 |
| Non-Hispanic | 25,783 (81.27) | 9757 (75.61) | 1554 (86.14) | 14,472 (85.05) | |
| Unknown or not reported | 4316 (13.60) | 2534 (19.64) | 150 (8.31) | 1632 (9.59) | |
|
| |||||
| Private | 23,446 (73.91) | 9224 (71.48) | 1374 (76.16) | 12,848 (75.51) | <0.001 |
| Public | 14,322 (45.15) | 4833 (37.45) | 893 (49.50) | 8596 (50.52) | <0.001 |
| Self-paid or charity | 1289 (4.06) | 298 (2.31) | 109 (6.04) | 882 (5.18) | <0.001 |
|
| |||||
| ≤3 | 25,527 (80.47) | 12,901 (99.97) | 1179 (65.35) | 11,447 (67.28) | <0.001 |
| >3 | 6197 (19.53) | 4 (0.03) | 625 (34.65) | 5568 (32.72) | |
|
| |||||
| Inhaled corticosteroid | 19,734 (62.21) | 5482 (42.48) | 1259 (69.79) | 12,993 (76.36) | <0.001 |
| Inhaled corticosteroid/long-acting beta-2 agonist combination | 16,537 (52.13) | 4261 (33.02) | 1080 (59.87) | 11,196 (65.80) | <0.001 |
| Leukotriene modifier | 6784 (21.38) | 1463 (11.34) | 429 (23.78) | 4892 (28.75) | <0.001 |
| Long-acting beta-2 agonist | 8502 (26.80) | 1881 (14.58) | 548 (30.38) | 6073 (35.69) | <0.001 |
| Mast cell stabilizer | 51 (0.16) | 13 (0.10) | 3 (0.17) | 35 (0.21) | <0.001 |
| Short-acting inhaled beta-2 agonist | 29,019 (91.47) | 11,009 (85.31) | 1770 (98.12) | 16,240 (95.45) | <0.001 |
| Systemic corticosteroid | 15,556 (49.04) | 4491 (34.80) | 950 (52.66) | 10,115 (59.45) | <0.001 |
|
| |||||
| Allergic rhinitis | 8421 (54.13) | 1872 (14.51) | 602 (33.37) | 5947 (34.95) | <0.001 |
| Anxiety or depression | 10,891 (34.33) | 3008 (23.31) | 758 (42.02) | 7125 (41.87) | <0.001 |
| Bronchopulmonary dysplasia | 3 (0.01) | 1 (0.01) | 1 (0.06) | 1 (0.01) | 0.99 |
| Chronic obstructive pulmonary disease | 2265 (7.14) | 471 (3.65) | 143 (7.93) | 1651 (9.70) | <0.001 |
| Cystic fibrosis | 36 (0.11) | 16 (0.12) | 4 (0.22) | 16 (0.09) | 0.02 |
| Eczema | 3138 (9.89) | 606 (4.70) | 223 (12.36) | 2309 (13.57) | <0.001 |
| Gastroesophageal reflux | 6571 (20.71) | 1408 (10.91) | 408 (22.62) | 4755 (27.95) | <0.001 |
| Obesity | 3962 (12.49) | 829 (6.42) | 285 (15.80) | 2848 (16.74) | <0.001 |
| Sinusitis | 5906 (18.62) | 1392 (10.79) | 357 (19.79) | 4157 (24.43) | <0.001 |
| Sleep apnea | 3192 (10.06) | 623 (4.83) | 208 (11.53) | 2361 (13.88) | <0.001 |
Prediction performance of various machine learning models.
| Model | Accuracy | Precision | Recall | F1 Score | AUROC |
|---|---|---|---|---|---|
| Baseline | 57.94% | - | - | - | - |
| C4.5 | 87.37% | 0.84 | 0.87 | 0.85 | 0.90 |
| 59.62% | 0.60 | 0.60 | 0.60 | 0.63 | |
| Naive Bayes | 46.04% | 0.71 | 0.46 | 0.38 | 0.88 |
| SVM | 84.90% | 0.81 | 0.85 | 0.82 | 0.87 |
| Random forest | 87.87% | 0.86 | 0.87 | 0.85 | 0.94 |
| XGBoost (our final model) | 88.20% | 0.85 | 0.88 | 0.86 | 0.96 |
Figure 2The receiver operating characteristic curve of the final model.
Figure 3The confusion matrix of the final model.
The features used in our final model and their importance values.
| Rank | Feature | Importance Calculated as the Feature’s Apportioned Contribution to the Model |
|---|---|---|
| 1 | No. of diagnoses | 0.5311 |
| 2 | No. of outpatient visits who diagnosed with asthma as the primary diagnosis | 0.0792 |
| 3 | No. of asthma diagnoses | 0.0102 |
| 4 | Whether the latest diagnosis of asthma is a primary one | 0.0078 |
| 5 | The severity of the latest asthma diagnoses | 0.0065 |
| 6 | No. of prime asthma visits | 0.0061 |
| 7 | No. of medication orders | 0.0059 |
| 8 | The sum of refills authorized for asthma controllers; | 0.0057 |
| 9 | No. of years since first diagnosed with asthma in the data set | 0.0057 |
| 10 | The sum of units of controllers ordered concerning asthma | 0.0050 |
| 11 | The severity of the utmost severity of asthma diagnosis | 0.0044 |
| 12 | Whether the patient has AIDS/HIV | 0.0044 |
| 13 | Whether the patient has mental disorder | 0.0043 |
| 14 | No. of ED visits concerning asthma | 0.0041 |
| 15 | The sum of units of relievers ordered concerning asthma | 0.0039 |
| 16 | Whether the patient has sinusitis | 0.0038 |
| 17 | The sum of units of SABAs ordered | 0.0038 |
| 18 | Whether the patient has substance use | 0.0038 |
| 19 | No. of outpatient visits | 0.0038 |
| 20 | No. of primary asthma diagnoses | 0.0038 |
| 21 | No. of all type (ED visit, hospital stay, and outpatient visit) of visits | 0.0038 |
| 22 | The sum of asthma medication refills authorized | 0.0038 |
| 23 | The sum of refills authorized for SABAs | 0.0036 |
| 24 | The total length of hospital stay | 0.0036 |
| 25 | No. of ED visits | 0.0035 |
| 26 | The sum of units of inhaled corticosteroids ordered | 0.0035 |
| 27 | Age | 0.0035 |
| 28 | Whether the patient is single | 0.0035 |
| 29 | Whether the patient is Hispanic | 0.0035 |
| 30 | The sum of refills authorized for inhaled corticosteroids | 0.0035 |
| 31 | No. of reliever orders concerning asthma | 0.0035 |
| 32 | Whether the patient has rhinitis | 0.0035 |
| 33 | Whether the patient has vitamin D deficiency | 0.0035 |
| 34 | Whether the patient was paid by charity or self-paid | 0.0034 |
| 35 | Whether the patient has psoriasis | 0.0034 |
| 36 | The sum of various asthma medications ordered | 0.0034 |
| 37 | Whether the distance from the patient’s home to UMW is less than 5-mile | 0.0034 |
| 38 | No. of diagnoses of status asthmaticus | 0.0034 |
| 39 | No. of controller orders concerning asthma | 0.0034 |
| 40 | No. of canceled appointments | 0.0033 |
| 41 | Whether the patient has dyspnea | 0.0033 |
| 42 | The sum of diverse asthma relievers ordered | 0.0033 |
| 43 | Whether the patient has pneumonia | 0.0032 |
| 44 | Whether the patient has rheumatic_disease | 0.0032 |
| 45 | No. of medication orders | 0.0032 |
| 46 | The sum of units of medications ordered | 0.0032 |
| 47 | The sum of refills authorized for systemic corticosteroids | 0.0032 |
| 48 | Whether the patient has COPD | 0.0032 |
| 49 | The sum of refills authorized for asthma relievers | 0.0032 |
| 50 | No. of no shows | 0.0031 |
| 51 | The sum of various asthma controllers ordered | 0.0031 |
| 52 | Whether the patient has folate deficiency | 0.0031 |
| 53 | The sum of units of systemic corticosteroids ordered | 0.0031 |
| 54 | The sum of SABAs ordered | 0.0031 |
| 55 | the average a hospitalization’s length of stay | 0.0031 |
| 56 | The sum of various medications ordered | 0.0031 |
| 57 | Whether the patient is pacific islander | 0.0031 |
| 58 | No. of diagnoses of acute asthma | 0.0031 |
| 59 | The sum of units of medications ordered | 0.0031 |
| 60 | No. of medication prescribers | 0.0030 |
| 61 | Whether the patient is married | 0.0030 |
| 62 | No. of medication prescribers concerning asthma | 0.0030 |
| 63 | Whether the patient is separated | 0.0030 |
| 64 | The sum of medication refills authorized | 0.0030 |
| 65 | Whether the patient has sleep apnea | 0.0030 |
| 66 | The sum of various nebulizer medications ordered | 0.0030 |
| 67 | Whether the patient has myocardial infarction | 0.0030 |
| 68 | The average ED visit’s length of stay | 0.0030 |
| 69 | Whether the patient has AP dementia | 0.0029 |
| 70 | Whether the patient has moderate or severe liver disease | 0.0029 |
| 71 | Whether the patient is female | 0.0029 |
| 72 | The utmost exacerbation severity among all of the asthma diagnoses | 0.0029 |
| 73 | Whether the patient is pregnant | 0.0029 |
| 74 | No. of diagnosis codes of noncompliance with the medication regimen | 0.0029 |
| 75 | Whether nebulizer was used | 0.0029 |
| 76 | Whether the patient is White | 0.0029 |
| 77 | Whether the patient has obesity diagnosis code | 0.0028 |
| 78 | Whether the patient enrolled in any public insurance | 0.0028 |
| 79 | No. of nebulizer medication orders | 0.0028 |
| 80 | Whether the patient has ischemic heart disease | 0.0028 |
| 81 | Whether spacer was used | 0.0028 |
| 82 | Whether the patient has peripheral vascular disease | 0.0028 |
| 83 | Whether the patient is widowed | 0.0028 |
| 84 | Whether the patient has inflammatory bowel disease | 0.0028 |
| 85 | Whether the patient enrolled in any private insurance | 0.0028 |
| 86 | Whether the patient has gastrointestinal bleeding | 0.0028 |
| 87 | Whether the patient has renal disease | 0.0028 |
| 88 | Whether the patient is Asian | 0.0028 |
| 89 | Whether the patient has reflux | 0.0027 |
| 90 | Whether the patient is Black | 0.0027 |
| 91 | Whether the patient has esophagitis | 0.0027 |
| 92 | The sum of units of ordered concerning nebulizer medications | 0.0027 |
| 93 | No. of years since first diagnosed with COPD in the data set | 0.0027 |
| 94 | Whether the patient has anxiety depression | 0.0027 |
| 95 | The sum of systemic corticosteroids ordered | 0.0027 |
| 96 | The severity of the utmost severity of asthma diagnosis | 0.0027 |
| 97 | Whether the patient has mild liver disease | 0.0026 |
| 98 | Whether the patient is divorced | 0.0026 |
| 99 | The sum of relievers ordered concerning asthma that are neither SABAs nor systemic corticosteroids | 0.0026 |
| 100 | The sum of units of inhaled corticosteroids ordered | 0.0026 |
| 101 | Whether the patient has vocal cord dysfunction | 0.0026 |
| 102 | Whether the patient speaks Spanish | 0.0025 |
| 103 | Whether the patient has eczema | 0.0025 |
| 104 | Whether the patient has diabetes with chronic complication | 0.0025 |
| 105 | Whether the patient has malignancy | 0.0024 |
| 106 | Whether the patient has gastrostomy tube | 0.0023 |
| 107 | Whether the patient has URTI | 0.0023 |
| 108 | The sum of units of relievers ordered concerning asthma that are neither SABAs nor systemic corticosteroids | 0.0023 |
| 109 | Whether the patient has anaphylaxis | 0.0023 |
| 110 | Whether the patient has metastatic | 0.0022 |
| 111 | Whether the patient has cerebrovascular | 0.0022 |
| 112 | Whether the patient has vasculitis | 0.0022 |
| 113 | No. of hospitalizations | 0.0022 |
| 114 | The sum of refills authorized for nebulizer medications | 0.0022 |
| 115 | Whether the patient has cirrhosis | 0.0020 |
| 116 | Whether the patient has diabetes without chronic complication | 0.0020 |
| 117 | Whether the patient speaks English | 0.0020 |
| 118 | Whether the patient has congestive heart failure | 0.0019 |
| 119 | Whether the patient has decreased tone | 0.0018 |
| 120 | Whether the patient has cystic fibrosis | 0.0017 |
| 121 | Whether the patient has increased tone | 0.0014 |
| 122 | Whether the patient has GI obstruction | 0.0012 |
| 123 | Whether the patient has hemiplegia | 0.0012 |
| 124 | Whether the patient has IgA deficiency | 0.0009 |
| 125 | Whether the patient has peptic ulcer disease | 0.0008 |
| 126 | Whether the patient has Charlson dementia | 0.0008 |
| 127 | Whether the patient has bronchiolitis | 0.0005 |
Performance results between the final and compared models.
| Model | Accuracy | Precision | Recall | F1 Score | AUROC |
|---|---|---|---|---|---|
| Baseline | 57.94% | - | - | - | - |
| Model_2 | 57.42% | 0.54 | 0.57 | 0.54 | 0.78 |
| Model_3 | 63.75% | 0.60 | 0.64 | 0.62 | 0.82 |
| Final model | 88.20% | 0.85 | 0.88 | 0.86 | 0.96 |
Figure 4The receiver operating characteristic curve of the models. (a) The final model built by full features; (b) the model_2 built by only demographic features; (c) the model_3 built by demographic and comorbidity features.