| Literature DB >> 33104210 |
Vitej Bari1, Jamie S Hirsch1,2,3, Joseph Narvaez4, Robert Sardinia4, Kevin R Bock1, Michael I Oppenheim1,2, Marsha Meytlis1.
Abstract
OBJECTIVE: Improving the patient experience has become an essential component of any healthcare system's performance metrics portfolio. In this study, we developed a machine learning model to predict a patient's response to the Hospital Consumer Assessment of Healthcare Providers and Systems survey's "Doctor Communications" domain questions while simultaneously identifying most impactful providers in a network.Entities:
Keywords: electronic medical record; machine learning; patient experience; physician-patient relations; social network analysis
Mesh:
Year: 2020 PMID: 33104210 PMCID: PMC7727354 DOI: 10.1093/jamia/ocaa194
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Explanatory variables
| Patient | Doc 1 | Doc 2 | …. | Doc N | CCI | LOS | Acuity | Age (y) | Allergies count |
|---|---|---|---|---|---|---|---|---|---|
| Pat 1 | 4 | 1 | 3 | 6 | 1 | 0 | 21 | 0 | |
| Pat 2 | 7 | 0 | 0 | 5 | 2 | 0 | 53 | 0 | |
| Pat 3 | 0 | 0 | 0 | 3 | 6 | 3 | 78 | 3 | |
| Pat4 | 3 | 0 | 2 | 8 | 2 | 0 | 28 | 7 | |
| Pat 5 | 1 | 0 | 0 | 2 | 5 | 0 | 57 | 3 | |
| … | |||||||||
| Pat M | 0 | 9 | 0 | 1 | 3 | 0 | 19 | 0 |
Different features are represented as columns in the matrix. The features consist of 2342 provider features and 22 patient-centric features. For the provider features, each element in the matrix corresponds to the number of interactions between a provider and a patient.
CCI: Charlson Comorbidity Index; Doc: doctor/provider; LOS: length of stay; Pat: patient.
Figure 1.Data flow used in the machine learning model. This diagram shows how the original patient population is reduced for the input to the machine learning model.
Figure 2.Area under the receiver-operating characteristic curve (AUC) for the prediction of responses of Hospital Consumer Assessment of Healthcare Providers and Systems questions in the doctor domain. The performance of the random forest model is compared with the performance of the logistic regression model and decision tree model. The questions are the following: During your hospital stay, how often did doctors (A) treat you with courtesy and respect, (B) explain things in a way that you could understand, and (C) listen carefully to you?
Retrospective and prospective validation test performed using the random forest model for the 3 models
| Parameters | Validation | Question 1 (courtesy and respect) | Question 2 (understand) | Question 3 (listen carefully) |
|---|---|---|---|---|
| AUC | Retrospective | 0.876 (0.865-0.886) | 0.819 (0.809-0.829) | 0.819 (0.808-0.829) |
| Prospective | 0.874 (0.861-0.886) | 0.767 (0.755-0.780) | 0.781 (0.768-0.794) | |
| Sensitivity | Retrospective | 0.633 | 0.483 | 0.483 |
| Prospective | 0.808 | 0.72 | 0.728 | |
| Specificity | Retrospective | 0.981 | 0.988 | 0.991 |
| Prospective | 0.759 | 0.652 | 0.659 | |
| PPV | Retrospective | 0.96 | 0.968 | 0.975 |
| Prospective | 0.762 | 0.649 | 0.664 | |
| NPV | Retrospective | 0.782 | 0.715 | 0.717 |
| Prospective | 0.805 | 0.723 | 0.723 |
Values are AUC (95% confidence interval).
AUC: area under the receiver-operating characteristic curve; NPV: negative predictive value; PPV: positive predictive value.
Figure 3.Metric optimizer revealing how the performance characteristics vary with threshold adjustments over a constant area under the receiver-operating characteristic curve (AUC) for each of the 3 models. This type of adjustment can be used in business decisions to find the right balance between false negative and false positive cases. (A) Retrospective of question 1: courtesy and respect; (B) retrospective of question 2: understanding; (C) retrospective of question 3: listen carefully; (D) prospective of question 1: courtesy and respect; (E) prospective of question 2: understanding; (F) prospective of question 3: listen carefully. NPV: negative predictive value; PPV: positive predictive value.
Feature rank of input features by decrease in impurity and in accuracy based upon the random forest model for the 3 patient experience questions
| Question 1 (courtesy and respect) | Question 2 (understand) | Question 3 (listen carefully) | ||||
|---|---|---|---|---|---|---|
| Rank decrease in impurity | Rank decrease in accuracy | Rank decrease in impurity | Rank decrease in accuracy | Rank decrease in impurity | Rank decrease in accuracy | |
| Age | 11 | 27 | 9 | 8 | 9 | 12 |
| Gender | 77 | 69 | 57 | 344 | 58 | 45 |
| Race | 35 | 38 | 33 | 45 | 35 | 16 |
| Primary language | 122 | 133 | 104 | 341 | 151 | 924 |
| Marital status | 34 | 18 | 19 | 154 | 20 | 161 |
| Religion | 18 | 26 | 26 | 32 | 24 | 37 |
| Length of stay | 15 | 10 | 13 | 10 | 14 | 9 |
| Admit via ED | 7 | 9 | 7 | 7 | 7 | 8 |
| CCI | 13 | 19 | 11 | 12 | 12 | 10 |
| Number of allergies | 59 | 34 | 51 | 254 | 62 | 238 |
| Pain with activity | ||||||
| Max | 29 | 77 | 25 | 208 | 28 | 38 |
| Average | 31 | 99 | 28 | 24 | 29 | 20 |
| Standard deviation | 17 | 14 | 16 | 212 | 18 | 44 |
| Difference between first and last record | 25 | 416 | 23 | 170 | 21 | 40 |
| Pain at rest | ||||||
| Max | 30 | 22 | 27 | 217 | 26 | 96 |
| Average | 32 | 66 | 24 | 37 | 30 | 19 |
| Standard deviation | 19 | 13 | 17 | 109 | 17 | 31 |
| Difference between first and last record | 23 | 198 | 22 | 62 | 23 | 160 |
| Month | 8 | 7 | 15 | 9 | 13 | 15 |
| Previous Response | 16 | 23 | 20 | 15 | 16 | 23 |
| Temperature Recorded | ||||||
| Max | 2 | 1 | 2 | 4 | 5 | 1 |
| Average | 4 | 5 | 1 | 6 | 4 | 6 |
| Standard deviation | 5 | 6 | 3 | 2 | 3 | 2 |
| Difference between first and last record | 9 | 12 | 8 | 13 | 8 | 13 |
| MEWS | ||||||
| Max | 1 | 3 | 4 | 5 | 1 | 3 |
| Average | 3 | 2 | 5 | 1 | 2 | 4 |
| Standard deviation | 6 | 4 | 6 | 3 | 6 | 5 |
| Difference between first and last record | 12 | 11 | 12 | 14 | 11 | 14 |
| Location hours | ||||||
| Inpatient | 14 | 8 | 14 | 16 | 15 | 7 |
| ED | 10 | 16 | 10 | 11 | 10 | 11 |
| ICU | 49 | 65 | 32 | 361 | 36 | 411 |
CCI: Charlson Comorbidity Index; ED: emergency department; ICU: intensive care unit; MEWS: modified early warning score.
Figure 4.Average feature rank over all 3 models. Feature rank by mean decrease in impurity (Gini impurity) is plotted against feature rank by mean decrease in accuracy. Gini impurity is computed during model training, based on how much each feature decreases the weighted impurity in a tree. Then, the impurity decrease from each feature is averaged over all the trees in the forest. Finally, the features are ranked according to this average measure. Rank by mean decrease in accuracy is computed by scrambling each feature and then measuring how much this decreases the accuracy of the model. When low-ranking variables are scrambled, it has little or no effect on model accuracy, while scrambling high-ranking variables significantly decreases accuracy. AdmissionAcuity: binary flag indicating if patient was admitted through the emergency department vs electively; Avg_Active: average pain score during activity; CARDIO: cardiology; CCI: Charlson comorbidity index; ED_Hours: number of hours spent in the emergency department; Inpatient_Hours: number of hours spent in an inpatient unit; LenStay: overall hospital length of stay; MED: internal medicine; OBGYN: obstetrics/gynecology; SD_Rest: the standard deviation of the pain score at rest; SURGR: surgery.
Degree of various populations of doctors
| Population | Average degree |
|---|---|
| All doctors | 401.1 |
| Medicine doctors | 410.9 |
| Surgery Doctors | 376.8 |
| Emergency medicine doctors | 491.9 |
| Obstetrics/gynecology doctors | 164.0 |
| Anesthesia Doctors | 443.7 |
| Other doctors | 433.8 |
| Doctors with rank 1-50 as measured by decrease in accuracy of the random forest model | 858.9 |
Providers who have higher degree interact more with other providers and have a larger influence on the patient experience.
Figure 5.Social network analysis of provider interactions for a single hospital. Each point represents an individual provider and the distance between the points indicates how often the providers interact with one another. Providers who interact frequently are close together while providers who interact infrequently are far apart. Within each network, individuals who have more interactions are also more connected and more influential.
Figure 6.Correlation plot of social network node degree vs average random forest model feature rank. The correlation coefficient is −0.75.