Literature DB >> 30911219

Applying Supervised Machine Learning to Identify Which Patient Characteristics Identify the Highest Rates of Mortality Post-Interhospital Transfer.

Andrew P Reimer1,2, Nicholas K Schiltz1, Vanessa P Ho3, Elizabeth A Madigan4, Siran M Koroukian5.   

Abstract

OBJECTIVE: To demonstrate the usefulness of applying supervised machine-learning analyses to identify specific groups of patients that experience high levels of mortality post-interhospital transfer.
METHODS: This was a cross-sectional analysis of data from the Health Care Utilization Project 2013 National Inpatient Sample, that applied supervised machine-learning approaches that included (1) classification and regression tree to identify mutually exclusive groups of patients and their associated characteristics of those experiencing the highest levels of mortality and (2) random forest to identify the relative importance of each characteristic's contribution to post-transfer mortality.
RESULTS: A total of 21 independent groups of patients were identified, with 13 of those groups exhibiting at least double the national average rate of mortality post-transfer. Patient characteristics identified as influencing post-transfer mortality the most included: diagnosis of a circulatory disorder, comorbidity of coagulopathy, diagnosis of cancer, and age.
CONCLUSIONS: Employing supervised machine-learning analyses enabled the computational feasibility to assess all potential combinations of available patient characteristics to identify groups of patients experiencing the highest rates of mortality post-interhospital transfer, providing potentially useful data to support developing clinical decision support systems in future work.

Entities:  

Keywords:  Transportation of patients; patient outcome assessment; supervised machine learning

Year:  2019        PMID: 30911219      PMCID: PMC6425528          DOI: 10.1177/1178222619835548

Source DB:  PubMed          Journal:  Biomed Inform Insights        ISSN: 1178-2226


Approximately 1.6 million patients undergo interhospital transfer annually.[1] Patients undergoing interhospital transfer experience up to three times higher mortality,[2,3] use double the resources and experience twice the length of stay than those not transferred from another hospital.[1] Interhospital transfers consist of two primary patient types: those experiencing an immediately life-threatening condition (e.g. myocardial infarction, trauma) and those who are not experiencing an immediately life-threatening condition. Transfer for patients experiencing an immediately life-threatening condition has been shown to be a life-saving measure, with reductions in mortality for trauma[4-8] and heart attack patients[9] but has yielded conflicting results for stroke[10,11] and minimally injured trauma patients.[12-14] The decision to transfer patients from lower to higher levels of care for an immediately life-threatening condition are common and often supported by referral networks established within local regions like trauma and stroke networks. For those patients not experiencing an immediately life-threatening condition, the decision to transfer is complicated and is based on individual provider judgment, family request, or other factors. Currently, no national guidelines[15] exists to guide interhospital transfer; furthermore, there is limited understanding of who does and does not benefit from being transferred and exactly when those transfers should occur. The overall poor outcomes that interhospital transfer patients experience and mixed outcomes for patients that are immediately transferred for time-sensitive conditions suggest that we do not have a good understanding of immediately life-threatening conditions. Outside of patients that are transferred for intervention that must be performed immediately upon arrival at the receiving hospital (e.g. cardiac catheterization and surgical procedure), our recognition of what constitutes a patient experiencing an immediately life-threatening condition needs to be reconceptualized. Reconceptualizing type of transfer patients require the focus to move beyond the currently used broad categories (e.g. trauma and stroke) to categories that support patient-specific characteristics that identify those who should be considered for transfer. Therefore, to begin moving toward a more patient-centric approach, the purpose of this study was to identify specific groups of patients and their associated characteristics that experience high levels of mortality post-transfer.

Methods

Data source

We used the 2013 Nationwide Inpatient Sample (NIS). The NIS is part of the Healthcare Cost and Utilization Project (HCUP) from the Agency for Healthcare Research and Quality (AHRQ) and is the largest all-payer inpatient database in the United States with a nationally representative sample of approximately 8 million inpatient discharges each year.[16] We identified all adult patients aged 19 years or older that were transferred from one acute care hospital to another to compose an interhospital transfer cohort.

Measures

Our main outcome measure is in-hospital mortality, as recorded on the hospital billing record discharge status. To identify patient characteristics and variables that are clinically meaningful and where available in the data set, we only incorporated covariates that are useful in guiding clinical decision-making or practice. Patient-level covariates included the following: age (continuous), gender, payer type, race, comorbidity, and primary diagnosis. To include the primary diagnosis and to make the analysis computationally feasible, we accounted for the primary diagnosis via the Clinical Classification System (CCS) for the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM),[17,18] using the multi-level diagnosis category labels—a total of 17 categories. The multi-level CCS category is a standard, established method to collapse over 14 000 diagnosis codes and 3900 procedure codes into clinically meaningful categories.[18] Refer to Supplementary Material Table 1 for variables included in analysis, for a listing of the covariates and CCS categories used in the model. We measured the presence of comorbid conditions using the Elixhauser comorbidity index list. The Elixhauser index contains 30 comorbid conditions defined through secondary ICD-9-CM diagnosis codes and Diagnosis Related Group (DRG) codes.[19-21] We excluded both arthritis and fluid and electrolyte disorder comorbidities. Many patients have arthritis, and for the purposes of this study, it was not considered a factor that differentiates patients for transfer. In addition, most patients hospitalized and undergoing interhospital transfer experience some form of abnormal laboratory value, making it not clinically useful for identifying discrete subgroups of patients who will provide new insight to enable reconceptualizing patient categories for transfer. To describe the severity of the patient population and to enabling comparison between the data subsets used in the analysis, we used the All Patient Refined Diagnosis Related Groups (APR-DRG) Risk Mortality covariate provided by HCUP. The APR-DRGs are assigned using proprietary software developed by 3M Health Information Systems that include the base APR-DRG, the severity of illness subclass, and the risk of mortality subclass within each base APR-DRG.[16] We only used this variable to provide a description of the study samples and did not include it in the model development and analyses due to it being a combination of other covariates already included in the model (e.g. age, gender, and diagnosis) while also including proprietary calculations that are not available within the electronic medical record (EMR) and thus would not be useable in decision-support tools or other patient care activities relying on primary data. System-level covariates in the analysis included the following: admission month, admission on a weekend, hospital bed size, hospital teaching status, hospital region, hospital control/ownership, and patient location before hospitalization. We also accounted for whether patients received a major operating room procedure that was either diagnostic or therapeutic occurring post-transfer. The University Hospitals Case Medical Center Institutional Review Board determined that this study meets the exemption criteria for human subject research (IRB #em-14-30).

Statistical analysis

Frequency counts and percentages were tabulated for the categorical outcome—mortality. For descriptive analysis, we used discharge-level survey weights provided in the NIS that accounted for complex survey design effects. The final sample for this study is a nationally representative sample generated via the weighting variable provided with the data set. However, the classification and regression tree (CART) analysis does not apply the sample weights, which leads to smaller samples in the terminal nodes. We excluded cases where the mortality variable was missing. We did not exclude any observations with missing values for the independent variables specifically because a robust feature of the CART algorithm is that it handles missing data using the surrogate split method—a method that finds an alternative variable that is highly correlated with the missing variable to determine the split.[22] While there are other methods for handling missing data in CART analysis,[23] the default setting in CART packages is to skip missing variables to streamline the analysis.[24] In this analysis, we employed the surrogate split method that identifies and supplements a surrogate variable.

Supervised machine-learning approaches

We used CART analysis to identify combinations of predictors associated with post-transfer mortality. The CART involves a tree-building technique in which the choice of “splitting” variables is based on an exhaustive search of all possibilities, using a recursive partitioning algorithm, resulting in mutually exclusive groups that are the most different with respect to the dependent variable.[25] The tree-building process leads to terminal nodes (or leaves), at which point the nodes cannot be divided anymore and need to be pruned to avoid over-fitting and increase efficiency.[26] First, CART recursively partitions the patients into smaller and smaller homogeneously distributed groups—in this case, based on the presence of specific combinations of clinical conditions. The purpose is to reduce variations within the group and to improve the fit as best as possible. Next, CART uses these groups to predict post-transfer mortality. We used the following stopping criteria (based on model tuning described below): a maximum tree depth of 10 splits, a minimum node size of 50 subjects, requiring a split to increase the complexity parameter by a minimum of 0.001 and using the information impurity index to determine node splits. To build our model, we partitioned the study data into a training data set (70% of the data) and validation data set (remaining 30%) using random sampling within each class of the outcome variables. We used 10-fold cross-validation repeated three times on the training data set to build the CART models. Since mortality is highly unbalanced, we weighted the “cost” of a false negative to be higher than a false positive to improve sensitivity and produce a more meaningful model. We then tested the accuracy of our models on the testing data set using a confusion matrix and by calculating the area under the curve. We also used the Matthews correlation coefficient measure, a measure of accuracy that accounts for imbalanced outcomes.[27] We chose our final model for the outcome based on accuracy and interpretability. In addition, we compared our final models with those from a random forest model to see if they were in agreement on variables that are the most important predictors. Random forest is a bootstrap aggregation method that creates multiple decision trees using random variable selection. Breiman et al[22] provides a detailed description of random forest. We used SAS software version 9.4[28] for data management; for our statistical analyses, we used R version 3.3.1 and RStudio 1.0.136[29] and the “rpart” (CART), partykit (tree graphics), “randomForest” (random forest), and “caret” (model tuning and cross-validation) packages.

Results

In 2013, approximately 1 456 422 adult patients underwent interhospital transfer, 52% were male, 66% White, 11% Black, and 7% Hispanic. The primary payers for the interhospital transfer were Medicare 44%, Medicaid 19%, and private insurance 26%. Further demographic characteristics of the nationally weighted sample are provided in Table 1, and the frequency of the primary diagnosis categorized by the multi-level diagnosis category of the CCS in Table 2. As expected, circulatory disease was the most frequent diagnosis in the older age groups (45 and older), whereas mental health was the most frequent in the youngest age groups (19-44). Frequency of comorbidities across age cohorts is presented in Table 3. The distribution of patient characteristics across the total study population and between the training and testing data sets are available in Supplementary Material Table 2.
Table 1.

Sample characteristics*.

Age (19-34)Age (35-44)Age (45-54)Age (55-64)Age (65-74)Age (75-84)Age over 85
n%n%n%n%n%n%n%
Total subjects179 895125 525212 665275 530290 259245 504127 044
Sex
 Male80 53044.863 44550.5119 25556156 23556.7154 42553.2119 24948.650 68539.9
 Female99 36055.262 07549.593 40043.9119 29543.3135 83046.8126 23951.476 34460.1
Race
 White104 06557.877 49061.7137 92564.9188 15568.3212 47073.2183 95474.998 32477.4
 Black27 44015.317 42513.929 79514.032 75011.923 9508.315 5156.36 4955.1
 Hispanic16 6759.310 1558.114 0706.614 5355.313 6204.711 7104.850504.0
 Other31 71517.620 45516.330 87514.540 08914.640 21913.934 32414.017 17513.5
Admission status
 Non-elective admission150 18083.5105 82084.3176 97583.2225 71081.9229 17579.0185 90575.792 63072.9
 Elective admission29 01516.119 23015.335 05516.548 95517.860 10420.758 68423.933 85926.7
Mortality risk
 Minor133 94574.577 12561.4106 01049.8109 94539.970 79024.433 84013.813 13510.3
 Moderate22 55012.523 66018.850 49023.775 70027.592 07031.789 62436.550 94940.1
 Major13 3907.414 51511.634 06516.054 99520.080 22027.682 60533.645 80536.1
 Extreme99355.510 1758.122 02010.434 78512.647 09516.239 30516.017 10513.5
Payer type
 Medicare15 1258.422 78518.249 13523.181 74029.7246 00984.8224 18491.3116 76991.9
 Medicaid58 89532.732 77526.145 75521.541 99515.240401.422450.910450.8
 Private62 25534.643 02034.377 20036.3112 86541.030 78010.613 2205.460154.7
 Self-pay25 76514.316 38513.124 62011.619 4807.114550.59250.46700.5
 Other14 3058.083006.612 7006.016 7256.172652.544051.823351.8
Hospital teaching status
 Nonteaching44 00024.530 33524.251 40524.265 45023.874 77525.870 21528.639 50031.1
 Teaching124 78969.487 40969.6149 34970.2193 97470.4195 46967.3154 37462.974 61958.7
Hospital region
 Northeast25 90014.418 50014.732 32015.240 53014.742 52514.736 85515.021 08516.6
 Midwest54 56530.336 89529.459 16027.877 99428.380 81427.872 52929.537 82929.8
 South69 91538.949 68539.688 21541.5110 05539.9115 36539.790 18536.742 04033.1
 West29 51516.420 44516.332 97015.546 95017.051 55417.845 93418.726 09020.5

Sample characteristics total subjects and % represent the data set weighted to reflect national estimates.

Table 2.

Clinical classification frequencies*.

Age (19-34)Age (35-44)Age (45-54)Age (55-64)Age (65-74)Age (75-84)Age over 85
n%n%n%n%n%n%n%
Total subjects179 895125 525212 665275 530290 259245 504127 044
Clinical Classification
 Infection59053.357404.611 6155.516 7806.116 6855.712 7855.265205.1
 Cancer27551.535252.885704.013 7555.014 0204.897454.032302.5
 Hematologic23401.213651.015950.719950.725100.816750.724001.1
 Metabolic38252.128702.349252.354702.047701.632001.315151.2
 Mental56 35031.329 97023.936 97017.422 5658.211 5104.074453.040003.1
 Nervous74954.253954.374703.582353.070552.451752.118101.4
 Circulatory10 6955.921 48017.156 11026.488 61532.2100 70034.783 43534.037 35529.4
 Respiratory61653.455504.411 7705.518 8256.821 5107.417 5257.189807.1
 Digestive12 7357.111 8409.419 3809.123 3608.523 1958.018 6857.610 2958.1
 Genitourinary37102.137803.055352.674252.787303.080353.346503.7
 Pregnancy28 54515.947453.8800.000.000.000.000.0
 Skin26901.520851.729701.427651.020250.715350.67950.6
 Musculoskeletal25101.425202.039901.950001.852551.838701.617551.4
 Congenital3900.22650.23100.12600.11900.11550.1600.0
 Injury/poison26 76514.917 28513.825 06511.830 50011.131 21510.829 25511.920 10515.8

Sample characteristics total subjects and % represent the data set weighted to reflect national estimates.

Table 3.

Comorbidity Frequencies*.

Age (19-34)Age (35-44)Age (45-54)Age (55-64)Age (65-74)Age (75-84)Age over 85
n%n%n%n%n%n%n%
Total subjects179 895125 525212 665275 530290 259245 504127 044
Comorbidities
 AIDS3450.24650.49100.44800.21700.1450.000.0
 Alcohol abuse18 23010.115 17012.130 82514.524 0858.713 0354.547351.97500.6
 Deficiency anemias19 54010.917 77014.234 99516.553 59519.564 06022.160 21024.531 72025.0
 Rheumatoid arthritis24101.328602.352552.584803.110 6253.793903.838853.1
 Long-term blood loss anemia58453.218951.519350.929951.134301.232051.317901.4
 Congestive heart failure24551.438003.011 7455.524 8359.036 61012.640 20516.426 36520.8
 Long-term pulmonary disease18 77010.416 43513.140 60019.166 61024.278 96527.261 53525.124 75519.5
 Coagulopathy79204.475156.016 7357.923 7458.623 9208.219 7608.086306.8
 Depression15 8808.815 52012.428 21513.337 83013.736 18512.526 92511.012 5009.8
 Diabetes—uncomplicated88104.916 46513.143 64020.573 31026.689 27530.868 14027.825 63020.2
 Diabetes—long-term complications23051.349904.012 4205.821 4507.824 2508.416 8956.950654.0
 Drug abuse37 30520.718 36014.623 97011.313 6054.938401.310200.42600.2
 Hypertension—combine22 86012.743 87535.0110 65052.0174 47563.3205 79070.9182 14474.294 36474.3
 Hypothyroidism62703.587857.018 9208.930 87011.242 89014.844 91518.328 72022.6
 Liver disease37802.152404.213 8056.517 1856.210 2153.542551.79700.8
 Lymphoma4000.25900.512950.624400.939701.431751.311050.9
 Fluid and electrolyte disorders27 63015.426 51021.154 19025.578 81528.686 10029.774 35030.339 36531.0
 Metastatic cancer7350.413501.146152.283903.093553.260902.516801.3
 Other neurological disorders12 5357.010 5508.417 8658.423 2358.426 3059.126 04010.615 06511.9
 Obesity18 11510.121 46517.137 15017.549 95018.145 99515.822 5759.242353.3
 Paralysis59803.353404.310 6605.016 1505.918 3506.314 1155.763455.0
 Peripheral vascular disorders16950.927602.298804.622 7808.334 32011.831 79012.914 30011.3
 Psychoses11 9406.694607.515 0657.116 1455.912 3054.279853.336902.9
 Pulmonary circulation disorders23401.322651.849652.385003.110 3903.610 4204.256454.4
 Renal failure46152.676606.119 8759.338 77014.155 32519.156 86523.231 25524.6
 Solid tumor without metastasis8500.510450.835151.770052.597503.480403.333452.6
 Peptic ulcer disease150.0400.0650.0900.01200.01150.0150.0
 Valvular disease20951.222051.846352.286303.113 8504.817 9407.312 6109.9
 Weight loss71053.965205.214 5756.923 0258.426 1409.023 0109.412 0959.5

Sample characteristics total subjects and % represent the data set weighted to reflect national estimates.

Sample characteristics*. Sample characteristics total subjects and % represent the data set weighted to reflect national estimates. Clinical classification frequencies*. Sample characteristics total subjects and % represent the data set weighted to reflect national estimates. Comorbidity Frequencies*. Sample characteristics total subjects and % represent the data set weighted to reflect national estimates. The final CART identified 21 discrete subgroups of patients (Figure 1). Trees from the training holdout data set and the testing holdout data set contained the same splits and terminal nodes. Of the 21 subgroups, 12 were for patients with a primary cardiac diagnosis (n = 16 798 patients), the next eight groups primary diagnoses were cancer (n = 35 030 patients), and the remaining subgroup had neither cardiac nor cancer as a primary diagnosis (n = 151 464).
Figure 1.

Classification and regression tree.

The classification and regression tree with variables identified within the ovals and the value of each variable at the split signified and defined by each connecting line to the next variable and split. The bars at the bottom identify a distinct clinical group with the total number of subjects contained in that group (n), the bar represents the % mortality for patients in that group.

Classification and regression tree. The classification and regression tree with variables identified within the ovals and the value of each variable at the split signified and defined by each connecting line to the next variable and split. The bars at the bottom identify a distinct clinical group with the total number of subjects contained in that group (n), the bar represents the % mortality for patients in that group. Subgroups with a primary cardiac diagnosis (Figure 1—right side) experiencing the highest rates of post-transfer mortality included (1) patients greater than 40 years old with either coagulopathy (30% mortality) or with metastasis (~35%), (2) patients greater than 52 years old with cardiac arrhythmia and either liver failure (~35%) or pulmonary circulatory comorbidity (30%), and (3) patients greater than 72 years without Medicare (35%). The payer mix of the patients in the subgroup that was greater than 72 years and without Medicare consisted of 10% on Medicaid, 56% private insurance, 10% self-pay, and 24% not specified. Alternatively, patients that were less than 40 years (5% mortality) or greater than 40 years and underwent an operating room procedure (5% mortality) experienced the highest rates of survival. Subgroups of patients that had cancer as the primary diagnosis (Figure 1—left side) that experienced the highest rates of mortality post-transfer included (1) those greater than 83 years old (35% mortality), (2) those >68 years with either hypertension (15% mortality) or on Medicare (10% mortality), and (3) for those <68 years old with coagulopathy and either arrhythmia (25% mortality) or pulmonary circulatory comorbidity (35% mortality). The results from the random forest analysis are presented in Figure 2. Variables identified as being important via random forest, but not included in any of the CART pathways include weight loss, congestive heart failure, and genitourinary.
Figure 2.

Random forest results.

Abbreviations: Dx, diagnosis; CM, comorbidity.

Variables identified as contributing the most to post-transfer mortality are displayed with the most important starting at the top and descending to least important. The highlighted box contains the variables with the highest importance. Variables with an * are not included in the classification and regression tree.

Random forest results. Abbreviations: Dx, diagnosis; CM, comorbidity. Variables identified as contributing the most to post-transfer mortality are displayed with the most important starting at the top and descending to least important. The highlighted box contains the variables with the highest importance. Variables with an * are not included in the classification and regression tree.

Model performance

We tested the performance of our model on a holdout data set. The area under the curve was 0.69, and the Matthews correlation coefficient was 0.198. The model had a positive predictive value (PPV) of 0.291 and a negative predictive value (NPV) of 0.960. The sensitivity was 0.18 and the specificity was 0.98. As we further describe below, the aim of this model was to identify clinically meaningful rather than most accurately predict mortality post-transfer.

Discussion

This analysis identified 21 distinct groups of patients, 13 of which experienced mortality rates more than double the national average ranging from 4.7% to 5.2% post-transfer mortality.[1] In 2013, the national mortality for all-cause hospital admissions was 2%. This analysis included all patients, even patients who underwent transfer for routine procedures such as orthopedic cases or appendectomies, who were accounted for in the far left of the tree in the lowest mortality group (n = 151 464). Alternatively, the other lowest mortality group consisted those with a circulatory diagnosis and who were aged younger than 40.5 years. The left side of the tree, or the non-cardiac side, was dominated by patients with cancer, composing the second largest group of patients undergoing transfer (n = 35 020), with the highest mortality experienced by those with coagulopathy as a comorbid condition. Coagulopathy is also represented on the right side as significant contributor to increased mortality post-transfer. Of note, comorbid conditions in the AHRQ NIS are not directly related to the primary diagnosis or necessarily the main reason for admission, likely having originated before the current hospitalization, thus representing a pre-existing condition.[16] The finding that coagulopathy is a significant predictor of post-transfer mortality was surprising, but its significance is reinforced by the random forest analysis (Figure 2) and our other work looking at surgical populations.[30] Coagulopathy typically manifests as a secondary physiologic response to a primary disturbance such as cancer and trauma induced and has been found to be an independent predictor of in-hospital mortality, regardless of transfer status.[31,32] This study reinforces including coagulopathy, whether it is a comorbidity or a condition on the active problem list for the current hospitalization, as a covariate in future modeling efforts. This study identified that patients with a cardiac diagnosis and aged less than 40 years or were older than 40 years and received an operating room procedure experienced the highest survival rates post-transfer. While we cannot ascertain the specific operating room procedures performed, the high survival rates for this clinical group receiving a major therapeutic or diagnostic operating room procedure supports the role that transfer plays in improving mortality. Likely, these patients without concomitant comorbidity or other significant clinical characteristics, represent those experiencing a myocardial infarction or other time-sensitive condition that benefits from rapid transport and subsequent intervention. While the primary focus of this study was not to predict patient mortality, the methods employed identified groups of patients that experience mortality at rates two to three times higher than the expected rate of post-transfer mortality of 5% and thus provides specific groups of patients that warrant focused inquiry. Current efforts to leverage EMR data to support developing clinical decision-support systems (e.g. health system transfer command centers)[33] can benefit by initially focusing on high-risk target populations like those identified in this analysis. The random forest model identified several important variables not included in the individual tree, those being weight loss, congestive heart failure, and genitourinary conditions. The variable importance results reported in the random forest are the average results of many individual trees—many trees included the three omitted variables while others did not. Given that the CART tree represents an individual tree and sample; in this case sample, 789 out of 10 000, it is possible that variables identified in the random forest analysis are not represented in this specific tree. Omission of these variables in the individual tree can be due to the greedy splitting procedure that identifies the best split at that particular point in the tree without considering the impact on the full model. Therefore, depending on the random sample chosen to run the CART, the tree for each sample can include different variables and split points. During the analytic process, we randomly select the samples and “freeze” them, otherwise we would get a different training and testing sample each time the analysis was performed. The omission of the variables underlines the importance of running complementary or additional analyses when using atheoretical approaches. Our model had an area under the curve of 0.69, which is reasonable performance for rare and difficult events to predict like mortality. The area under the curve (AUC) is in-line with other studies that have used the Elixhauser or Charleston comorbidity indices to predict mortality that ranged between 0.65-0.80.[34,35] It is difficult to compare the performance of AUC across studies that assess different patient populations, and to our knowledge, this is the first model to predict mortality among all-diagnoses of transferred patients. Finally, employing the supervised machine-learning techniques provides distinct analytical advantages over traditional modeling techniques that we have used in past analyses. The primary advantage is the ability to assess all available covariates in every possible combination. Rather than identifying the influence of a given covariate while the others are held constant, the supervised machine-learning techniques employed allow us to test every possible combination of the covariates to identify clinically meaningful combinations and report those combinations in mutually exclusive groups capable of being easily incorporated into decision-support modeling or other approaches such as developing more precise clinical nomograms. In addition, the mutually exclusive groups provide easily recognizable patient characteristics in specific combinations that are more descriptive than the odds of change in one variable while the others are held constant. For example, our past work employing regression identified that the odds of death increased with age, with age being included in the regression via seven categories.[1] Alternatively, in CART, we are able to include age as a continuous variable and let the technique determine what the significant splits in age are for a given combination of characteristics. For example, in Figure 1, age is split five different times in the tree with each split signifying a significant difference in outcome for those patients above or below that age threshold. Attempting to identify these age categories via other approaches, would be burdensome, if achievable at all.

Limitations

Secondary analyses of existing databases present several limitations. First, we were only able to include basic demographic characteristics, the Elixhauser comorbidities, primary diagnosis via the CCS, and basic hospital descriptors. While nationally representative, the lack of rich clinical descriptors limits the depth of the analyses and applicability of the findings. Second, primary diagnosis determination is complex and is influenced by the clinical course of care as well as coding for payment. This well-known limitation has been identified by others. Third, we included all patients that were transferred between hospitals, including groups of patients that on one end would not impact overall transfer mortality rates (e.g. mental health) and, on the other end, patients who exceeded the level of care available at their current hospital (i.e. community hospital) and had to be transferred to a tertiary center. Fourth, inclusion of variables such as operating room procedure are only broad indicators of care and do not provide specificity in differentiating between normal and unexpected rates of mortality. However, the inclusion of operating procedure across the models highlights the need to conduct further in-depth investigations into specifically which transfers and corresponding procedures impart improved morbidity and mortality, highlighting a strength of this broad approach to focus future inquiry. Finally, we do not know why the patient was transferred and the elements contributing to the decision. This will be future work.

Conclusions

This study analyzed a nationally representative sample of hospital discharges to identify groups of patients who experience increased mortality after undergoing interhospital transfer. The supervised machine-learning approach implemented identified 13 distinct groups of patients who experience post-transfer mortality more than double the national average mortality of post-transfer patients. Of the 13 groups, 10 experience mortality rates of 20% or greater, identifying specific groups of patients that may benefit from being transferred sooner based on their individual characteristics. The individual characteristics identified do not necessarily fall into the currently used categories of transfer patients, supporting the reconceptualization of which patient groups should be considered for immediate transfer to another hospital. Click here for additional data file. Supplemental material, Supplementary_Material_Table_1_xyz1364584bb4390 for Applying Supervised Machine Learning to Identify Which Patient Characteristics Identify the Highest Rates of Mortality Post-Interhospital Transfer by Andrew P Reimer, Nicholas K Schiltz, Vanessa P Ho, Elizabeth A Madigan and Siran M Koroukian in Biomedical Informatics Insights Click here for additional data file. Supplemental material, Supplementary_Material_Table_2_xyz13645265ff583 for Applying Supervised Machine Learning to Identify Which Patient Characteristics Identify the Highest Rates of Mortality Post-Interhospital Transfer by Andrew P Reimer, Nicholas K Schiltz, Vanessa P Ho, Elizabeth A Madigan and Siran M Koroukian in Biomedical Informatics Insights
  26 in total

Review 1.  Classification and regression tree analysis in public health: methodological review and comparison with logistic regression.

Authors:  Stephenie C Lemon; Jason Roy; Melissa A Clark; Peter D Friedmann; William Rakowski
Journal:  Ann Behav Med       Date:  2003-12

2.  Comparison of the Elixhauser and Charlson/Deyo methods of comorbidity measurement in administrative data.

Authors:  Danielle A Southern; Hude Quan; William A Ghali
Journal:  Med Care       Date:  2004-04       Impact factor: 2.983

3.  Interhospital transfer of critically ill patients: demographic and outcomes comparison with nontransferred intensive care unit patients.

Authors:  Andrea D Hill; Evelyn Vingilis; Claudio M Martin; Kathleen Hartford; Kathy N Speechley
Journal:  J Crit Care       Date:  2007-12       Impact factor: 3.425

4.  A modification of the Elixhauser comorbidity measures into a point system for hospital death using administrative data.

Authors:  Carl van Walraven; Peter C Austin; Alison Jennings; Hude Quan; Alan J Forster
Journal:  Med Care       Date:  2009-06       Impact factor: 2.983

5.  Risk factors and clinical significance of trauma-induced coagulopathy in ICU patients with severe trauma.

Authors:  Shan-Xiang Xu; Lian Wang; Guang-Ju Zhou; Mao Zhang; Jian-Xin Gan
Journal:  Eur J Emerg Med       Date:  2013-08       Impact factor: 2.799

Review 6.  Helicopter emergency medical services for adults with major trauma.

Authors:  Samuel M Galvagno; Stephen Thomas; Christopher Stephens; Elliott R Haut; Jon M Hirshon; Douglas Floccare; Peter Pronovost
Journal:  Cochrane Database Syst Rev       Date:  2013-03-28

7.  Does helicopter emergency medical service transfer offer benefit to patients with stroke?

Authors:  Michael D Olson; Alejandro A Rabinstein
Journal:  Stroke       Date:  2011-12-08       Impact factor: 7.914

8.  Impact of prehospital mode of transport after severe injury: a multicenter evaluation from the Resuscitation Outcomes Consortium.

Authors:  Eileen M Bulger; Danielle Guffey; Francis X Guyette; Russell D MacDonald; Karen Brasel; Jeffery D Kerby; Joseph P Minei; Craig Warden; Sandro Rizoli; Laurie J Morrison; Graham Nichol
Journal:  J Trauma Acute Care Surg       Date:  2012-03       Impact factor: 3.313

9.  Coagulopathy and inhospital deaths in patients with acute subdural hematoma.

Authors:  Eric M Bershad; Saeid Farhadi; M Fareed K Suri; Eliahu S Feen; Olga H Hernandez; Warren R Selman; Jose I Suarez
Journal:  J Neurosurg       Date:  2008-10       Impact factor: 5.115

10.  Accepting critically ill transfer patients: adverse effect on a referral center's outcome and benchmark measures.

Authors:  Andrew L Rosenberg; Timothy P Hofer; Cathy Strachan; Charles M Watts; Rodney A Hayward
Journal:  Ann Intern Med       Date:  2003-06-03       Impact factor: 25.391

View more
  1 in total

1.  Factors associated with Interhospital transfers of emergency general surgery patients from emergency departments.

Authors:  Sara Fernandes-Taylor; Dou-Yan Yang; Jessica Schumacher; Fiona Ljumani; Baruch S Fertel; Angela Ingraham
Journal:  Am J Emerg Med       Date:  2020-12-13       Impact factor: 2.469

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.