Literature DB >> 35710407

Comparison between machine learning methods for mortality prediction for sepsis patients with different social determinants.

Hanyin Wang1, Yikuan Li1, Andrew Naidech2, Yuan Luo3.   

Abstract

BACKGROUND: Sepsis is one of the most life-threatening circumstances for critically ill patients in the United States, while diagnosis of sepsis is challenging as a standardized criteria for sepsis identification is still under development. Disparities in social determinants of sepsis patients can interfere with the risk prediction performances using machine learning.
METHODS: We analyzed a cohort of critical care patients from the Medical Information Mart for Intensive Care (MIMIC)-III database. Disparities in social determinants, including race, sex, marital status, insurance types and languages, among patients identified by six available sepsis criteria were revealed by forest plots with 95% confidence intervals. Sepsis patients were then identified by the Sepsis-3 criteria. Sixteen machine learning classifiers were trained to predict in-hospital mortality for sepsis patients on a training set constructed by random selection. The performance was measured by area under the receiver operating characteristic curve (AUC). The performance of the trained model was tested on the entire randomly conducted test set and each sub-population built based on each of the following social determinants: race, sex, marital status, insurance type, and language. The fluctuations in performances were further examined by permutation tests.
RESULTS: We analyzed a total of 11,791 critical care patients from the MIMIC-III database. Within the population identified by each sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. On the 5783 sepsis patients identified by the Sepsis-3 criteria statistically significant performance decreases for mortality prediction were observed when applying the trained machine learning model on Asian and Hispanic patients, as well as the Spanish-speaking patients. With pairwise comparison, we detected performance discrepancies in mortality prediction between Asian and White patients, Asians and patients of other races, as well as English-speaking and Spanish-speaking patients.
CONCLUSIONS: Disparities in proportions of patients identified by various sepsis criteria were detected among the different social determinant groups. The performances of mortality prediction for sepsis patients can be compromised when applying a universally trained model for each subpopulation. To achieve accurate diagnosis, a versatile diagnostic system for sepsis is needed to overcome the social determinant disparities of patients.
© 2022. The Author(s).

Entities:  

Keywords:  Disparity; Machine learning; Mortality prediction; Sepsis; Social determinants

Mesh:

Year:  2022        PMID: 35710407      PMCID: PMC9204861          DOI: 10.1186/s12911-022-01871-0

Source DB:  PubMed          Journal:  BMC Med Inform Decis Mak        ISSN: 1472-6947            Impact factor:   3.298


Background

Sepsis, one of the most life-threatening circumstances for critically ill patients in the United States, is the culmination of complex interactions between the infecting microorganism and the host immune, inflammatory, and coagulation responses [1, 2]. Each year, more than 1.7 million adults in the United States develop sepsis, and approximately 270,000 die because of sepsis. The prevalence of sepsis is around one-third among hospitalized patients [3]. With a few identification methods currently available, a standardized criteria is still under development [4] Disparities in critical care can be induced by multi-factored causes [5-8]. Biases are observed in healthcare for patients from different social status groups [9, 10]. With more data-driven and artificial intelligence (AI) involved in healthcare, disparities among sub-populations are more frequently observed and attracted more attention [11-15]. Machine learning applications for risk prediction in healthcare are becoming more powerful with the development of electronic health records (EHRs) [16-20]. Risk predictions for sepsis patients using machine learning techniques have been studied [21-24]. However, the discussions over how the disparities and biases interact with risk prediction models for sepsis patients remain undefined. In this study, we revealed the disparities in the proportions of sepsis in subpopulations of social determinants groups from a cohort of patients admitted for critical care services and examined the fluctuations in the performances of mortality prediction for subpopulations of sepsis patients when using machine learning classifiers.

Methods

Data

Medical Information Mart in Intensive Care (MIMIC)-III v1.4 is an open-sourced large scale database of critical care patients with enriched features [25]. From a total of 23,620 intensive care unit (ICU) admission records, 11,791 patients with their initial admission records were identified and utilized in this study. Selection criteria were applied to filter out nonadults, patients with suspected infection more than 24 h before the ICU admission or more than 24 h after the ICU admission, patients with missing data, and patients admitted for cardiothoracic surgery services. The data selection algorithms were elaborated in a previous study [4].

Social determinants

Five social determinants were studied, including race, sex, insurance type, marital status, and language. Race of all subjects was re-leveled into five categories, Asian, Black or African American, Hispanic or Latino, White and other, where the “other” category covers American Indian and Alaska Native, Native Hawaiian or other Pacific Islander, multi-race, unspecified race, and other races not mentioned above. Dichotomous sex, female and male, was considered. Insurance types were taken directly from the MIMIC-III database, which includes government, Medicaid, Medicare, private, and self-pay. Marital status was re-factorized into the following categories: significant other, single, separated, widowed, and unknown, where the “significant other” category covers the situations if life partner or married was indicated in the MIMIC-III database, the “separated” category covers the circumstances if divorced or separated was displayed in the database, the “unknown” category covers the situation if unknown (default) was indicated in the database and was coded for those patients did not specify the marital status. Languages were re-grouped into English, Spanish and other, where the “other” category covers any languages documented in the database other than the two stated.

Disparities in social determinants across various sepsis criteria

We compared the disparities between each sub-category of social determinants in the sepsis population detected by the six identification methods for sepsis: (1) explicit criteria: two codes explicitly mentioning sepsis (995.92 and 785.52 for severe sepsis and septic shock, respectively) defined by International Classification of Diseases, 9th version (ICD-9); (2) Angus methodology [26]; (3) Martin methodology [27]; (4) criteria presented by Centers for Medicare & Medicaid Services (CMS) [28]; (5) the complete surveillance criteria presented by Center of Disease Control and Prevention (CDC) [29]; (6) Sepsis-3 [30]. Forest plots were generated for the proportion of each subpopulation that was identified as sepsis by each method. For example, a proportion of 0.274 for Asian and “Angus” represents 27.4% of the Asians in the dataset were identified as sepsis by the Angus criteria. A 95% confidence interval was constructed by bootstrapping (1000 simulations) and shown in the forest plots for each proportion.

Mortality prediction for sepsis patients using machine learning

We built machine learning classifiers to predict mortality for sepsis patients. The sepsis patient population was constructed using the Sepsis-3 identification method since it is the latest and most conservative among the six methods being discussed [4]. The entire cohort of patients was split into training and testing sets to a proportion of 7:3. Sixteen machine learning configurations were built and trained to predict in-hospital mortality for the sepsis patients, that include Ridge classifier, perceptron, passive-aggressive classifier, k-nearest neighbors (kNN), random forest, support vector machine with linear kernel (linearSVC) and L1 or L2 regularization, support vector machine with linear kernel and L2 regularization, stochastic gradient descent (SGD) classifier with L1, L2, or elastic net regularization, multinomial naïve Bayes, Bernoulli naïve Bayes, logistic regression, support vector machine (SVM) with rbf, polynomial, or sigmoid kernel. Sequential organ failure assessment (SOFA) score [31] during the first 24 h of admission, systemic inflammatory response syndrome (SIRS) score [32] during the first 24 h of admission, and age were employed as features. Before training the machine learning configurations, each feature was scaled to 0 to 1 to avoid the impact of different magnitudes. Five-fold cross-validation was employed to find the optimal hyper-parameters for each machine learning configuration. The best-suited thresholds for each classifier were set according to Youden’s J statistics. The performances of the machine learning configurations were measured by the area under the receiver operating characteristic curve (AUC).

Statistical analysis for disparities in performances on sub-populations of social determinants

The training procedures were carried out on the entire training set, after which trained configurations and evaluation metrics on the entire cohort were saved. In the next step, we tested the performance on every sub-population of each of the five social determinants. To detect the disparities in performances, we compared the AUCs on the entire cohort with those on the subpopulations by permutation tests (1000 times). A one-tailed permutation test was employed to determine if the decrease or increase of the performance is significant statistically when testing on sub-groups of patients. To further illustrate the disparities, we conducted pairwise permutation tests (1000 times) among each pair of the sub-populations. A two-tailed permutation test was used to show if there are significant disparities in performances among each pair. The entire workflow can be found in Fig. 1.
Fig. 1

The workflow

The workflow The analysis was conducted using Python 3.6.8. Machine learning classifiers, cross-validation, and evaluation metrics were conducted using Sci-kit Learn 0.23.2.

Results

Forest plots for the disparities in social determinants across various sepsis criteria are shown in Fig. 2. Proportions of sepsis patients identified by different methods showed significant discrepancies, with the Sepsis-3 criteria as the most conservative one. Within the population identified by the same sepsis identification method, significant differences were observed among sub-populations regarding race, marital status, insurance type, and language. Numeric values of the proportions and 95% confidence interval can be found in the Table S1 in the Additional file 1.
Fig. 2

Forest plot for disparities in social determinants across various sepsis criteria. The proportions of sepsis patients of every sub-population identified by each sepsis criteria are shown as a point with a 95% confidence interval. Sepsis criteria are shown in different colors, while results for each subpopulation are shown in a row corresponding to the labels on the y-axis. Explicit: the explicit criteria; Angus: the Angus methodology; Martin: the Martine methodology; CMS: criteria presented by Centers for Medicare & Medicaid Services (CMS); CDC: the complete surveillance criteria presented by Center of Disease Control and Prevention (CDC); Sepsis-3: the Sepsis-3 criteria

Forest plot for disparities in social determinants across various sepsis criteria. The proportions of sepsis patients of every sub-population identified by each sepsis criteria are shown as a point with a 95% confidence interval. Sepsis criteria are shown in different colors, while results for each subpopulation are shown in a row corresponding to the labels on the y-axis. Explicit: the explicit criteria; Angus: the Angus methodology; Martin: the Martine methodology; CMS: criteria presented by Centers for Medicare & Medicaid Services (CMS); CDC: the complete surveillance criteria presented by Center of Disease Control and Prevention (CDC); Sepsis-3: the Sepsis-3 criteria In total, 5783 patients were identified as sepsis by the Sepsis-3 criteria. Statistics of this cohort of sepsis patients are shown in Table 1. The detailed testing performances on the entire testing set are shown in Table 2.
Table 1

Statistics of 5783 sepsis patients

Social DeterminantsCategoryn% sepsis populationIn-hospital mortality% in-hospital mortalityTrainingTesting
RaceAsian1793.102614.5312950
Black or African American5018.665210.38348153
Hispanic or Latino1883.25189.5713256
Other71412.3516523.11527187
White420172.6457513.6929121289
SexFemale256244.3038414.991798764
Male322155.7045214.032250971
Marital statusSeparated3986.885213.07287111
Significant other255944.2536314.191788771
Single163828.3217410.621157481
Unknown3325.7410230.7224884
Widowed85614.8014516.94568288
Insurance typeGovernment1662.87137.8311551
Medicaid5709.866711.75395175
Medicare335858.0756016.6823351023
Private163928.3418511.291168471
Self-pay500.861122.003515
LanguageEnglish516789.3572714.0736311536
Other4998.639418.84339160
Spanish1172.021512.827839

Categories of each social determinant are ranked alphabetically; n: number of sepsis patients in the category; % sepsis population: percentage of the number of sepsis patients in the category among the 5,783 sepsis patients; In-hospital mortality: number of patients in the category deceased in-hospital; % in-hospital mortality: percentage of patients in the category deceased in-hospital; Training: number of patients of the given category that were assigned to the training set during train-test split; Testing: number of patients of the given category that were assigned to the test set during train-test split

Table 2

Detailed performances on the entire testing set

AccuracyAUCPrecisionRecallF1_binaryF1_macroSpecificity
Ridge classifier0.67900.77740.26820.70520.38860.58550.6745
Perceptron0.67200.77860.26340.70520.38350.58010.6664
Passive-aggressive0.68410.75820.27330.71310.39510.59070.6792
kNN0.71350.72990.27800.61350.38260.59810.7305
Random forest0.75160.64590.28260.46610.35190.59910.7999
LinearSVC_L10.67490.77810.26540.70520.38560.58230.6698
LinearSVC_L20.67840.77770.26780.70520.38820.58500.6739
SGDClassifier_L10.67900.77590.26820.70520.38860.58550.6745
SGDClassifier_L20.67900.77490.26680.69720.38590.58430.6759
SGDClassifier_EN0.68010.77530.26830.70120.38810.58580.6765
MultinomialNB0.63920.70400.23480.66140.34660.54870.6354
BernoulliNB0.31070.57240.16650.94020.28300.30960.2042
Logistic regression0.68240.77610.27200.71310.39380.58930.6772
SVC_rbf0.68470.77440.27020.69320.38880.58820.6833
SVC_poly0.67490.77510.26540.70520.38560.58230.6698
SVC_sigmoid0.62770.68730.23490.69720.35140.54510.6159

F1 binary: F1 score for the positive class; F1_macro: macro-averaged F1 score; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Statistics of 5783 sepsis patients Categories of each social determinant are ranked alphabetically; n: number of sepsis patients in the category; % sepsis population: percentage of the number of sepsis patients in the category among the 5,783 sepsis patients; In-hospital mortality: number of patients in the category deceased in-hospital; % in-hospital mortality: percentage of patients in the category deceased in-hospital; Training: number of patients of the given category that were assigned to the training set during train-test split; Testing: number of patients of the given category that were assigned to the test set during train-test split Detailed performances on the entire testing set F1 binary: F1 score for the positive class; F1_macro: macro-averaged F1 score; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel We compared the performances (AUC) for each of the sixteen classifiers on the entire testing set and every sub-population by permutations tests. Significant results at a confidence level of 0.05 were found for race and languages. The observed differences and the corresponding p-values yielded from the permutation tests are shown in Tables 3 and 4. Among all the racial groups, we observed significant decreases in the performances of most of the classifiers for Asian and Hispanic patients (Table 3). Interestingly, significant performance drops were observed when applying the classifiers for the group of patients that speak Spanish (Table 4). We put the results of the social determinants associated with very few to no significant findings in the Table S2-S4 in the Additional file 1. For a further illustration of the disparities, we showed the pairwise comparison results in Tables 5 and 6. Among all the pairs of the racial groups, discrepancies were observed between Asian and White, as well as Asian and other races in most of the classifiers. Significant differences were also observed between Asian and Black sepsis patients in a few classifiers. The disparities between patients speaking various languages were majorly detected between the English-speaking patients and the Spanish-speaking patients. The pairwise comparison results with very few to no significant findings in the Table S5-S7 in the Additional file 1.
Table 3

Observed differences between the testing results and each race with p values from permutation tests

AsianBlack or African AmericanHispanic or LatinoOtherWhite
Observed differencep_valObserved differencep_valObserved differencep_valObserved differencep_valObserved differencep_val
Ridge classifier− 0.28120.009− 0.02410.366− 0.22080.0380.00110.5280.01750.286
Perceptron− 0.27480.0070.00780.448− 0.24530.025− 0.00260.5020.01580.312
Passive-aggressive− 0.31880.003− 0.00750.464− 0.17490.0830.01590.3810.01110.370
kNN− 0.13140.141− 0.06280.219− 0.18650.069− 0.01440.3720.02140.245
Random forest− 0.08340.207− 0.09390.046− 0.12260.1120.05360.1200.00560.429
LinearSVC_L1− 0.28190.012− 0.01720.410− 0.22470.0390.00030.4810.01730.285
LinearSVC_L2− 0.28150.009− 0.02210.385− 0.22110.0450.00050.4900.01750.294
SGDClassifier_L1− 0.28720.008− 0.00410.482− 0.21590.0440.00360.4780.01840.266
SGDClassifier_L2− 0.29000.008− 0.00870.455− 0.21820.0390.00440.4690.01910.263
SGDClassifier_EN− 0.29050.010− 0.00460.461− 0.21860.0580.00500.4970.01810.300
MultinomialNB− 0.27970.0100.06710.182− 0.23730.0330.00510.4840.00510.416
BernoulliNB− 0.19740.025− 0.00340.4830.04760.3310.01730.3680.00120.490
Logistic regression− 0.28750.010− 0.02570.377− 0.20610.0540.00430.4950.01740.273
SVC_rbf− 0.30850.0050.00420.480− 0.23110.031− 0.01760.3830.01750.275
SVC_poly− 0.29780.0060.00270.483− 0.27510.0170.00870.4310.01540.287
SVC_sigmoid− 0.13430.144− 0.09410.083− 0.06060.332− 0.00990.4550.02080.247

Observe difference: observed difference in AUC when compared with the performance on the entire testing set; p_val: p value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Table 4

Observed differences between the testing results and each language with p values from permutation tests

EnglishOtherSpanish
Observed differencep_valObserved differencep_valObserved differencep_val
Ridge classifier0.01540.279− 0.07600.107− 0.34220.012
Perceptron0.01820.252− 0.09160.053− 0.35510.004
Passive-aggressive0.01220.301− 0.05550.172− 0.22880.053
kNN0.01660.263− 0.07680.102− 0.30630.017
Random forest0.00370.409− 0.00570.489− 0.23420.002
LinearSVC_L10.01600.299− 0.07720.102− 0.34280.003
LinearSVC_L20.01560.297− 0.07630.121− 0.34240.007
SGDClassifier_L10.01840.246− 0.07830.093− 0.33470.008
SGDClassifier_L20.01870.269− 0.07520.107− 0.33960.004
SGDClassifier_EN0.01810.259− 0.07600.105− 0.32830.006
MultinomialNB0.02210.224− 0.12100.021− 0.27460.031
BernoulliNB0.00760.389− 0.06210.0820.03060.422
Logistic regression0.01450.293− 0.07030.125− 0.31730.014
SVC_rbf0.01590.306− 0.08250.080− 0.33320.012
SVC_poly0.01760.275− 0.08600.079− 0.36330.002
SVC_sigmoid− 0.00300.4540.03410.288− 0.18140.089

Observe difference: observed difference in AUC when compared with the performance on the entire testing set; p_val: p-value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Table 5

Pairwise comparisons among different racial groups

Asian v.s. Black or African AmericanAsian v.s. Hispanic or LatinoAsian v.s. OtherAsian v.s. WhiteBlack or African American v.s. Hispanic or Latino
Observed differencep_valObserved differencep_valObserved differencep_valObserved differencep_valObserved differencep_val
Ridge classifier0.25720.0740.06050.7380.28240.0330.29880.018− 0.19670.189
Perceptron0.28270.0420.02950.8830.27220.0510.29060.021− 0.25310.081
Passive-aggressive0.31140.0450.14390.4320.33480.0180.32990.008− 0.16740.238
kNN0.06860.647− 0.05520.7630.11700.3800.15280.224− 0.12380.413
Random forest− 0.01040.916− 0.03920.7150.13700.2110.08900.372− 0.02870.781
LinearSVC_L10.26470.0750.05710.7560.28220.0430.29910.020− 0.20760.156
LinearSVC_L20.25940.0840.06050.7520.28200.0420.29900.019− 0.19900.179
SGDClassifier_L10.28320.0520.07140.6680.29080.0360.30570.022− 0.21180.136
SGDClassifier_L20.28130.0500.07180.6920.29440.0190.30910.015− 0.20950.151
SGDClassifier_EN0.28580.0580.07180.7060.29540.0350.30860.015− 0.21400.142
MultinomialNB0.34680.0130.04240.8000.28480.0350.28480.029− 0.30440.032
BernoulliNB0.19400.0430.24500.0680.21470.0150.19860.0210.05100.609
Logistic regression0.26170.0820.08140.6200.29180.0370.30490.019− 0.18040.198
SVC_rbf0.31270.0250.07740.6530.29090.0300.32590.013− 0.23520.093
SVC_poly0.30050.0240.02270.8890.30660.0250.31320.016− 0.27780.056
SVC_sigmoid0.04020.7800.07360.6660.12440.3750.15510.2350.03340.796

Observe difference: observed difference in AUC when comparing the performance between the sub-populations; p_val: p-value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Table 6

Pairwise comparisons among different language groups

English v.s. OtherEnglish v.s. SpanishOther v.s. Spanish
Observed differencep_valObserved differencep_valObserved differencep_val
Ridge classifier− 0.09150.135− 0.35760.008− 0.26610.081
Perceptron− 0.10980.070− 0.37330.005− 0.26350.095
Passive-aggressive− 0.06770.266− 0.24100.087− 0.17330.281
kNN− 0.09340.159− 0.32300.021− 0.22950.121
Random forest− 0.00940.833− 0.23790.023− 0.22850.064
LinearSVC_L1− 0.09310.132− 0.35870.007− 0.26560.097
LinearSVC_L2− 0.09190.135− 0.35800.008− 0.26610.095
SGDClassifier_L1− 0.09670.113− 0.35310.015− 0.25640.097
SGDClassifier_L2− 0.09390.143− 0.35830.009− 0.26430.078
SGDClassifier_EN− 0.09400.136− 0.34630.009− 0.25230.093
MultinomialNB− 0.14320.017− 0.29670.034− 0.15350.295
BernoulliNB− 0.06970.0910.02300.8180.09270.397
Logistic regression− 0.08490.174− 0.33180.025− 0.24690.093
SVC_rbf− 0.09840.112− 0.34910.013− 0.25070.104
SVC_poly− 0.10350.100− 0.38090.009− 0.27730.072
SVC_sigmoid0.03720.544− 0.17840.203− 0.21550.151

Observe difference: observed difference in AUC when comparing the performance between the sub-populations; p_val: p-value, p-values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Observed differences between the testing results and each race with p values from permutation tests Observe difference: observed difference in AUC when compared with the performance on the entire testing set; p_val: p value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel Observed differences between the testing results and each language with p values from permutation tests Observe difference: observed difference in AUC when compared with the performance on the entire testing set; p_val: p-value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel Pairwise comparisons among different racial groups Observe difference: observed difference in AUC when comparing the performance between the sub-populations; p_val: p-value, p values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel Pairwise comparisons among different language groups Observe difference: observed difference in AUC when comparing the performance between the sub-populations; p_val: p-value, p-values less than or equal to 0.05 were highlighted; Passive-aggressive: passive-aggressive classifier; kNN: k-Nearest Neighbors; LinearSVC_L1 or _L2: support vector machine with linear kernel coupled with L1 or L2 regularization; SGDClassifier_L1 or _L2 or _EN: stochastic gradient descent with L1 or L2 or Elastic Net regularization; MultinomialNB: Multinomial naïve Bayes; BernoulliNB: Bernoulli naïve Bayes; SVC_rbf or _poly or _sigmoid: support vector machine with rbf kernel or polynomial kernel or sigmoid kernel

Discussion

Currently, the “gold standard” for sepsis diagnosis is still absent. Among those available criteria, we observed different sensitivities in identifying patients. Meanwhile, we observed disparities in the proportions of population identified by each criteria across various social determinant groups. This brings us the concern that a universal diagnostic system might not work equally on each sub-population. By systematically examining the discrepancies, we hope to provide evidence for a more versatile detection system that takes the disparities in social determinants into consideration. In this study, we have excluded patients with missing data and performed complete case analysis. In future study, we plan to apply advanced missing data imputation techniques [33-35] to relax this exclusion criteria and investigate the potential links between missing data and social determinants of health. The discrepancies among subpopulations of social determinants groups hinder the performance of a machine learning model trained on the entire population. In a previous study, racial disparities [36] and region disparities [37] in sepsis-related mortality were revealed by retrospective studies. Prediction of mortality using machine learning has been well-discussed during recent years. However, more effort was devoted to improving the overall performances on the entire given population. While what was being less discussed was the fairness of applying trained machine learning algorithms on various groups of patients. It is by nature that patients are of various social status and it is essential not to underestimate such discrepancies. In this current study, we tested the performance fluctuations when applying the same trained model on patients from each social determinant groups and revealed statistically significant shifts in the performance. Even though the overall performance of a given classifier is descent, it should be kept in mind that there are still sub-populations not benefitting from the model as others. On the one hand, we hope such evidence provides a perspective on the impacts of social determinants for not only the medical society that is working diligently towards a fairer diagnostic method but also the artificial intelligence researchers trying to improve the predictive algorithms one more step towards clinically ready. Additionally, in future studies, we would take the interaction between features into consideration for a more thorough perspective.

Conclusions

Disparities in social determinants were observed in the groups of sepsis patients identified by various currently available diagnostic criteria. The performance of risk prediction tasks for sepsis patients can be compromised when applying a universally trained model for each sub-population. To achieve more accurate identification, a more versatile diagnostic system for sepsis is in need to overcome the social determinant disparities of patients. Additional file 1. Supplementary tables for proportions of subpopulations, observed differences for sex, marital status, and insurance type groups, and pairwise comparisons among sex, marital status, and insurance type groups.
  30 in total

Review 1.  Management of sepsis.

Authors:  James A Russell
Journal:  N Engl J Med       Date:  2006-10-19       Impact factor: 91.245

2.  Disparities in Sepsis Mortality by Region, Urbanization, and Race in the USA: a Multiple Cause of Death Analysis.

Authors:  Funmilola Ogundipe; Vijay Kodadhala; Temitayo Ogundipe; Alem Mehari; Richard Gillum
Journal:  J Racial Ethn Health Disparities       Date:  2019-01-03

3.  Migrant and minority family members in the intensive care unit. A review of the literature.

Authors:  KettyElena Quindemil; Martin Nagl-Cupal; Kathryn Hoehn Anderson; Hanna Mayer
Journal:  HeilberufeScience       Date:  2013-11

4.  Gender disparities in health care.

Authors:  Jennifer A Kent; Vinisha Patel; Natalie A Varela
Journal:  Mt Sinai J Med       Date:  2012 Sep-Oct

5.  The epidemiology of sepsis in the United States from 1979 through 2000.

Authors:  Greg S Martin; David M Mannino; Stephanie Eaton; Marc Moss
Journal:  N Engl J Med       Date:  2003-04-17       Impact factor: 91.245

6.  Simulation of Ventilator Allocation in Critically Ill Patients with COVID-19.

Authors:  Sivasubramanium V Bhavani; Yuan Luo; William D Miller; Lazaro N Sanchez-Pinto; Xuan Han; Chengsheng Mao; Burhaneddin Sandıkçı; Monica E Peek; Craig M Coopersmith; Kelly N Michelson; William F Parker
Journal:  Am J Respir Crit Care Med       Date:  2021-11-15       Impact factor: 30.528

7.  The natural history of the systemic inflammatory response syndrome (SIRS). A prospective study.

Authors:  M S Rangel-Frausto; D Pittet; M Costigan; T Hwang; C S Davis; R P Wenzel
Journal:  JAMA       Date:  1995-01-11       Impact factor: 56.272

8.  Racial and ethnic differences in end-of-life care in the United States: Evidence from the Health and Retirement Study (HRS).

Authors:  Martina Orlovic; Katharine Smith; Elias Mossialos
Journal:  SSM Popul Health       Date:  2018-11-30

9.  On the ethics of algorithmic decision-making in healthcare.

Authors:  Thomas Grote; Philipp Berens
Journal:  J Med Ethics       Date:  2019-11-20       Impact factor: 2.903

10.  Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU.

Authors:  Guilan Kong; Ke Lin; Yonghua Hu
Journal:  BMC Med Inform Decis Mak       Date:  2020-10-02       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.