| Literature DB >> 31651956 |
Praveen Kumar1,2, Anastasiya Nestsiarovich1, Stuart J Nelson3, Berit Kerner4, Douglas J Perkins1, Christophe G Lambert1,2,5.
Abstract
OBJECTIVE: We aimed to impute uncoded self-harm in administrative claims data of individuals with major mental illness (MMI), characterize self-harm incidence, and identify factors associated with coding bias.Entities:
Keywords: coding; electronic health records; machine learning; self-harm; suicide
Mesh:
Year: 2020 PMID: 31651956 PMCID: PMC7647246 DOI: 10.1093/jamia/ocz173
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Study schema. The target dataset for phenotype imputation and self-harm incidence estimation was the full dataset. Machine learning (ML) approaches were first explored using the balanced dataset, with which we assessed the ability to recover deliberately mislabeled meta-visits, compared the performance of 5 ML algorithms, and explored the importance of covariate classes. The XGboost ML method was chosen for all subsequent modeling. The Full-data-model was used to characterize self-harm incidence, but several additional models were used to validate our approach and derive additional insight. The Mislabeled-full-data-model assessed whether deliberately misclassified meta-visits could be recovered with class imbalance. The Per-person model ensured prediction performance was not skewed by within-individual information leakage. The validation model verified that prediction performance was not explained by overfitting. The gold standard comparison contrasted ML classification with that of clinical experts. The XGboost model was also used to identify the most important meta-visit classification covariates, as well as features associated with uncoded self-harm via building the Coding-bias-model. ER: emergency room; MMI: major mental illness; SVM: support vector machine.
Classification performance of different XGboost-based classification models on different sets of meta-visits in patients with major mental illness
| XGboost model | Validation method | Dataset | Accuracy | MCC | AUC-ROC |
|---|---|---|---|---|---|
| Full-data-model | 5-fold cross-validation repeated 10 times | Full dataset with ∼20 million meta-visits | 0.960 ± 4 × 10−3 | 0.297 ± 2 × 10−4 | 0.990 ± 4 × 10−4 |
| Per-person-model | 5-fold cross-validation | Full dataset subset of ∼6 million meta-visits with 1 random meta-visit per person | 0.966 | 0.334 | 0.991 |
| Validation-model | 5-fold cross-validation on the training set | 70% random meta-visits from the full dataset | 0.964 | 0.298 | 0.991 |
| Testing on the validation set | Remaining 30% of meta-visits from the full dataset | 0.963 | 0.296 | 0.990 | |
| Balanced-data-model | 5-fold cross-validation repeated 100 times | Balanced dataset with 166 000 meta-visits | 0.964 ± 2 × 10-4 | 0.928 ± 4 × 10-4 | 0.991 ± 4 × 10-4 |
| Mislabeled-data-model | 5-fold cross-validation. Original labels of meta-visits were used for assessing performance | Half of the class 1 meta-visits mislabeled in the balanced dataset | 0.962 | 0.924 | 0.989 |
| Half of the class 0 meta-visits mislabeled in the balanced dataset | 0.963 | 0.926 | 0.991 | ||
| Mislabeled-full-data-model | Half of the class 1 meta-visits mislabeled in the full dataset | 0.974 | 0.347 | 0.991 | |
| Coding-bias-model | 5-fold cross-validation | All meta-visits from the full dataset with class 1. Probability threshold ≥0.95 | 0.679 | 0.306 | 0.738 |
| Full-factorial-models | Balanced dataset; only condition covariates | 0.957 | 0.914 | 0.988 | |
| Balanced dataset; only hand-curated covariates | 0.927 | 0.853 | 0.977 | ||
| Balanced dataset; only billing code position covariates. | 0.788 | 0.577 | 0.875 | ||
| Balanced dataset; only observation covariates | 0.775 | 0.562 | 0.813 | ||
| Balanced dataset; only procedure covariates | 0.708 | 0.440 | 0.800 | ||
| Balanced dataset; only measurement covariates | 0.589 | 0.245 | 0.594 | ||
| Balanced dataset; only drug covariates | 0.550 | 0.192 | 0.586 | ||
| Balanced dataset; only device covariates | 0.516 | 0.099 | 0.514 |
The results for the Full-data-model and the Balanced-data-model are shown with 80% and 90% confidence intervals, respectively. AUC-ROC: area under the receiver-operating characteristic curve; MCC: Matthews correlation coefficient.
Classification performance of 5 different machine learning algorithms on the balanced dataset of patients with major mental illness, using 5-fold-cross-validation with 100 repetitions and reported with 90% confidence intervals
| Machine learning model/performance | XGboost balanced-data-model with optimized parameters | XGboost balanced-data-model with default parameters | Logistic regression | Random forest | Decision tree | LinearSVC |
|---|---|---|---|---|---|---|
| Accuracy | 0.964 ± 2 × 10-4 | 0.961 ± 2 × 10-4 | 0.963 ± 3 × 10-4 | 0.946 ± 1 × 10-3 | 0.947 ± 7 × 10-4 | 0.959 ± 3 × 10-4 |
| MCC | 0.928 ± 4 × 10-4 | 0.922 ± 4 × 10-4 | 0.926 ± 6 × 10-4 | 0.892 ± 3 × 10-3 | 0.896 ± 1 × 10-3 | 0.919 ± 7 × 10-4 |
| AUC-ROC | 0.991 ± 4 × 10-4 | 0.990 ± 2 × 10-4 | 0.990 ± 1 × 10-4 | 0.982 ± 6 × 10-4 | 0.948 ± 7 × 10-4 | 0.988 ± 1 × 10-4 |
Optimized parameters for XGboost model: max_depth = 6, base_score = 0.5, gamma = 0, max_delta_step = 0, min_child_weight = 2, objective = ‘binary: logistic’, booster = ‘gbtree’, subsample = 0.6, scale_pos_weight = total negative class/total positive class, colsample_bytree = 1, colsample_bylevel = 0.8, learning_rate = 0.04
The pairwise agreement between the XGboost Full-data model consensus gold standard and 3 clinicians regarding the presence of self-harm (with >0.5 probability) in 200 selected meta-visits of patients with major mental illness
| Classifier | Full-data-model | Clinician 1 | Clinician 2 | Clinician 3 | Gold standard |
|---|---|---|---|---|---|
| 200 randomly selected meta-visits | |||||
| Full-data-model | 1.00 | 0.81 | 0.80 | 0.78 | 0.84 |
| Clinician 1 | 0.81 | 1.00 | 0.77 | 0.88 | 0.88 |
| Clinician 2 | 0.80 | 0.77 | 1.00 | 0.76 | 0.86 |
| Clinician 3 | 0.79 | 0.88 | 0.76 | 1.00 | 0.87 |
| Gold standard | 0.84 | 0.88 | 0.86 | 0.87 | 1.00 |
| 50 meta-visits where self-harm was neither coded nor imputed | |||||
| Full-data-model | 1.00 | 0.98 | 1.00 | 0.96 | 1.00 |
| Clinician 1 | 0.98 | 1.00 | 0.98 | 0.98 | 0.98 |
| Clinician 2 | 1.00 | 0.98 | 1.00 | 0.96 | 1.00 |
| Clinician 3 | 0.96 | 0.98 | 0.96 | 1.00 | 0.96 |
| Gold standard | 1.00 | 0.98 | 1.00 | 0.96 | 1.00 |
| 50 meta-visits where self-harm was not coded but imputed | |||||
| Full-data-model | 1.00 | 0.54 | 0.68 | 0.60 | 0.54 |
| Clinician 1 | 0.54 | 1.00 | 0.70 | 0.78 | 0.76 |
| Clinician 2 | 0.68 | 0.70 | 1.00 | 0.76 | 0.82 |
| Clinician 3 | 0.60 | 0.78 | 0.76 | 1.00 | 0.90 |
| Gold standard | 0.54 | 0.76 | 0.82 | 0.90 | 1.00 |
| 50 meta-visits where self-harm was coded but not imputed | |||||
| Full-data-model | 1.00 | 0.74 | 0.64 | 0.66 | 0.88 |
| Clinician 1 | 0.74 | 1.00 | 0.54 | 0.84 | 0.82 |
| Clinician 2 | 0.64 | 0.54 | 1.00 | 0.50 | 0.68 |
| Clinician 3 | 0.66 | 0.84 | 0.50 | 1.00 | 0.74 |
| Gold Standard | 0.88 | 0.82 | 0.68 | 0.74 | 1.00 |
| 50 meta-visits where self-harm was both coded and imputed | |||||
| Full-data-model | 1.00 | 0.98 | 0.88 | 0.88 | 0.92 |
| Clinician 1 | 0.98 | 1.00 | 0.86 | 0.90 | 0.94 |
| Clinician 2 | 0.88 | 0.86 | 1.00 | 0.80 | 0.92 |
| Clinician 3 | 0.88 | 0.90 | 0.80 | 1.00 | 0.88 |
| Gold standard | 0.92 | 0.94 | 0.92 | 0.88 | 1.00 |
Covariates from the Full-data-model contributing most to XGboost meta-visit classification for self-harm presence
| OMOP concept ID | SNOMED concept ID | Covariate | Relative gain | Relative cover | Relative weight |
|---|---|---|---|---|---|
| 442562 | 75478009 | Poisoning | 0.3200 | 0.0273 | 0.0103 |
| 444100 | 46206005 | Mood disorder | 0.0359 | 0.0109 | 0.0142 |
| 440921 | 417746004 | Traumatic injury | 0.0226 | 0.0060 | 0.0026 |
| — | — | External injury | 0.0158 | 0.0062 | 0.0314 |
| 432586 | 74732009 | Mental disorder | 0.0139 | 0.0089 | 0.0163 |
| 73553 | 399269003 | Arthropathy | 0.0135 | 0.0041 | 0.0014 |
| 4168335 | 416462003 | Wound | 0.0099 | 0.0093 | 0.0047 |
| 438028 | 7895008 | Poisoning by drug and/or medicinal substance | 0.0082 | 0.0045 | 0.0116 |
| 4108646 | 283057008 | Abrasion of upper limb | 0.0079 | 0.0154 | 0.0005 |
| 444187 | 125643001 | Open wound | 0.0067 | 0.0051 | 0.0075 |
| 4130851 | 127278005 | Injury of upper extremity | 0.0063 | 0.0084 | 0.0059 |
| 4219871 | 399963005 | Abrasion | 0.0055 | 0.0014 | 0.0003 |
| — | — | Psychiatric diagnosis | 0.0048 | 0.0044 | 0.0012 |
| 4306645 | 83507006 | Finding of thought content | 0.0046 | 0.0014 | 0.0047 |
| 4111213 | 285261008 | Dangerous and harmful thoughts | 0.0042 | 0.0045 | 0.0075 |
The covariates are sorted by relative gain, which reflects the magnitude of covariate contribution to predicting the class of the meta-visit relative to other features (relative gain = gain of the covariate / ∑gain of all covariates). The weight indicates how many times the covariate was used to split the data across all trees in the model (relative weight = weight of the covariate / ∑weight of all covariates). The cover indicates the average number of observations in which the covariate was used to split the data across all trees in the model (relative cover = cover of the covariate ÷ ∑cover of all covariates).
OMOP: Observational Medical Outcomes Partnership; SNOMED: Systematized Nomenclature of Medicine.
Figure 2.Self-harm meta-visits in patients with major mental illness of different sex per year. The left graph shows the annual percentage incidence of coded self-harm for male individuals (blue line) and female individuals (orange line); the right graph shows the annual percentage incidence of imputed self-harm for male individuals (blue line) and female individuals (orange line).
Figure 3.Self-harm meta-visits in patients with major mental illness of different age and sex. The left graph shows the annual percentage incidence of coded self-harm in male individuals (blue line) and female individuals (orange line). The right graph shows the annual percentage incidence of machine learning-imputed self-harm by sex.
Figure 4.The annual incidence of meta-visits with machine learning-imputed self-harm by category of major mental illness. The left graph shows the data for male individuals and the right graph for female individuals.
Figure 5.Meta-visits with self-harm in patients with major mental illness residing in different states and territories of the United States. The blue plots show the annual percentage incidence of meta-visits with imputed self-harm (blue dots) and coded self-harm (blue hatches). The red bars show the fraction of coded self-harm events among the imputed ones. Due to MarketScan license restrictions, data for South Carolina were excluded from the figure. OT: others (includes DC, Puerto Rico and other U.S. territories).