Rohit B Sangal1, Samah Fodeh1, Andrew Taylor1, Craig Rothenberg1, Emily B Finn2, Kevin Sheth3, Charles Matouk4, Andrew Ulrich1, Vivek Parwani1, John Sather1, Arjun Venkatesh5. 1. Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States. 2. Department of Internal Medicine, United States; Yale University School of Medicine, New Haven CT, United States. 3. Department of Neurology, United States; Yale University School of Medicine, New Haven CT, United States. 4. Department of Neurosurgery, United States. 5. Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States. Electronic address: arjun.venkatesh@yale.edu.
Abstract
INTRODUCTION: Nontraumatic intracranial hemorrhage (ICH) is a neurological emergency of research interest; however, unlike ischemic stroke, has not been well studied in large datasets due to the lack of an established administrative claims-based definition. We aimed to evaluate both explicit diagnosis codes and machine learning methods to create a claims-based definition for this clinical phenotype. METHODS: We examined all patients admitted to our tertiary medical center with a primary or secondary International Classification of Disease version 9 (ICD-9) or 10 (ICD-10) code for ICH in claims from any portion of the hospitalization in 2014-2015. As a gold standard, we defined the nontraumatic ICH phenotype based on manual chart review. We tested explicit definitions based on ICD-9 and ICD-10 that had been previously published in the literature as well as four machine learning classifiers including support vector machine (SVM), logistic regression with LASSO, random forest and xgboost. We report five standard measures of model performance for each approach. RESULTS: A total of 1830 patients with 2145 unique ICD-10 codes were included in the initial dataset, of which 437 (24%) were true positive based on manual review. The explicit ICD-10 definition performed best (Sensitivity = 0.89 (95% CI 0.85-0.92), Specificity = 0.83 (0.81-0.85), F-score = 0.73 (0.69-0.77)) and improves on an explicit ICD-9 definition (Sensitivity = 0.87 (0.83-0.90), Specificity = 0.77 (0.74-0.79), F-score = 0.67 (0.63-0.71). Among machine learning classifiers, SVM performed best (Sensitivity = 0.78 (0.75-0.82), Specificity = 0.84 (0.81-0.87), AUC = 0.89 (0.87-0.92), F-score = 0.66 (0.62-0.69)). CONCLUSIONS: An explicit ICD-10 definition can be used to accurately identify patients with a nontraumatic ICH phenotype with substantially better performance than ICD-9. An explicit ICD-10 based definition is easier to implement and quantitatively not appreciably improved with the additional application of machine learning classifiers. Future research utilizing large datasets should utilize this definition to address important research gaps.
INTRODUCTION:Nontraumatic intracranial hemorrhage (ICH) is a neurological emergency of research interest; however, unlike ischemic stroke, has not been well studied in large datasets due to the lack of an established administrative claims-based definition. We aimed to evaluate both explicit diagnosis codes and machine learning methods to create a claims-based definition for this clinical phenotype. METHODS: We examined all patients admitted to our tertiary medical center with a primary or secondary International Classification of Disease version 9 (ICD-9) or 10 (ICD-10) code for ICH in claims from any portion of the hospitalization in 2014-2015. As a gold standard, we defined the nontraumatic ICH phenotype based on manual chart review. We tested explicit definitions based on ICD-9 and ICD-10 that had been previously published in the literature as well as four machine learning classifiers including support vector machine (SVM), logistic regression with LASSO, random forest and xgboost. We report five standard measures of model performance for each approach. RESULTS: A total of 1830 patients with 2145 unique ICD-10 codes were included in the initial dataset, of which 437 (24%) were true positive based on manual review. The explicit ICD-10 definition performed best (Sensitivity = 0.89 (95% CI 0.85-0.92), Specificity = 0.83 (0.81-0.85), F-score = 0.73 (0.69-0.77)) and improves on an explicit ICD-9 definition (Sensitivity = 0.87 (0.83-0.90), Specificity = 0.77 (0.74-0.79), F-score = 0.67 (0.63-0.71). Among machine learning classifiers, SVM performed best (Sensitivity = 0.78 (0.75-0.82), Specificity = 0.84 (0.81-0.87), AUC = 0.89 (0.87-0.92), F-score = 0.66 (0.62-0.69)). CONCLUSIONS: An explicit ICD-10 definition can be used to accurately identify patients with a nontraumatic ICH phenotype with substantially better performance than ICD-9. An explicit ICD-10 based definition is easier to implement and quantitatively not appreciably improved with the additional application of machine learning classifiers. Future research utilizing large datasets should utilize this definition to address important research gaps.
Authors: Christianne L Roumie; Edward Mitchel; Patricia S Gideon; Cristina Varas-Lorenzo; Jordi Castellsague; Marie R Griffin Journal: Pharmacoepidemiol Drug Saf Date: 2008-01 Impact factor: 2.890
Authors: Tasnim F Imran; Daniel Posner; Jacqueline Honerlaw; Jason L Vassy; Rebecca J Song; Yuk-Lam Ho; Steven J Kittner; Katherine P Liao; Tianxi Cai; Christopher J O'Donnell; Luc Djousse; David R Gagnon; J Michael Gaziano; Peter Wf Wilson; Kelly Cho Journal: Clin Epidemiol Date: 2018-10-16 Impact factor: 4.790
Authors: Linxin Li; Lucy E Binney; Samantha Carter; Sergei A Gutnikov; Sally Beebe; Karen Bowsher-Brown; Louise E Silver; Peter M Rothwell Journal: J Am Heart Assoc Date: 2019-07-03 Impact factor: 5.501