Sheng-Feng Sung1, Cheng-Yang Hsieh2, Yea-Huei Kao Yang3, Huey-Juan Lin4, Chih-Hung Chen5, Yu-Wei Chen6, Ya-Han Hu7. 1. Division of Neurology, Department of Internal Medicine, Ditmanson Medical Foundation Chia-Yi Christian Hospital, No.539, Zhongxiao Rd., East Dist., Chiayi City 60002, Taiwan. 2. Department of Neurology, Tainan Sin Lau Hospital, No.57, Sec.1, Dongmen Rd., East Dist., Tainan City 70142, Taiwan; Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, No.1, University Road, Tainan City 701, Taiwan. 3. Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, No.1, University Road, Tainan City 701, Taiwan. 4. Department of Neurology, Chi Mei Medical Center, No.901, Zhonghua Rd., Yongkang Dist., Tainan City 710, Taiwan. 5. Department of Neurology, College of Medicine, National Cheng Kung University, No.1, University Road, Tainan City 701, Taiwan. 6. Department of Neurology, Landseed Hospital, No.77, Kwang-Tai Rd., Ping-Jen City, Tao-Yuan County 32449, Taiwan; Department of Neurology, National Taiwan University Hospital, No.1, Changde St., Zhongzheng Dist., Taipei City 10048, Taiwan. 7. Department of Information Management and Institute of Healthcare Information Management, National Chung Cheng University, No.168 University Road, Min-Hsiung, Chiayi County 62102, Taiwan. Electronic address: yahan.hu@mis.ccu.edu.tw.
Abstract
OBJECTIVES: Case-mix adjustment is difficult for stroke outcome studies using administrative data. However, relevant prescription, laboratory, procedure, and service claims might be surrogates for stroke severity. This study proposes a method for developing a stroke severity index (SSI) by using administrative data. STUDY DESIGN AND SETTING: We identified 3,577 patients with acute ischemic stroke from a hospital-based registry and analyzed claims data with plenty of features. Stroke severity was measured using the National Institutes of Health Stroke Scale (NIHSS). We used two data mining methods and conventional multiple linear regression (MLR) to develop prediction models, comparing the model performance according to the Pearson correlation coefficient between the SSI and the NIHSS. We validated these models in four independent cohorts by using hospital-based registry data linked to a nationwide administrative database. RESULTS: We identified seven predictive features and developed three models. The k-nearest neighbor model (correlation coefficient, 0.743; 95% confidence interval: 0.737, 0.749) performed slightly better than the MLR model (0.742; 0.736, 0.747), followed by the regression tree model (0.737; 0.731, 0.742). In the validation cohorts, the correlation coefficients were between 0.677 and 0.725 for all three models. CONCLUSION: The claims-based SSI enables adjusting for disease severity in stroke studies using administrative data.
OBJECTIVES: Case-mix adjustment is difficult for stroke outcome studies using administrative data. However, relevant prescription, laboratory, procedure, and service claims might be surrogates for stroke severity. This study proposes a method for developing a stroke severity index (SSI) by using administrative data. STUDY DESIGN AND SETTING: We identified 3,577 patients with acute ischemic stroke from a hospital-based registry and analyzed claims data with plenty of features. Stroke severity was measured using the National Institutes of Health Stroke Scale (NIHSS). We used two data mining methods and conventional multiple linear regression (MLR) to develop prediction models, comparing the model performance according to the Pearson correlation coefficient between the SSI and the NIHSS. We validated these models in four independent cohorts by using hospital-based registry data linked to a nationwide administrative database. RESULTS: We identified seven predictive features and developed three models. The k-nearest neighbor model (correlation coefficient, 0.743; 95% confidence interval: 0.737, 0.749) performed slightly better than the MLR model (0.742; 0.736, 0.747), followed by the regression tree model (0.737; 0.731, 0.742). In the validation cohorts, the correlation coefficients were between 0.677 and 0.725 for all three models. CONCLUSION: The claims-based SSI enables adjusting for disease severity in stroke studies using administrative data.