| Literature DB >> 35317793 |
Stephen van Gaal1, Arshia Alimohammadi2, Amy Y X Yu3,4, Mohammad Ehsanul Karim5, Wei Zhang5, Jason M Sutherland6.
Abstract
BACKGROUND ANDEntities:
Keywords: Administrative data; Carotid endarterectomy; Machine learning; Statistical model
Mesh:
Year: 2022 PMID: 35317793 PMCID: PMC8941812 DOI: 10.1186/s12913-022-07614-1
Source DB: PubMed Journal: BMC Health Serv Res ISSN: 1472-6963 Impact factor: 2.655
Diagnosis cluster definitions for ICD-9 and ICD-10-CA
| Diagnosis cluster | ICD-10-CA (5-character code) | ICD-9 (3-digit code) |
|---|---|---|
| Ischemic stroke | I63.X (except I63.6, venous thrombosis), I64 (stroke not specified as hemorrhage or infarction), I66.0–2,4,8–9 (cerebral arterial stenosis or occlusion without infarction; included even though these codes specifically exclude infarction) | 434 (some codes exclude infarction, but these are rarely used in BC), 436 (acute cerebrovascular disease), 438 (late effects of cerebrovascular disease) |
| TIA | G45.X (except G45.3, amaurosis fugax) | 435 |
| Retinal | G45.3, H34.X (retinal vascular occlusion), H35.82 (retinal ischemia), H53.1 (transient visual loss) | 362 (retinal disorders), 368 (visual disturbances) |
| Cerebral | G81.0 & G81.9 (hemiparesis), H53.4 & H53.9 (visual non-retinal), R29.5 (transient paralysis of limb), R29.8 (other neurological symptoms), R47.0, R47.1 & R7.8 (aphasia & dysarthria) | 781 (neurological symptoms), 341 (demyelinating symptom, but frequently used in our sample). Omit 784 (embeds speech in context of ‘head and neck’) |
| Stenosis (asymptomatic) | I65.2, I65.3, I65.9 (stenosis of carotid or multiple arteries) | 433 (some codes include infarction, but these are rarely used in BC) |
Characteristics of included participants by train and test population, Vancouver, Canada, 2008–2016. P-values reported for t-test (age) and Fisher exact test (all other variables)
| Train | Test | ||
|---|---|---|---|
| N | 615 | 114 | |
| Data years | 2008–2015 | 2016 | |
| Age – mean (SD) | 72.1 (9.1) | 71.3 (9.7) | 0.4 |
| Female – N (%) | 207 (33.7%) | 33 (28.9%) | 0.4 |
| In metro – N (%) | 482 (78.4%) | 89 (78.1%) | 1 |
| Symptom type – N (%) | 0.02 | ||
| Asymptomatic | 189 (30.7%) | 29 (25.4%) | |
| Retinal | 111 (18.0%) | 22 (19.3%) | |
| TIA | 198 (32.2%) | 27 (23.7%) | |
| Stroke | 117 (19.0%) | 36 (31.6%) |
Fig. 1Model performance assessed by sensitivity at 98.6% specificity (panel A) and area under the receiver operating characteristic curve (panel B). The diagnosis-only rule-based method involves no parameter tuning, so it is reported for all participants. All other models are analysed by train and test population. Training results are calculated using cross-fold hold-outs and are plotted with mean and 95% confidence interval for standard error. Testing results are calculated using n = 2000 bootstrapped samples and are plotted by empiric 95% confidence interval. Area under the curve is not calculable for the rule-based diagnosis-only method
Fig. 2Model performance in the test set by receiver operator characteristic curve (panel A), observed vs. predicted probability calibration curve (panel B). In panel A, a grey bar is positioned at specificity 98–100%, highlighting the limited sensitivity of LogisticDX at this margin
Fig. 3Standardized logistic regression model coefficients and random forest permutation importance for each model. Negative coefficients are shaded in light grey. Variables are sorted by importance in the first model in which they appear