| Literature DB >> 33944791 |
Amy Y X Yu1, Zhongyu A Liu1, Chloe Pou-Prom2, Kaitlyn Lopes1, Moira K Kapral3, Richard I Aviv4, Muhammad Mamdani5.
Abstract
BACKGROUND: Diagnostic neurovascular imaging data are important in stroke research, but obtaining these data typically requires laborious manual chart reviews.Entities:
Keywords: data extraction; diagnostic imaging; imaging; natural language processing; neurovascular; stroke; stroke surveillance; surveillance
Year: 2021 PMID: 33944791 PMCID: PMC8132979 DOI: 10.2196/24381
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Example 1 of a discrepancy between the chart abstractor and CHARTextract tool output. (A) Computed tomography angiography scan showing loss of opacification in the left middle cerebral artery, involving the left M1 segment and extending into the M2 segment. (B) CHARTextract tool output: the chart abstractor labeled that large vessel occlusion was present, but the CHARTextract tool determined this attribute to be absent. The rules were revised to reflect that occlusion involving the “M1 segment” should be considered a large vessel occlusion even if the terms “MCA” or “middle cerebral artery” were absent.
Figure 2Example 2 of a discrepancy between the chart abstractor and CHARTextract tool output. (A) Computed tomography angiography scan showing near-occlusion of the cavernous internal carotid artery with reconstitution of the middle cerebral artery. (B) CHARTextract output: the abstractor labeled that large vessel occlusion was absent, but the CHARTextract tool determined this attribute to be present because the words “occlusion” and “M1 segment” were detected in the same sentence.
Figure 3Example 3 of a discrepancy between the chart abstractor and CHARTextract tool output. The abstractor labeled that large vessel occlusion was present because the abstractor was able to interpret that an occlusion from the internal carotid artery and extending to the M2 segment of the middle cerebral artery involves the M1 segment, but the CHARTextract tool determined this attribute to be absent because the tool detects key words without knowledge of vascular anatomy.
Accuracy of the natural language processing tool CHARTextract to identify stroke-related attributes in diagnostic imaging reports.
| Cohort and stroke-related attribute | Attribute prevalence, n (%) | Sensitivity (%) | Specificity(%) | PPVa (%) | NPVb (%) | Overall accuracy (%) | |
|
| |||||||
|
| Anterior proximal occlusion | 111 (12.1) | 95.5 | 98.1 | 84.1 | 99.4 | 97.3 |
|
| Anterior distal occlusion | 127 (13.8) | 92.9 | 98.0 | 88.1 | 98.9 | 97.3 |
|
| Basilar occlusion | 19 (2.1) | 100 | 99.9 | 95.0 | 100 | 99.9 |
|
| Presence of established ischemia | 287 (31.2) | 82.2 | 91.7 | 80.5 | 91.9 | 88.3 |
|
| Presence of any hemorrhage | 114 (12.4) | 93.0 | 98.2 | 87.6 | 99.0 | 97.5 |
|
| |||||||
|
| Anterior proximal occlusion | 50 (12.5) | 90.0 | 97.4 | 76.3 | 98.5 | 95.2 |
|
| Anterior distal occlusion | 61 (15.3) | 83.6 | 97.7 | 86.4 | 97.1 | 95.5 |
|
| Basilar occlusion | 7 (1.8) | 71.4 | 98.2 | 41.7 | 99.5 | 97.7 |
|
| Presence of established ischemia | 104 (26.1) | 80.8 | 85.1 | 64.1 | 92.5 | 83.2 |
|
| Presence of any hemorrhage | 25 (6.3) | 88.0 | 96.0 | 59.5 | 99.2 | 95.5 |
aPPV: positive predictive value.
bNPV: negative predictive value.
Accuracy of the natural language processing tool CHARTextract to identify Alberta stroke program early CT score (ASPECTS) and collateral vascular status based on diagnostic imaging reports.
| Cohort and stroke-related attributes | Attribute prevalence, n (%) | Sensitivity (%) | Specificity (%) | PPVa (%) | NPVb (%) | Overall accuracy (%) | ||||
|
| ||||||||||
|
|
| 98.8 | ||||||||
|
|
| Not reported | 661 (71.8) | 99.7 | 99.2 | 99.7 | 99.2 |
| ||
|
|
| <5 | 30 (3.3) | 96.7 | 99.2 | 80.6 | 99.9 |
| ||
|
|
| ≥5 | 230 (25.0) | 96.5 | 99.7 | 99.1 | 98.9 |
| ||
|
|
| 98.4 | ||||||||
|
|
| Not reported | 774 (84.0) | 99.2 | 96.6 | 99.4 | 95.9 |
| ||
|
|
| Poor | 34 (3.7) | 94.1 | 100 | 100 | 99.8 |
| ||
|
|
| Intermediate | 19 (2.1) | 78.9 | 100 | 100 | 99.6 |
| ||
|
|
| Good | 94 (10.2) | 96.8 | 98.8 | 90.1 | 99.6 |
| ||
|
| ||||||||||
|
|
| 98.5 | ||||||||
|
|
| Not reported | 275 (68.9) | 99.3 | 96.8 | 98.6 | 98.4 |
| ||
|
|
| <5 | 10 (2.5) | 70.0 | 100 | 100.0 | 99.2 |
| ||
|
|
| ≥5 | 114 (28.6) | 99.1 | 99.3 | 98.3 | 99.6 |
| ||
|
|
| 98.2 | ||||||||
|
|
| Not reported | 330 (82.7) | 99.7 | 91.3 | 98.2 | 98.4 |
| ||
|
|
| Poor | 15 (3.8) | 93.3 | 99.7 | 93.3 | 99.7 |
| ||
|
|
| Intermediate | 7 (1.8) | 71.4 | 100 | 100 | 99.5 |
| ||
|
|
| Good | 47 (11.8) | 93.6 | 100 | 100 | 99.2 |
| ||
aPPV: positive predictive value.
bNPV: negative predictive value.