| Literature DB >> 35659230 |
Catherine Byrd1, Ureka Ajawara2, Ryan Laundry3, John Radin4, Prasha Bhandari1, Ann Leung5, Summer Han6, Stephen M Asch2, Steven Zeliadt3, Alex H S Harris2,7, Leah Backhus8,9.
Abstract
BACKGROUND: We aim to develop and test performance of a semi-automated method (computerized query combined with manual review) for chart abstraction in the identification and characterization of surveillance radiology imaging for post-treatment non-small cell lung cancer patients.Entities:
Keywords: Chart abstraction; Imaging surveillance; Lung neoplasms; Natural language processing; Non-small cell lung carcinoma
Mesh:
Year: 2022 PMID: 35659230 PMCID: PMC9166440 DOI: 10.1186/s12911-022-01863-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Fig. 1Veterans indexed search for analytics (VISA) tool. This figure includes a nonsense example of the appearance of the Veterans Indexed Search for Analytics (VISA) Tool. The star highlights an example of a Boolean search query. The subsequent results and snippets of text information with highlighted terms from the search query are seen below as represented by the pentagon
Fig. 2VISA data collection instrument. This figure includes a nonsense example of the appearance of the Veterans Indexed Search for Analytics (VISA) Data Collection Instrument tool. The star demarcates the full text of a radiology report. User highlighted text is in pink. Yellow highlighted text represents computer identified queried words and phrases. A pentagon represents an example of the way in which the radiology report may be coded by a user
Characteristics of gold standard manually abstracted radiology reportsα
| Characteristics of radiology reports | Annotated reports (N (%)) |
|---|---|
| Image types | n = 3011 |
| Bone scan | 23 (0.8) |
| Chest X-ray | 1320 (43.8) |
| CT abdomen/pelvis | 127 (4.2) |
| CT chest | 902 (30.0) |
| CT chest/abdomen/pelvis | 123 (4.1) |
| CT head | 149 (4.9) |
| MRI body | 58 (1.9) |
| MRI brain | 42 (1.4) |
| PET scan | 267 (8.9) |
| Image indication | n = 3009 |
| Surveillance | 954 (31.7) |
| Symptomatic | 649 (21.6) |
| Follow up from prior abnormal chest imaging | 244 (8.1) |
| Follow up from prior abnormal other imaging | 30 (1.0) |
| Other | 480 (16.0) |
| Unknown | 652 (21.7) |
| Image findings | n = 3008 |
| Suspicious | 331 (11.0) |
| Recurrence | 110 (3.7) |
| Benign | 959 (31.9) |
| Nonspecific | 1019 (33.9) |
| Second primary lung cancer | 14 (0.5) |
| Second primary cancer, other | 11 (0.4) |
| Other (unrelated to cancer) | 564 (18.8) |
αThe total number of reportss with image type annotated = 3011, the total number of reports with image indication annotated = 3009, the total number of reports with image findings annotated = 3008. 180 reports were found in our manually abstracted dataset that were coded as null, indicating that they were not relevant images. Thus the total number of reports representing 361 patients was 3191
Overall performance of queries in the 361-patient manually abstracted cohort
| Image annotation | ||||
|---|---|---|---|---|
| Image type: any relevant image | Gold standard manual abstraction | |||
| Relevant study | Not relevant study | |||
| Automated Lucene tool result | Relevant study | 2548 | 133 | |
| Not relevant study | 463 | 47 | ||
Sensitivityβ (95% CI) | Specificityγ (95% CI) | PPVδ (95% CI) | F1 scoreε | |
85% (83–86%) | 26% (20–33%) | 95% (94–96%) | 0.90 | |
| Indication: surveillance | Gold standard manual abstraction | |||
| Surveillance study | Not surveillance study | |||
| Automated Lucene tool result | Surveillance study | 690 | 292 | |
| Not surveillance study | 264 | 1763 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
72% (69–75%) | 86% (84–87%) | 70% (67–73%) | 0.71 | |
| Finding: suspicious | Gold standard manual abstraction | |||
| Suspicious finding on study | No suspicious finding on study | |||
| Automated Lucene tool result | Suspicious finding on study | 247 | 755 | |
| No suspicious finding on study | 84 | 1922 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
75% (70–79%) | 72% (70–73%) | 25% (22–27%) | 0.37 | |
| Finding: recurrence only | Gold standard manual abstraction | |||
| Recurrence on study | No recurrence on study | |||
| Automated Lucene tool result | Recurrence on study | 105 | 353 | |
| No recurrence on study | 19 | 2531 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
85% (77–91%) | 88% (87–89%) | 23% (19–27%) | 0.36 | |
βSensitivity = true positive/ (true positive + false negative)
Specificity = true negative/ (true negative + false positive)
δPositive predictive value = true positive/ (true positive + false positive)
εF1 score = 2 ((sensitivity*PPV)/ (sensitivity + PPV))
Performance of the image type query in identifying specific image types in the 361-patient cohortζ
| Image type | ||||
|---|---|---|---|---|
| Bone scan | Gold standard manual abstraction | |||
| Bone scan | Not bone scan | |||
| Automated Lucene tool result | Bone scan | 23 | 0 | |
| Not bone scan | 0 | 2988 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
100% (85–100%) | 100% (100–100%) | 100% (85–100%) | 1.00 | |
| Chest X-ray | Gold standard manual abstraction | |||
| Chest X-ray | Not chest X-ray | |||
| Automated Lucene tool result | Chest X-ray | 1039 | 0 | |
| Not chest X-ray | 281 | 1691 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
79% (76–81%) | 100% (100–100%) | 100% (100–100%) | 0.88 | |
| CT chest | Gold standard manual abstraction | |||
| CT chest | Not CT chest | |||
| Automated Lucene tool result | CT chest | 841 | 0 | |
| Not CT chest | 61 | 2109 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
93% (91–95%) | 100% (100–100%) | 100% (100–100%) | 0.97 | |
| CT abdomen/pelvis | Gold standard manual abstraction | |||
| CT abdomen/pelvis | Not CT abdomen/pelvis | |||
| Automated lucene tool result | CT abdomen/pelvis | 63 | 0 | |
| Not CT abdomen/pelvis | 64 | 2884 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
50% (41–59%) | 100% (100–100%) | 100% (94–100%) | 0.66 | |
| CT chest/abdomen/pelvis | Gold standard manual abstraction | |||
| CT Chest/abdomen/pelvis | Not CT Chest/abdomen/pelvis | |||
| Automated Lucene tool result | CT Chest/abdomen/pelvis | 116 | 0 | |
| Not CT Chest/abdomen/pelvis | 7 | 2888 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
94% (89–98%) | 100% (100–100%) | 100% (97–100%) | 0.97 | |
| CT head | Gold standard manual abstraction | |||
| CT head | Not CT head | |||
| Automated Lucene tool result | CT head | 131 | 0 | |
| Not CT head | 18 | 2862 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
88% (82–93%) | 100% (100–100%) | 100% (97–100%) | 0.94 | |
| MR body | Gold standard manual abstraction | |||
| MR body | Not MR body | |||
| Automated Lucene tool result | MR body | 28 | 0 | |
| Not MR body | 30 | 2953 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
48% (35–62%) | 100% (100–100%) | 100% (88–100%) | 0.65 | |
| MR brain | Gold standard manual abstraction | |||
| MR brain | Not MR brain | |||
| Automated Lucene tool result | MR brain | 41 | 0 | |
| Not MR brain | 1 | 2969 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
98% (87–100%) | 100% (100–100%) | 100% (91–100%) | 0.99 | |
| PET | Gold standard manual abstraction | |||
| PET | Not PET | |||
| Automated Lucene tool result | PET | 266 | 0 | |
| Not PET | 1 | 2744 | ||
Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | F1 score | |
100% (98–100%) | 100% (100–100%) | 100% (99–100%) | 1.00 | |
ζThis 361-patient cohort consists of 3011 manually abstracted reports
Timing for manual and semi-automated chart abstractionζ
| Timing metric | Manual | Semi-automated | p-value | % Reduction in time |
|---|---|---|---|---|
| Total number of reports | 204 | 239 | ||
| Minutes/patient (median, (IQR)) | 21.5 (16.0) | 6.9 (9.5) | 0.0024 | 68 |
| Seconds/patient report (median, (IQR)) | 60.0 (90.0) | 30.0 (80.0) | < 0.0005 | 50 |
| Reports/patient (mean, (SD)) | 12.75 (10.50) | 9.96 (9.41) | 0.0398 |
The semi-automated chart abstraction was performed using the image-type query