| Literature DB >> 32396579 |
Shang Gao1, Mohammed Alawad1, Noah Schaefferkoetter1, Lynne Penberthy2, Xiao-Cheng Wu3, Eric B Durbin4, Linda Coyle5, Arvind Ramanathan6, Georgia Tourassi1.
Abstract
Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence-for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks-site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks.Entities:
Mesh:
Year: 2020 PMID: 32396579 PMCID: PMC7217446 DOI: 10.1371/journal.pone.0232840
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The baseline case for classifying EHRs and the five methods for incorporating case-level context from other reports.
In the figures above, “Model” represents an arbitrary deep learning model designed for text classification, the output of which is an embedding representation of the input document.
Accuracy and macro F-Score (with 95% confidence intervals) of our different methods to capture case-level context on six different classification tasks using the CNN as the baseline.
The top row is our baseline without any report level context, the middle group shows results of methods than can access both future and previous reports in a sequence, and the bottom group show results of methods that can only access previous reports in a sequence.
| Site | Subsite | Laterality | Histology | Behavior | Grade | |
|---|---|---|---|---|---|---|
| 89.07 | 59.82 | 89.64 | 73.82 | 96.91 | 71.94 | |
| Accuracy | (88.91, 89.21) | (59.57, 60.05) | (89.49, 89.79) | (73.59, 74.03) | (96.82, 96.99) | (71.72, 72.15) |
| 56.22 | 24.33 | 46.91 | 22.79 | 67.16 | 73.72 | |
| Macro F-Score | (55.45, 56.83) | (23.92, 24.89) | (46.01, 47.83) | (22.42, 23.46) | (65.52, 68.93) | (72.24, 74.95) |
| 91.95 | 64.17 | 92.44 | 78.26 | 98.49 | 80.13 | |
| Accuracy | (91.74, 92.01) | (63.72, 64.19) | (92.28, 92.54) | (78.12, 78.54) | (98.40, 98.52) | (79.84, 80.22) |
| 58.62 | 22.57 | 21.67 | 74.58 | 75.21 | ||
| Macro F-Score | (57.91, 59.14) | (22.08, 22.75) | (21.23, 22.11) | (71.85, 77.44) | (74.74,79.72) | |
| 92.37 | 63.16 | 92.28 | 79.59 | 98.61 | 79.72 | |
| Accuracy | (91.87, 92.37) | (62.78, 63.68) | (92.01, 92.51) | (78.90, 79.63) | (98.48, 98.70) | (79.26, 80.00) |
| 62.14 | 27.42 | 49.89 | 32.29 | 73.83 | 79.03 | |
| Macro F-Score | (60.26, 62.66) | (26.16, 27.46) | (46.91, 50.33) | (30.65, 32.56) | (69.56, 78.16) | (78.66, 80.02) |
| 92.26 | 63.03 | 92.29 | 79.27 | 98.64 | 80.66 | |
| Accuracy | (91.97, 92.45) | (62.65, 63.59) | (92.21, 92.69) | (78.62, 79.35) | (98.50, 98.71) | (80.53, 81.27) |
| 64.17 | 32.88 | 47.22 | 33.85 | 76.22 | 79.31 | |
| Macro F-Score | (63.89, 66.79) | (31.87, 33.49) | (44.75, 51.84) | (33.88, 35.78) | (74.28, 82.98) | (78.79, 80.20) |
| 98.73 | ||||||
| Accuracy | (98.57, 98.78) | |||||
| 61.92 | 30.20 | 47.52 | 35.27 | 71.48 | ||
| Macro F-Score | (60.75, 62.80) | (29.73, 31.20) | (46.36, 50.64) | (34.02, 35.73) | (70.06, 79.63) | |
| 92.30 | 62.53 | 92.15 | 78.81 | 82.08 | ||
| Accuracy | (92.12, 92.60) | (62.15, 63.06) | (91.95, 92.45) | (78.33, 79.07) | (82.18, 82.87) | |
| 49.29 | 81.27 | |||||
| Macro F-Score | (46.16, 53.53) | (80.79, 82.13) | ||||
| 90.42 | 91.47 | 76.20 | 97.78 | 75.52 | ||
| Accuracy | (90.34, 90.62) | (91.29, 91.57) | (75.85, 76.26) | (97.73, 97.88) | (75.42, 75.84) | |
| 56.53 | 22.25 | 47.43 | 20.41 | 67.44 | 77.62 | |
| Macro F-Score | (55.86, 57.11) | (21.90, 22.68) | (46.22, 48.10) | (20.15, 21.02) | (66.81, 70.65) | (73.61, 78.28) |
| 90.60 | 61.88 | 91.43 | 76.01 | 97.96 | 76.49 | |
| Accuracy | (90.39, 90.91) | (60.99, 61.92) | (91.20, 91.73) | (75.55, 76.32) | (97.81, 98.07) | (76.15, 76.93) |
| 56.78 | 26.11 | 45.73 | 28.79 | 71.15 | 76.80 | |
| Macro F-Score | (55.68, 58.01) | (24.84, 26.10) | (44.30, 52.28) | (28.03, 29.77) | (69.74, 78.59) | (75.77, 77.22) |
| 90.82 | 61.50 | 91.37 | 76.53 | 98.32 | 77.23 | |
| Accuracy | (90.56, 91.09) | (60.73, 61.63) | (91.25, 91.78) | (76.07, 76.85) | (98.18, 98.41) | (76.98, 77,72) |
| 60.19 | 30.24 | 47.65 | 32.57 | 73.05 | 76.11 | |
| Macro F-Score | (59.01, 61.86) | (29.71, 31.37) | (45.04, 48.61) | (31.21, 33.01) | (69.10, 78.35) | (75.92, 77.29) |
| 90.63 | 61.72 | 91.35 | 76.66 | 98.19 | 76.88 | |
| Accuracy | (90.36, 90.88) | (60.89, 61.82) | (90.90, 91.45) | (75.92, 76.71) | (97.91, 98.17) | (76.33, 77.13) |
| 59.48 | 29.42 | 47.44 | 30.67 | 71.33 | 76.69 | |
| Macro F-Score | (57.80, 60.40) | (27.78, 30.30) | (45.02, 49.31) | (29.53, 31.32) | (68.46, 77.09) | (75.68, 77.12) |
| 62.00 | ||||||
| Accuracy | (61.55, 62.42) | |||||
| Macro F-Score |
Accuracy and macro F-Score (with 95% confidence intervals) of our different methods to capture case-level context on six different classification tasks using the HiSAN as the baseline.
The top row is our baseline without any report level context, the middle group shows results of methods than can access both future and previous reports in a sequence, and the bottom group show results of methods that can only access previous reports in a sequence.
| Site | Subsite | Laterality | Histology | Behavior | Grade | |
|---|---|---|---|---|---|---|
| 90.06 | 61.94 | 89.97 | 75.00 | 96.88 | 73.10 | |
| Accuracy | (89.90, 90.20) | (61.71, 62.17) | (89.81, 90.12) | (74.78, 75.21) | (96.80, 96.96) | (72.87, 73.30) |
| 62.98 | 30.31 | 51.46 | 33.20 | 79.73 | 74.45 | |
| Macro F-Score | (62.07, 63.69) | (29.95, 31.10) | (50.64, 52.37) | (32.36, 33.88) | 77.23, 81.89) | (72.80, 75.79) |
| 92.71 | 67.07 | 93.11 | 80.50 | 98.86 | 84.37 | |
| Accuracy | (92.49, 92.96) | (66.83, 67.69) | (92.78, 93.26) | (80.01, 80.75) | (98.85, 99.04) | (84.50, 85.17) |
| 67.63 | 37.26 | 52.72 | 38.26 | 82.81 | 83.69 | |
| Macro F-Score | (65.69, 68.57) | (35.88, 37.69) | (51.24, 56.81) | (37.74, 39.77) | (77.36, 86.03) | (83.29, 84.82) |
| 92.44 | 66.66 | 92.59 | 79.82 | 98.75 | 84.35 | |
| Accuracy | (92.25, 92.75) | (66.10, 66.98) | (92.34, 92.80) | (79.61, 80.34) | (98.61, 98.82) | (83.79, 84.46) |
| 67.92 | 39.54 | 53.17 | 41.62 | 83.42 | 83.80 | |
| Macro F-Score | (66.61, 69.39) | (37.81, 39.81) | (51.40, 56.83) | (39.75, 41.74) | (80.42, 86.70) | (83.00, 84.20) |
| Accuracy | ||||||
| 68.04 | 39.01 | 38.70 | 85.98 | |||
| Macro F-Score | (65.91, 68.25) | (37.44, 39.23) | (38.06, 39.98) | (82.13, 89.89) | ||
| 92.52 | 66.83 | 92.80 | 80.36 | 98.96 | 84.97 | |
| Accuracy | (92.34, 92.83) | (66.54, 67.44) | (92.59, 93.05) | (80.01, 80.74) | (98.79, 98.99) | (84.44, 85.09) |
| 54.74 | 85.35 | |||||
| Macro F-Score | (52.77, 57.99) | (84.56, 85.51) | ||||
| 91.37 | 64.13 | 91.81 | 77.08 | 98.24 | 79.15 | |
| Accuracy | (91.18, 91.70) | (64.06, 64.96) | (91.71, 92.21) | (76.56, 77.30) | (98.14, 98.38) | (78.77, 79.49) |
| 63.59 | 34.50 | 46.81 | 33.42 | 79.54 | 79.22 | |
| Macro F-Score | (62.40, 65.41) | (32.61, 34.83) | (47.53, 51.81) | (33.19, 35.18) | (74.15, 82.77) | (78.60, 79.96) |
| 91.92 | 65.56 | 92.38 | 77.76 | 98.61 | 81.80 | |
| Accuracy | (91.53, 92.03) | (65.14, 65.99) | (92.29, 92.78) | (77.43, 78.18) | (98.43, 98.66) | (81.82, 82.55) |
| 65.62 | 36.99 | 50.38 | 38.76 | 85.25 | 81.58 | |
| Macro F-Score | (64.84, 67.95) | (36.45, 38.43) | (49.79, 59.41) | (38.43, 40.45) | (77.71, 86.18) | (81.10, 82.32) |
| 91.50 | 64.82 | 91.94 | 77.54 | 98.20 | 79.38 | |
| Accuracy | (91.26, 91.77) | (64.56, 65.42) | (91.87, 92.37) | (77.13, 77.86) | (98.15, 98.39) | (79.12, 79.86) |
| 63.81 | 35.32 | 50.34 | 36.00 | 81.77 | 80.29 | |
| Macro F-Score | (62.66, 65.73) | (34.07, 35.88) | (49.09, 55.02) | (34.91, 36.93) | (76.55, 84.38) | (79.27, 80.55) |
| Accuracy | ||||||
| Macro F-Score |
Fig 2The top two subfigures show the cancer site document embeddings generated by the HiSAN for each pathology report in our test set with and without the self-attention module for capturing case-level context.
The bottom two figures only show the document embeddings of misclassified reports in our test set. All document embeddings are colored by the ground truth organ system and visualized using t-SNE.
Fig 3The cancer site document embeddings generated by the HiSAN for the pathology reports associated with four unique tumor IDs, with and without the self-attention module for capturing case-level context.
These figures share the same axes as Fig 2 and thus can be directly compared. Within each of the four trajectories, document embeddings are numbered from earliest to latest and are colored by the predicted organ system. We notice that without case-level context, reports belonging to the same tumor ID are classified under different organ systems. Adding case-level context addresses this problem and all document embeddings from the same tumor ID are placed in the same location in the embedding space.