| Literature DB >> 35020599 |
Andrew E Blanchard, Shang Gao, Hong-Jun Yoon, J Blair Christian, Eric B Durbin, Xiao-Cheng Wu, Antoinette Stroup, Jennifer Doherty, Stephen M Schwartz, Charles Wiggins, Linda Coyle, Lynne Penberthy, Georgia D Tourassi.
Abstract
Recent applications ofdeep learning have shown promising results for classifying unstructured text in the healthcare domain. However, the reliability of models in production settings has been hindered by imbalanced data sets in which a small subset of the classes dominate. In the absence of adequate training data, rare classes necessitate additional model constraints for robust performance. Here, we present a strategy for incorporating short sequences of text (i.e. keywords) into training to boost model accuracy on rare classes. In our approach, we assemble a set of keywords, including short phrases, associated with each class. The keywords are then used as additional data during each batch of model training, resulting in a training loss that has contributions from both raw data and keywords. We evaluate our approach on classification of cancer pathology reports, which shows a substantial increase in model performance for rare classes. Furthermore, we analyze the impact of keywords on model output probabilities for bigrams, providing a straightforward method to identify model difficulties for limited training data.Entities:
Mesh:
Year: 2022 PMID: 35020599 PMCID: PMC9533247 DOI: 10.1109/JBHI.2022.3141976
Source DB: PubMed Journal: IEEE J Biomed Health Inform ISSN: 2168-2194 Impact factor: 7.021
Development Dataset Descriptions for Pathology Reports
| Task | Train Docs | Val Docs | Test Docs | Unique Labels |
|---|---|---|---|---|
| Site | 124289 | 26666 | 26230 | 70 |
| Subsite | 124289 | 26666 | 26230 | 315 |
| Histology | 124289 | 26666 | 26230 | 534 |
Production Dataset Descriptions for Pathology Reports
| Task | Train Docs | Val Docs | Test Docs | Unique Labels |
|---|---|---|---|---|
| Site | 3371508 | 579127 | 454307 | 70 |
| Subsite | 3371508 | 579127 | 454307 | 328 |
| Histology | 3371508 | 579127 | 454307 | 645 |
Test Micro and Macro F1 Scores for CNN on the Development Dataset of Pathology Reports
| CNN | CNN + CUIs | CNN + CW | |
|---|---|---|---|
| Site Micro/Macro | 93.62/69.81 | 93.67/71.31 | 93.51/72.07 |
| Subsite Micro/Macro | 78.22/36.74 | 78.46/39.65 | 77.40/38.35 |
| Histology Micro/Macro | 84.01/37.52 | 84.53/45.42 | 82.76/41.21 |
Fig. 1.CNN model performance on the development dataset for three different tasks (site, subiste, histology). First row shows fraction of classes vs training samples per class. Second row shows test accuracy vs training samples for the baseline CNN model (blue), CNN + CUIs (green), and CNN + Class Weights (red). The x-axis for all plots is scaled by a factor of 5 (i.e. the intervals are 0–50, 50–500, 500–5000, and >5000 training samples).
Fig. 2.For each class in the subsite task, the maximum model output probability for all bigrams in the development training corpus is shown vs the number of training samples. The two X’s in each figure correspond to the example bigrams and scores shown in Tables IV–V.
Example Bigram Scores for a Rare Subsite Class (C69.1)
| CNN | CNN + CUI | CNN + CW | |||
|---|---|---|---|---|---|
| Bigram | Score | Bigram | Score | Bigram | Score |
| orbital eye | 0.097 | cornea eye | 0.787 | carcinoma eye | 0.101 |
| carcinoma eye | 0.089 | accompanied cornea | 0.652 | accompanied eye | 0.089 |
| melanoma eye | 0.080 | mass cornea | 0.612 | conjunctiva eye | 0.084 |
| cornea eye | 0.077 | neoplasm cornea | 0.609 | lid eye | 0.079 |
| orbit eye | 0.075 | lesion cornea | 0.585 | tumor eye | 0.073 |
Example Bigram Scores for a Well-Represented Subsite Class (C75.1)
| CNN | CNN + CUI | CNN + CW | |||
|---|---|---|---|---|---|
| Bigram | Score | Bigram | Score | Bigram | Score |
| adenoma pituitary | 0.912 | adenoma pituitary | 0.961 | adenoma pituitary | 0.943 |
| gonadotroph pituitary | 0.733 | gonadotroph pituitary | 0.945 | origin pituitary | 0.873 |
| fluid pituitary | 0.713 | sellar pituitary | 0.936 | hormones pituitary | 0.758 |
| washing pituitary | 0.705 | pituitary gonadotroph | 0.902 | washing pituitary | 0.739 |
| mass pituitary | 0.701 | hormones pituitary | 0.864 | mass pituitary | 0.722 |
Test Micro and Macro F1 Scores for CNN Trained on Production Dataset With Over 4M Pathology Reports
| CNN | CNN + CUIs | CNN + CW | CNN + NPMI | |
|---|---|---|---|---|
| Site Micro/Macro | 92.82/70.79 | 92.82/71.46 | 92.46/71.51 | 92.79/71.04 |
| Subsite Micro/Macro | 69.99/34.80 | 70.14/38.34 | 69.37/37.77 | 69.78/37.71 |
| Histology Micro/Macro | 79.46/35.80 | 79.40/39.09 | 76.47/36.38 | 79.30/38.67 |
Test Micro/Macro F1 Scores for CUI and NPMI Keywords With Fixed α = 1.0 and Varying K on the Development Dataset
| CNN + CUI | CNN + NPMI | |||||
|---|---|---|---|---|---|---|
|
| Site | Subsite | Histology | Site | Subsite | Histology |
| 1 | 93.64/71.50 | 78.33/40.53 | 84.17/46.90 | 93.50/70.92 | 77.82/36.67 | 83.91/40.84 |
| 5 | 93.67/71.31 | 78.46/39.65 | 84.53/45.42 | 93.68/71.06 | 78.26/38.24 | 84.19/43.63 |
| 10 | 93.55/70.74 | 78.55/39.59 | 84.39/43.76 | 93.58/70.94 | 78.32/38.16 | 84.03/41.80 |
| 20 | 93.61/71.29 | 78.35/38.88 | 84.30/42.06 | 93.66/70.87 | 78.07/37.96 | 84.22/41.11 |
Test Micro/Macro F1 Scores for CUI and NPMI Keywords With Fixed K = 5 and Varying α on the Development Dataset
| CNN + CUI | CNN + NPMI | |||||
|---|---|---|---|---|---|---|
|
| Site | Subsite | Histology | Site | Subsite | Histology |
| 0.25 | 93.60/70.43 | 78.31/38.10 | 84.35/42.00 | 93.71/71.43 | 78.14/38.41 | 84.01/42.12 |
| 0.5 | 93.73/71.45 | 78.55/39.21 | 84.54/43.46 | 93.71/71.35 | 78.02/38.20 | 84.10/42.82 |
| 1.0 | 93.67/71.31 | 78.46/39.65 | 84.53/45.42 | 93.68/71.06 | 78.26/38.24 | 84.19/43.63 |
| 2.0 | 93.63/71.31 | 78.41/39.47 | 84.45/45.66 | 93.61/70.94 | 77.95/37.47 | 83.89/41.30 |
Test Micro/Macro F1 Scores for CW With Different Values of μ on the Development Dataset
| CNN + CW | |||
|---|---|---|---|
|
| Site | Subsite | Histology |
| 0.05 | 93.48/71.88 | 77.17/37.74 | 83.22/40.71 |
| 0.15 | 93.51/72.07 | 77.40/38.35 | 82.76/41.21 |
| 1.0 | 93.59/71.46 | 77.97/38.15 | 83.08/39.63 |