| Literature DB >> 32244833 |
Ivan Krsnik1, Goran Glavaš2, Marina Krsnik3, Damir Miletić4, Ivan Štajduhar1,5.
Abstract
Narrative texts in electronic health records can be efficiently utilized for building decision support systems in the clinic, only if they are correctly interpreted automatically in accordance with a specified standard. This paper tackles the problem of developing an automated method of labeling free-form radiology reports, as a precursor for building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of a knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models-Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF)-coupled with Bag-of-Words (BoW) features (i.e., symbolic text representation) and (2) Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We resorted to nested 10-fold cross-validation to evaluate the performance of competing methods using accuracy, precision, recall, and F 1 score. The CNN with semantic word representations as input yielded the overall best performance, having a micro-averaged F 1 score of 86 . 7 % . The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease ( 95 . 9 % ), arthrosis ( 93 . 3 % ), and injury ( 89 . 2 % ). As a data-hungry deep learning model, the CNN, however, performed notably worse than the competing models on underrepresented classes with fewer training instances such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with the obtained micro-averaged F 1 scores of 84 . 6 % , 82 . 2 % , and 82 . 1 % , respectively.Entities:
Keywords: automatic labeling; decision support system; free-form radiology report; knee; machine learning; natural language processing; word embedding
Year: 2020 PMID: 32244833 PMCID: PMC7235892 DOI: 10.3390/diagnostics10040196
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Number of occurrences (counts) of distinct clinical conditions and examples of phrases indicating the clinical conditions. These phrases were the most frequently occurring in the used collection of unstructured radiology reports.
| Clinical Condition | Count | Observation |
|---|---|---|
| arthrosis | 820 | grade I degenerative changes, chondromalacia, pseudocyst, popliteal cyst |
| injury | 738 | patella fracture, partial ACL rupture, sprain, meniscus tear, contusion |
| degenerative disease | 160 | osteoarthritis |
| clean | 79 | lateral meniscus intact, cartilage preserved, patella positioned normally |
| inflammatory disease | 76 | osteomyelitis, septic arthritis, chronic enthesitis, Osgood–Schlatter disease, rheumatoid arthritis, discoid meniscus |
| neoplasm | 68 | sessile osteochondroma, hemangioma, enchondroma, fibroma |
| unclear | 42 | patchy areas of edema, patellar tilt |
| multicausal disease | 32 | varicose veins, chondrocalcinosis, osteochondromatosis |
| developmental anomaly | 16 | tibial developmental defect, bipartite patella, knee recurvation |
| metabolic disease | 14 | osteoporosis |
| neoplasm-like growth | 7 | fibrous cortical defect |
| idiopathic disease | 3 | ACL mucoid degeneration |
| autoimmune disease | 2 | Henoch–Schönlein purpura |
| genetic disease | 1 | osteopetrosis |
Observation counts for each unique set of clinical conditions appearing in the radiology reports used for this research. Sets of clinical conditions are shown in descending order with respect to their observation count.
| Clinical Conditions | Count | Clinical Conditions | Count |
|---|---|---|---|
| arthrosis, injury | 306 | arthrosis, injury, degenerative dis., neoplasm | 2 |
| injury | 281 | arthrosis, injury, degenerative dis., inflammatory dis. | 2 |
| arthrosis | 261 | arthrosis, injury, degenerative dis., multicausal dis. | 2 |
| clean | 79 | arthrosis, injury, degenerative dis., developmental anomaly | 1 |
| arthrosis, injury, degenerative dis. | 66 | arthrosis, injury, neoplasm, metabolic disease | 1 |
| arthrosis, degenerative dis. | 52 | degenerative dis., developmental anomaly | 1 |
| unclear | 41 | arthrosis, developmental anomaly | 1 |
| arthrosis, neoplasm | 20 | arthrosis, injury, degenerative dis., inflammatory dis., developmental anomaly, idiopathic dis. | 1 |
| arthrosis, inflammatory dis. | 20 | developmental anomaly | 1 |
| neoplasm | 19 | arthrosis, genetic dis. | 1 |
| inflammatory dis. | 15 | arthrosis, degenerative dis., inflammatory dis., neoplasm | 1 |
| arthrosis, injury, inflammatory dis. | 15 | arthrosis, injury, degenerative dis., neoplasm-like growth | 1 |
| arthrosis, degenerative dis., inflammatory dis. | 11 | arthrosis, degenerative dis., inflammatory dis., multicausal dis. | 1 |
| injury, neoplasm | 9 | arthrosis, injury, multicausal dis., idiopathic dis. | 1 |
| arthrosis, injury, multicausal dis. | 9 | injury, neoplasm, developmental anomaly | 1 |
| arthrosis, injury, neoplasm | 7 | arthrosis, idiopathic dis. | 1 |
| arthrosis, multicausal dis. | 6 | injury, metabolic disease | 1 |
| injury, degenerative dis. | 6 | neoplasm-like growth | 1 |
| arthrosis, injury, metabolic disease | 5 | arthrosis, inflammatory dis., developmental anomaly | 1 |
| arthrosis, injury, developmental anomaly | 5 | arthrosis, degenerative dis., inflammatory dis., developmental anomaly | 1 |
| injury, inflammatory dis. | 4 | injury, neoplasm-like growth | 1 |
| arthrosis, metabolic disease | 4 | degenerative dis. | 1 |
| arthrosis, degenerative dis., multicausal dis. | 4 | unclear, neoplasm, neoplasm-like growth | 1 |
| arthrosis, injury, degenerative dis., metabolic disease | 3 | arthrosis, injury, neoplasm-like growth | 1 |
| arthrosis, degenerative dis., neoplasm | 3 | neoplasm-like growth, autoimmune dis. | 1 |
| injury, developmental anomaly | 3 | autoimmune dis. | 1 |
| arthrosis, inflammatory dis., neoplasm | 3 | multicausal dis., neoplasm-like growth | 1 |
| injury, multicausal dis. | 3 | inflammatory dis., multicausal dis. | 1 |
| multicausal dis. | 3 | arthrosis, injury, degenerative dis., neoplasm, multicausal dis. | 1 |
Examples of morphological clusters obtained via our simple heuristic word clustering algorithm. The numbers in brackets indicate the number of occurrences in our corpus of radiology reports. Selected lemma of each cluster is emphasized.
| uznapredovala (10), uznapredovalog (13), uznapredovalih (17), uznapredovalnim (1), uznapredovalim (5), |
| posteromedijalni (1), posteromedijalnog (1), |
| infrapatelarnoj (1), infrapatelarno (29), infrapatelarnom (3), infrapatelano (1), |
| ekstenzorni (1), ekstenzije (1), ekstenzora (1), ekstenziji (1), ekstenzornom (1), ekstenzivne (1), ekstenzija (1), |
Figure 1An illustration of the architecture of the vanilla CNN we used for text classification.
Results (in terms of scores) for all five text classification models used on each of the 10 knee condition labels. The last two rows show the micro- and macro-averaged performance over all 10 classes. The results presented here were obtained via nested 10-fold CV.
| Class | NB | RF | LR | SVM | CNN |
|---|---|---|---|---|---|
| clean | 24.3 | 25.3 | 47.4 |
| 39.5 |
| unclear | 8.7 | 14.8 | 7.5 | 8.7 | 4.5 |
| arthrosis | 83.5 | 89.3 | 89.0 | 88.9 |
|
| injury | 81.3 | 82.3 | 88.3 | 87.7 |
|
| degenerative dis. | 26.4 | 94.5 | 94.8 | 72.5 |
|
| inflammatory dis. | 7.1 | 65.6 |
| 36.4 | 52.3 |
| neoplasm | 16.2 | 46.0 | 55.4 | 51.6 |
|
| multicausal dis. | 0.0 | 27.9 |
| 6.1 | 0.0 |
| developmental an. | 0.0 | 31.6 |
| 0.0 | 0.0 |
| metabolic dis. | 0.0 |
|
| 0.0 | 13.3 |
| Macro avg. | 24.8 | 54.7 |
| 40.9 | 46.4 |
| Micro avg. | 73.8 | 82.2 | 84.6 | 82.1 |
|
Detailed results (classification accuracy, precision, recall, score) for NB, RF, LR, SVM, and CNN models for the 10 knee condition labels. The last two columns show the micro- and macro-averaged performance on the selected evaluation metric over all 10 classes. The results are obtained via nested 10-fold CV.
| Model | Class | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Clean | Unclear | Arthrosis | Injury | Degenerative Dis. | Inflammatory Dis. | Neoplasm | Multicausal Dis. | Developmental An. | Metabolic Dis. | Macro Average | Micro Average | |
|
| ||||||||||||
| NB | 95.7 | 96.8 | 75.2 | 75.5 | 87.5 | 93.9 | 95.2 | 97.5 | 98.8 | 98.9 | 91.5 | 91.5 |
| RF | 95.0 | 96.5 | 86.2 | 79.7 | 98.7 | 96.7 | 95.8 | 97.6 | 99.0 | 99.5 | 94.5 | 94.4 |
| LR | 96.1 | 96.2 | 85.5 | 86.3 | 98.8 | 96.8 | 96.2 | 97.9 | 99.0 | 99.5 | 95.2 | 95.2 |
| SVM | 96.8 | 96.8 | 85.0 | 85.3 | 94.2 | 95.2 | 96.5 | 97.6 | 98.8 | 98.9 | 94.5 | 94.5 |
| CNN | 96.0 | 96.8 | 91.0 | 87.2 | 99.0 | 95.9 | 97.8 | 97.4 | 98.8 | 99.0 | 95.9 | 95.9 |
|
| ||||||||||||
| NB | 90.0 | 66.7 | 74.4 | 74.5 | 48.3 | 37.5 | 100 | 0.0 | 0.0 | 0.0 | 49.1 | 73.7 |
| RF | 47.8 | 36.4 | 91.7 | 85.3 | 98.0 | 85.4 | 71.9 | 54.5 | 100 | 88.9 | 76.0 | 89.0 |
| LR | 69.7 | 20.0 | 88.9 | 89.7 | 98.0 | 86.0 | 70.5 | 77.8 | 80.0 | 88.9 | 77.0 | 88.7 |
| SVM | 82.4 | 66.7 | 86.8 | 87.1 | 87.6 | 81.8 | 96.0 | 100 | 0.0 | 0.0 | 68.8 | 87.0 |
| CNN | 77.3 | 100 | 91.6 | 89.5 | 97.4 | 85.3 | 91.7 | 0.0 | 0.0 | 100 | 63.3 | 90.8 |
|
| ||||||||||||
| NB | 14.1 | 4.7 | 95.2 | 89.5 | 18.1 | 3.9 | 8.8 | 0.0 | 0.0 | 0.0 | 23.4 | 74.0 |
| RF | 17.2 | 9.3 | 86.9 | 79.5 | 91.3 | 53.2 | 33.8 | 18.8 | 18.8 | 57.1 | 46.6 | 76.4 |
| LR | 35.9 | 4.7 | 89.1 | 86.9 | 91.9 | 55.8 | 45.6 | 21.9 | 25.0 | 57.2 | 51.4 | 80.8 |
| SVM | 43.8 | 4.7 | 91.0 | 88.3 | 61.9 | 23.4 | 35.3 | 3.1 | 0.0 | 0.0 | 35.2 | 77.7 |
| CNN | 26.6 | 2.3 | 95.1 | 89.0 | 94.4 | 37.7 | 64.7 | 0.0 | 0.0 | 7.1 | 41.7 | 83.0 |
|
| ||||||||||||
| NB | 24.3 | 8.7 | 83.5 | 81.3 | 26.4 | 7.1 | 16.2 | 0.0 | 0.0 | 0.0 | 24.8 | 73.8 |
| RF | 25.3 |
| 89.3 | 82.3 | 94.5 | 65.6 | 46.0 | 27.9 | 31.6 |
| 54.7 | 82.2 |
| LR | 47.4 | 7.5 | 89.0 | 88.3 | 94.8 |
| 55.4 |
|
|
|
| 84.6 |
| SVM | 57.1 | 8.7 | 88.9 | 87.7 | 72.5 | 36.4 | 51.6 | 6.1 | 0.0 | 0.0 | 40.9 | 82.1 |
| CNN | 39.5 | 4.5 |
|
|
| 52.3 |
| 0.0 | 0.0 | 13.3 | 46.4 |
|