| Literature DB >> 30733701 |
Kentaro Akazawa1,2, Ryo Sakamoto1,3, Satoshi Nakajima1,3, Dan Wu1, Yue Li4, Kenichi Oishi1, Andreia V Faria1, Kei Yamada2, Kaori Togashi3, Constantine G Lyketsos5,6, Michael I Miller7, Susumu Mori1.
Abstract
Purpose: To examine the feasibility and potential difficulties of automatically generating radiologic reports (RRs) to articulate the clinically important features of brain magnetic resonance (MR) images. Materials andEntities:
Keywords: 3D T1 weighted image; automated generation; brain atlas; brain atrophy; dementia; radiologic description
Year: 2019 PMID: 30733701 PMCID: PMC6354548 DOI: 10.3389/fneur.2019.00007
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Figure 1Schematic diagram for the image–sentence conversion. MPRAGE images were first segmented into 283 structural units using a multi-atlas segmentation tool, from which an additional 215 superstructures were defined. This process of reducing an image with >106 voxels to 498 structures was called the Anatomic Knowledge Filter. In the second step, called the Abnormality Judgment Filter, volumes of structures were compared with values retrieved from databases of healthy individuals, with z-scores (criteria for abnormality judgment) calculated. In the third step, z-score information for the 498 structures was reduced to information on 20 selected structures and their relationships, using a Clinical Knowledge Filter. The final step was the translation of this information to human-readable sentences using a dictionary. MPRAGE, magnetization-prepared rapid gradient-echo.
Figure 2Decision-making tree of the reasons for discrepancies between the radiologic reports and automated sentences. AKF, Anatomic Knowledge Filter; CKF, Clinical Knowledge Filter.
The number of the positive and/or negative findings in the brain tissue and the ventricle among AS and 3 RRs.
| Brain tissue | Positive | Positive | 45 | 32 | 45 |
| Positive | Negative | 12 | 25 | 12 | |
| Negative | Positive | 15 | 4 | 15 | |
| Negative | Negative | 20 | 31 | 20 | |
| Accuracy | 70.7% | 68.5% | 70.7% | ||
| Sensitivity | 75.0% | 88.9% | 75.0% | ||
| Precision | 78.9% | 56.1% | 78.9% | ||
| Specificity | 62.5% | 55.4% | 62.5% | ||
| Ventricle | Positive | Positive | 5 | 5 | 3 |
| Positive | Negative | 30 | 30 | 32 | |
| Negative | Positive | 0 | 0 | 0 | |
| Negative | Negative | 57 | 57 | 57 | |
| Accuracy | 67.4% | 67.4% | 65.2% | ||
| Sensitivity | 100.0% | 100.0% | 100.0% | ||
| Precision | 14.3% | 14.3% | 8.6% | ||
| Specificity | 65.5% | 65.5% | 64.0% | ||
AS, Automated sentence; RR, Radiologic report.
The number of positive and/or negative findings of all anatomic structures that were mentioned in AS or RRs among AS, 3 RR, and RR by majority voting.
| TP (n) | 23 | 31 | 34 | 28 | P-P (n) | 39 | 36 | 42 | 59 | 64 | 61 | |||
| FP (n) | 94 | 86 | 85 | 89 | P-N (n) | 82 | 85 | 79 | 64 | 58 | 58 | |||
| FN (n) | 98 | 90 | 83 | 56 | N-P (n) | 82 | 83 | 76 | 25 | 20 | 23 | |||
| TN (n) | 1901 | 1909 | 1914 | 1943 | N-N (n) | 1913 | 1912 | 1919 | 1968 | 1974 | 1974 | |||
| Accuracy | 90.9% | 91.7% | 92.1% | 93.1% | PP/Total | 92.2% | 92.1% | 92.7% | 95.8% | 96.3% | 96.2% | |||
| Sensitivity | 19.0% | 25.6% | 29.1% | 33.3% | PP/(PP+NP) | 32.2% | 30.3% | 35.6% | 32.7% | 70.2% | 76.2% | 72.6% | 73.0% | |
| Precision | 19.7% | 26.5% | 28.6% | 24.9% | 23.9% | PP/(PP+PN) | 32.2% | 29.8% | 34.7% | 32.2% | 48.0% | 52.5% | 51.3% | 50.5% |
| Specificity | 95.3% | 95.7% | 95.7% | 95.6% | NN/(NN+PN) | 95.9% | 95.7% | 96.0% | 95.9% | 96.9% | 97.1% | 97.1% | 97.0% | |
| kappa value | 0.15 | 0.22 | 0.25 | 0.20 | 0.24 | kappa value | 0.28 | 0.26 | 0.31 | 0.28 | 0.55 | 0.60 | 0.58 | 0.58 |
AS, Automated sentence; N, Negative; P, Positive; RR, Radiologic report; RRm, RR by majority voting.
Number and percentage of each error category and subcategory.
| RR1 | 192 | 34 (17.7%) | 10 (5.2%) | 16 (8.3%) | 0 (0.0%) | 29 (15.1%) | 23 (12.0%) | 80 (41.7%) | ||||||
| 15 (7.8%) | 19 (9.9%) | 3 (1.6%) | 3 (1.6%) | 1 (0.5%) | 3 (1.6%) | 6 (3.1%) | 10 (5.2%) | 56 (29.2%) | 24 (12.5%) | |||||
| RR2 | 176 | 7 (4.0%) | 10 (5.7%) | 2 (1.1%) | 11 (6.3%) | 14 (8.0%) | 14 (8.0%) | 118 (67.0%) | ||||||
| 3 (1.7%) | 4 (2.3%) | 3 (1.7%) | 3 (1.7%) | 1 (0.6%) | 3 (1.7%) | 2 (1.1%) | 0 (0.0%) | 62 (35.2%) | 56 (31.8%) | |||||
| RR3 | 168 | 0 (0.0%) | 10 (6.0%) | 13 (7.7%) | 5 (3.0%) | 3 (1.8%) | 16 (9.5%) | 121 (72.0%) | ||||||
| 0 (0.0%) | 0 (0.0%) | 3 (1.8%) | 3 (1.8%) | 1 (0.6%) | 3 (1.8%) | 0 | 13 (7.7%) | 72 (42.9%) | 49 (29.2%) | |||||
| RRm | 145 | 4 (2.8%) | 10 (6.9%) | 5 (3.4%) | 2 (1.4%) | 10 (6.9%) | 18(12.4%) | 96 (66.2%) | ||||||
| 2 (1.4 %) | 2 (1.4%) | 3 (2.1%) | 3 (2.1%) | 1 (0.7%) | 3 (2.1%) | 2 (1.4%) | 3 (2.1%) | 67 (46.2%) | 29 (20.0%) | |||||
RR, Radiologic report, RRm, RR by majority voting.
Figure 3Two examples of hippocampal segmentation. Both cases had a similar degree of atrophy based on the quantification results, with z-scores higher than −2.0; both were judged as normal and automated sentences were not triggered. The radiologists agreed that Case (A) was normal (true-negative), but Case (B) was judged to be atrophic (false-negative). In this case, the opening of the uncal sulci (arrows) was a factor in the radiologists' judgment. Blue lines show the boundary of the hippocampus.
Figure 4Subtraction of false-negative cases from false-positive cases for three different structures. For the hemisphere, there were more false-positive than false-negative cases, suggesting that the threshold (z-score <-2.0) was too low (too close to zero and sentences were triggered too often). On the other hand, the same threshold was too relaxed for the hippocampus, assuming the radiologists' judgment to be the gold standard. RR, radiologic report.
Figure 5Two sample cases with similar z-scores for the frontal lobe volumes, but with different interpretations by radiologists. Two radiologists reported bilateral frontal lobe atrophy in Case (A), where the image showed noticeable expansions of the bilateral frontal sulci and unilateral Sylvian fissure, as well as a highly dilated anterior lateral ventricle. All radiologists considered the frontal lobe to be normal in Case (B), where the image showed relatively smaller frontal sulci and Sylvian fissures, and moderately dilated anterior lateral ventricles. AS, automated sentence; L, left; LV, lateral ventricle; R, right; RR, radiologic report.