| Literature DB >> 31549323 |
Adam Spandorfer1, Cody Branch1, Puneet Sharma2, Pooyan Sahbaee2, U Joseph Schoepf1, James G Ravenel1, John W Nance3.
Abstract
BACKGROUND: Structured reports have been shown to improve communication between radiologists and providers. However, some radiologists are concerned about resultant decreased workflow efficiency. We tested a machine learning-based algorithm designed to convert unstructured computed tomography pulmonary angiography (CTPA) reports into structured reports.Entities:
Keywords: Artificial intelligence; Machine learning; Natural language processing; Structured reporting; Tomography (x-ray, computed)
Year: 2019 PMID: 31549323 PMCID: PMC6757071 DOI: 10.1186/s41747-019-0118-1
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Fig. 1Examples of unstructured (a, used for testing) and structured (b, used for training) radiology reports
Fig. 2Overview of the proposed deep learning framework for converting free-form unstructured reports into structured reports with section headings. Each input report is split into sentences and each sentence is classified by the pre-trained convolution neural network algorithm into one of the classes (section headings). NLP, Natural language processing
Fig. 3Text classification model with convolution neural network net. Conv, Convolution; NLP, Natural language processing
Accuracy of individual predicted labels
| Predicted label | Number of statements | Accuracy by strict criteria | Accuracy by modified criteria | Problematic statements |
|---|---|---|---|---|
| Cardiovascular | 840 | 805/840 (95.8%) | 815/840 (97.0%) | 23/840 (2.7%) |
| Lines/tubes | 118 | 111/118 (94.1%) | 113/118 (95.8%) | 2/118 (1.7%) |
| Lungs and airways | 821 | 717/821 (87.3%) | 768/821 (93.5%) | 68/821 (8.3%) |
| Mediastinum and lymph nodes | 447 | 402/447 (89.9%) | 444/447 (99.3%) | 48/447 (10.7%) |
| Pleura | 371 | 307/371 (82.7%) | 369/371 (99.5%) | 62/371 (16.8%) |
| Pulmonary arteries | 502 | 485/502 (96.6%) | 487/502 (97.0%) | 21/502 (4.2%) |
| Soft tissues and bones | 583 | 553/583 (94.8%) | 556/583 (95.4%) | 16/583 (2.7%) |
| Upper abdomen | 475 | 426/475 (89.7%) | 434/475 (91.4%) | 34/475 (7.2%) |
| Total | 4,157 | 3,806/4,157 (91.6%) | 3,986/4,157 (95.9%) | 274/4,157 (6.6%) |
Accuracy stratified by prediction probability
| Prediction probability threshold | Number of statements | Accuracy by strict criteria | Accuracy by modified criteria |
|---|---|---|---|
| 0.1 | 4,157 | 3806/4157 (91.6%) | 3986/4157 (95.9%) |
| 0.15 | 4,155 | 3806/4155 (91.6%) | 3986/4155 (95.9%) |
| 0.2 | 4,154 | 3806/4154 (91.6%) | 3986/4154 (96.0%) |
| 0.25 | 4,154 | 3806/4154 (91.6%) | 3986/4154 (96.0%) |
| 0.3 | 4,151 | 3805/4151 (91.7%) | 3985/4151 (96.0%) |
| 0.35 | 4,143 | 3802/4143, (91.8%) | 3982/4143 (96.1%) |
| 0.4 | 4,127 | 3794/4127 (91.9%) | 3974/4127 (96.3%) |
| 0.45 | 4,112 | 3786/4112 (92.1%) | 3966/4112 (96.4%) |
| 0.5 | 4,094 | 3779/4094 (92.3%) | 3958/4094 (96.7%) |
| 0.55 | 4,069 | 3771/4069 (92.7%) | 3945/4069 (97.0%) |
| 0.6 | 4,034 | 3761/4034 (93.2%) | 3934/4034 (97.5%) |
| 0.65 | 4,009 | 3747/4009 (93.5%) | 3920/4009 (97.8%) |
| 0.7 | 3,961 | 3711/3961 (93.7%) | 3881/3961 (98.0%) |
| 0.75 | 3,925 | 3693/3925 (94.1%) | 3854/3925 (98.2%) |
| 0.8 | 3,886 | 3667/3886 (94.4%) | 3824/3886 (98.4%) |
| 0.85 | 3,858 | 3651/3858 (94.6%) | 3806/3858 (98.7%) |
| 0.9 | 3,810 | 3623/3810 (95.1%) | 3773/3810 (99.0%) |
| 0.95 | 3,715 | 3546/3715 (95.5%) | 3689/3715 (99.3%) |
| 1.0 | 3,262 | 3187/3262 (97.8%) | 3259/3262 (99.9%) |
Fig. 4Accuracy of the algorithm in correctly labeling statements by strict and modified criteria at various prediction probability thresholds