| Literature DB >> 35545628 |
Thodsawit Tiyarattanachai1, Terapap Apiparakoon2, Sanparith Marukatat3, Sasima Sukcharoen4, Sirinda Yimsawad2, Oracha Chaichuen5, Siwat Bhumiwat6, Natthaporn Tanpowpong7, Nutcha Pinjaroen6, Rungsun Rerknimitr2, Roongruedee Chaiteerakij8.
Abstract
Despite the wide availability of ultrasound machines for hepatocellular carcinoma surveillance, an inadequate number of expert radiologists performing ultrasounds in remote areas remains a primary barrier for surveillance. We demonstrated feasibility of artificial intelligence (AI) to aid in the detection of focal liver lesions (FLLs) during ultrasound. An AI system for FLL detection in ultrasound videos was developed. Data in this study were prospectively collected at a university hospital. We applied a two-step training strategy for developing the AI system by using a large collection of ultrasound snapshot images and frames from full-length ultrasound videos. Detection performance of the AI system was evaluated and then compared to detection performance by 25 physicians including 16 non-radiologist physicians and 9 radiologists. Our dataset contained 446 videos (273 videos with 387 FLLs and 173 videos without FLLs) from 334 patients. The videos yielded 172,035 frames with FLLs and 1,427,595 frames without FLLs for training on the AI system. The AI system achieved an overall detection rate of 89.8% (95%CI: 84.5-95.0) which was significantly higher than that achieved by non-radiologist physicians (29.1%, 95%CI: 21.2-37.0, p < 0.001) and radiologists (70.9%, 95%CI: 63.0-78.8, p < 0.001). Median false positive detection rate by the AI system was 0.7% (IQR: 1.3%). AI system operation speed reached 30-34 frames per second, showing real-time feasibility. A further study to demonstrate whether the AI system can assist operators during ultrasound examinations is warranted.Entities:
Mesh:
Year: 2022 PMID: 35545628 PMCID: PMC9095624 DOI: 10.1038/s41598-022-11506-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Examples of focal liver lesions. The lesions are indicated with blue markers.
Figure 2Example of the same lesion appearing as a distinct observation when the ultrasound beam passes through the center of the lesion (2a). In contrast, the lesion appears as a small faint observation when the ultrasound beam passes through the periphery of the lesion (2b). Left panels show the original frames. Right panels show the labeled location of the lesion.
Figure 3AI system development.
Figure 4Difficult frames in Group FN used to train the AI system in the Step 2 Training. These are frames with faint lesions not detected by Model B pre-trained on ultrasound snapshot images in the Step 1 Training. Left panels show the original frames. Right panels show labeled location of the lesions (a: HCC, b: cyst, c: hemangioma, d: FFS, e: FFI).
Figure 5Difficult frames in Group FP used to train the AI system in the Step 2 Training. These are frames in which Model B pre-trained on ultrasound snapshot images in the Step 1 Training falsely detected other structures (arrows) as FLLs (a: blood vessel, b: stomach, c: fibrous tissue, d: parenchymal heterogeneity (dashed circle)).
Figure 6Process in performance evaluation of the AI system, non-radiologist physicians and radiologists. Detection rates by the AI system and physicians were evaluated using the same formula. Evaluation of false positive (FP) detection were different for the AI system and physicians, thus they should not be directly compared. Specifically, the AI system was evaluated by false positive detection rate, while physicians were evaluated by number of false positive detections.
Dataset characteristics.
| Number | All sets | Training set | Tuning set | Test set |
|---|---|---|---|---|
| Patients | 334 | 123 | 36 | 175 |
| Total videos | 446 | 228 | 43 | 175 |
| Videos with lesions | 273 | 145 | 43 | 85 |
| Types of lesions | ||||
| Total | 387 | 199 | 61 | 127 |
| HCC | 71 | 39 | 9 | 23 |
| Cyst | 138 | 84 | 20 | 34 |
| Hemangioma | 78 | 38 | 13 | 27 |
| FFS | 69 | 24 | 15 | 30 |
| FFI | 31 | 14 | 4 | 13 |
| Videos without lesions | 173 | 83 | 0 | 90 |
| Frames with lesions | 267,820 | 172,035 | 40,923 | 54,862 |
| Frames without lesions | 1,879,727 | 1,427,595 | 162,338 | 289,794 |
| Difficult frames with lesions (Group FN) | NA | 28,443 | NA | NA |
| Difficult frames without lesions (Group FP) | NA | 27,163 | NA | NA |
| Median size in cm (IQR) | ||||
| HCC | 2.2 (2.0) | 2.3 (3.1) | 2.1 (1.8) | 1.7 (1.0) |
| Cyst | 1.1 (0.4) | 1.0 (0.4) | 1.4 (0.5) | 1.3 (0.5) |
| Hemangioma | 1.4 (0.9) | 1.5 (0.8) | 1.5 (0.5) | 1.2 (0.9) |
| FFS | 1.2 (0.6) | 1.2 (0.6) | 1.1 (1.3) | 1.1 (0.4) |
| FFI | 1.9 (0.9) | 2.0 (0.4) | 1.9 (0.7) | 1.8 (0.9) |
NA not applicable; HCC hepatocellular carcinoma; FFS focal fatty sparing; FFI focal fatty infiltration; FN false negative; FP false positive; IQR interquartile range.
Performance results of AI system, non-radiologist physicians and radiologists on the test set.
| AI system | Non-radiologist physicians | Radiologists | |||
|---|---|---|---|---|---|
| Per-lesion detection rate | |||||
| Overall | 89.8% (84.5–95.0) | 29.1% (21.2–37.0) | < 0.001 | 70.9% (63.0–78.8) | < 0.001 |
| HCC | 100% (85.2–100) | 39.1% (19.2–59.1) | < 0.001 | 69.6% (50.8–88.4) | 0.016 |
| Cyst | 82.4% (69.5–95.2) | 29.4% (14.1–44.7) | < 0.001 | 73.5% (58.7–88.4) | 0.38 |
| Hemangioma | 85.2% (71.8–98.6) | 40.7% (22.2–59.3) | < 0.001 | 77.8% (62.1–93.5) | 0.73 |
| FFS | 96.7% (90.2–100) | 13.3% (1.2–25.5) | < 0.001 | 63.3% (46.1–80.6) | 0.006 |
| FFI | 84.6% (65.0–100) | 23.1% (0.2–46.0) | 0.008 | 69.2% (44.1–94.3) | 0.63 |
| Median (IQR) false positive detection rate | 0.7% (IQR 1.3%) | NA | NA | NA | NA |
| False positive detections | NA | 118 false positive detections from 175 videos | NA | 204 false positive detections from 175 videos | NA |
95% CIs are shown in brackets.
NA not applicable; HCC hepatocellular carcinoma; FFS focal fatty sparing; FFI focal fatty infiltration; IQR interquartile range.