| Literature DB >> 33880797 |
Hang-Tong Hu1,2, Wei Wang1, Li-Da Chen1, Si-Min Ruan1, Shu-Ling Chen1, Xin Li3, Ming-De Lu1,2, Xiao-Yan Xie1, Ming Kuang1,2.
Abstract
BACKGROUND AND AIM: This study aims to construct a strategy that uses assistance from artificial intelligence (AI) to assist radiologists in the identification of malignant versus benign focal liver lesions (FLLs) using contrast-enhanced ultrasound (CEUS).Entities:
Keywords: artificial intelligence; computer-assisted; diagnosis; liver neoplasms; ultrasonography
Mesh:
Substances:
Year: 2021 PMID: 33880797 PMCID: PMC8518504 DOI: 10.1111/jgh.15522
Source DB: PubMed Journal: J Gastroenterol Hepatol ISSN: 0815-9319 Impact factor: 4.029
Baseline characteristics of the included datasets
| Data sets | Development set | Testing set |
| |
|---|---|---|---|---|
| Reference standard | Malignant, No. | 281 | 164 | 0.984 |
| Benign, No. | 82 | 47 | ||
| Gender | Male, No. | 273 | 152 | 0.457 |
| Female, No. | 90 | 59 | ||
| Age | Mean ± SD, year | 52.64 ± 13.77 | 54.30 ± 12.58 | 0.151 |
| Lesion size | Mean ± SD, cm | 5.10 ± 3.27 | 4.74 ± 4.05 | 0.245 |
| No. of images | 614,728 (augmented) | 616 | ‐ | |
| Ultrasound devices | Types, No. | 5 | 6 | ‐ |
| CEUS examiners | No. | 10 | 11 | ‐ |
CEUS, contrast‐enhanced ultrasound; No., number; SD, standard deviation.
Figure 1Flowchart of data preparation and AI development. Data preparation consisted of data collection, decomposition of video clips into frames, frame selection, and image cropping into square four‐phase AI inputs. AI development consisted of input, network architectures, and output.
Detailed performance comparison between the AI and the four radiologists on the testing set
| Statistics | ACC | Se | Sp | PPV | NPV |
|---|---|---|---|---|---|
| AI |
|
|
|
|
|
| Expert1 | 0.867 (0.822, 0.913) | 0.884 (0.835, 0.933) | 0.809 (0.696, 0.921) | 0.942 (0.905, 0.979) | 0.667 (0.544, 0.789) |
| Expert2 | 0.863 (0.816, 0.909) | 0.902 (0.857, 0.948) | 0.723 (0.596, 0.851) | 0.919 (0.877, 0.961) | 0.680 (0.551, 0.809) |
| Resident1 | 0.839 (0.789, 0.888) | 0.896 (0.850, 0.943) | 0.638 (0.501, 0.776) | 0.896 (0.850, 0.943) | 0.638 (0.501, 0.776) |
| Resident2 | 0.820 (0.768, 0.872) | 0.884 (0.835, 0.933) | 0.596 (0.455, 0.736) | 0.884 (0.835, 0.933) | 0.596 (0.455, 0.736) |
|
| 0.256 | 0.419 | 0.297 | 0.385 | 0.453 |
|
| 0.021 | 0.406 | 0.016 | 0.052 | 0.157 |
ACC, accuracy; Se, sensitivity; Sp, specificity. Bold fonts indicate the best performance per column.
Statistically significant (P < 0.05).
Figure 2Performance comparison between AI and radiologists. (a) Error rate (1‐accuracy) comparison between AI and radiologists. (b) Detailed comparison of diagnostic sensitivity and specificity between AI and the radiologists.
Performance comparison of the four radiologists between radiologist‐alone and AI assisted radiologists on the testing set
| Statistics | ACC | Se | Sp | PPV | NPV | |
|---|---|---|---|---|---|---|
| Expert 1 | Alone/AI assisted | 0.867/0.924 | 0.884/0.970 | 0.809/ | 0.942/ | 0.667/0.878 |
|
| 0.080 | 0.006 | 0.801 | 0.998 | 0.031 | |
| Expert 2 | Alone/AI assisted | 0.863/ | 0.902/0.982 | 0.723/0.745 | 0.919/0.931 | 0.680/0.921 |
|
| 0.038 | 0.005 | 1.000 | 0.852 | 0.014 | |
| Resident 1 | Alone/AI assisted | 0.839/0.910 | 0.896/0.970 | 0.638/0.702 | 0.896/0.919 | 0.638/0.868 |
|
| 0.040 | 0.015 | 0.661 | 0.594 | 0.031 | |
| Resident 2 | Alone/AI assisted | 0.820/0.919 | 0.884/ | 0.596/0.660 | 0.884/0.911 | 0.596/ |
|
| ‐ | 0.004 | <0.001 | 0.670 | 0.528 | 0.001 |
ACC, accuracy; Se, sensitivity; Sp, specificity. Bold fonts indicate the best performance per column.
Statistically significant (P < 0.05).
Figure 3Performance validation of the strategy of AI assistance in the testing dataset. Performance comparison between radiologists with AI assistance and radiologists alone.
Performance comparison between the four radiologists with AI assistance on the testing set
| Statistics | ACC | Se | Sp | PPV | NPV |
|---|---|---|---|---|---|
| Expert1 | 0.924 (0.888, 0.960) | 0.970 (0.943, 0.996) |
|
| 0.878 (0.778, 0.978) |
| Expert2 |
| 0.982 (0.961, 1.000) | 0.745 (0.620, 0.869) | 0.931 (0.893, 0.968) | 0.921 (0.835, 1.000) |
| Resident1 | 0.910 (0.871, 0.949) | 0.970 (0.943, 0.996) | 0.702 (0.571, 0.833) | 0.919 (0.878, 0.960) | 0.868 (0.761, 0.976) |
| Resident2 | 0.919 (0.883, 0.956) |
| 0.660 (0.524, 0.795) | 0.911 (0.869, 0.952) |
|
|
| 0.904 | 0.360 | 0.671 | 0.818 | 0.460 |
ACC, accuracy; Se, sensitivity; Sp, specificity. Bold fonts indicate the best performance per column.