| Literature DB >> 35107593 |
Elisabeth von Brandis1,2, Håvard B Jenssen3, Derk F M Avenarius4,5, Atle Bjørnerud3,6, Berit Flatø7,8, Anders H Tomterstad3, Vibke Lilleby8, Karen Rosendahl4,5, Tomas Sakinis3,7, Pia K K Zadig4,5, Lil-Sofie Ording Müller3.
Abstract
BACKGROUND: Manual assessment of bone marrow signal is time-consuming and requires meticulous standardisation to secure adequate precision of findings.Entities:
Keywords: Adolescents; Artificial intelligence; Bone marrow; Children; Convolutional neural network; Magnetic resonance imaging; Segmentation
Mesh:
Year: 2022 PMID: 35107593 PMCID: PMC9107442 DOI: 10.1007/s00247-021-05270-x
Source DB: PubMed Journal: Pediatr Radiol ISSN: 0301-0449
Fig. 1a–c MRI, coronal T2-W Dixon water-only of the knee in a healthy and asymptomatic 14-year-old girl. The perceived intensity level of the periphyseal bone marrow hyperintensity in the distal femur varies considerably with different window (W) and level (C) settings: (a) C192/W501, (b) C122/W271 and (c) C96/W198
Fig. 2Bone marrow signal was divided into three intensity levels: 1 = slightly increased with diffuse distribution; 2 = focal and mildly increased; and 3 = focal and moderately to highly increased, up to fluid-like signal (turquoise = level 1, blue = level 2, yellow = level 3). These images illustrate the defined intensity levels on coronal T2-W Dixon water-only images of the knee (a–c) with corresponding segmentation masks (d–f) in a 12-year-old boy with chronic non-bacterial osteomyelitis and knee symptoms (a, d), a 14-year-old healthy and asymptomatic boy (b, e) and a 15-year-old girl with chronic non-bacterial osteomyelitis and knee symptoms (c, f)
Fig. 3Simplified illustration of the iterative segmentation process. The model, which is initially trained on small amounts of data, contributes output that is then manually corrected to further produce training data. This process cuts down on data preparation time and helps to identify areas the preliminary model is struggling with, which allows for focused adjustment of hyperparameters and network architecture to resolve the largest systematic errors. AI artificial intelligence
Median and mean Dice similarity coefficient between ground truth and the segmentations performed by the artificial intelligence (AI) model and the four readers
| Reader 1 | Reader 2 | Reader 3 | Reader 4 | AI model | ||
|---|---|---|---|---|---|---|
| Level-1 signal (turquoise) | Median (range) | 0.80 (0.69–0.90) | 0.83 (0.70–0.88) | 0.73 (0.42–0.81) | 0.75 (0.57–0.84) | 0.68 (0.60–0.74) |
| Mean | 0.81 | 0.8 | 0.67 | 0.72 | 0.68 | |
| Level-2 signal (blue) | Median (range) | 0.72 (0.35–0.87) | 0.67 (0.23–0.84) | 0.55 (0.29–0.78) | 0.17 (0.01–0.53) | 0.47 (0.25–0.62) |
| Mean | 0.7 | 0.64 | 0.55 | 0.17 | 0.45 | |
| Level-3 signal (yellow) | Median (range) | 0.64 (0.20–0.87) | 0.67 (0.52–0.93) | 0.59 (0.44–0.75) | 0.00 (0.00–0.89) | 0.40 (0.00–0.71) |
| Mean | 0.6 | 0.69 | 0.59 | 0.11 | 0.38 | |
| Combined levels 2 + 3 signal | Median (range) | 0.79 (0.33–0.86) | 0.71 (0.19–0.89) | 0.61 (0.38–0.78) | 0.18 (0.01–0.73) | 0.55 (0.24–0.74) |
| Mean | 0.7 | 0.69 | 0.6 | 0.21 | 0.5 | |
Fig. 4Boxplot illustrates the performance of the artificial intelligence (AI) model compared to ground truth for the different signal intensities expressed by the Dice similarity coefficient
Consensus scores with means and standard deviations (SD) for each intensity level and for the overall impression of the segmentation masks representing ground truth (GT) and the segmentations performed by the artificial intelligence (AI) model, and for the differences between the two scores (DiffAI-GT)
| Mask number | Number | Age | Level-1 signal | Level-2 signal | Level-3 signal | Overall | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GT | AI | Diff AI-GT | GT | AI | Diff AI-GT | GT | AI | Diff AI-GT | GT | AI | Diff AI-GT | |||
| 1 | 16 | 14.0 | 3.4 | 3.4 | 0.0 | 3.2 | 3.4 | 0.2 | 3.4 | 5.0 | 1.6 | 3.4 | 5.0 | 1.6 |
| 2 | 16 | 13.9 | 1.5 | 5.0 | 3.5 | 2.4 | 5.0 | 2.6 | 1.5 | 6.8 | 5.3 | 2.0 | 6.0 | 4.0 |
| 3 | 15 | 10.2 | 3.4 | 3.4 | 0.0 | 3.5 | 3.4 | –0.1 | 3.0 | 5.0 | 2.0 | 3.2 | 4.2 | 1.0 |
| 4 | 15 | 9.6 | 2.0 | 1.8 | –0.2 | 3.0 | 1.8 | –1.2 | 3.4 | 4.0 | 0.6 | 3.2 | 3.5 | 0.3 |
| 5 | 16 | 9.5 | 4.2 | 1.8 | –2.4 | 5.0 | 1.8 | –3.2 | 0 | 1.8 | 1.8 | 3.0 | 1.8 | –1.2 |
| 6 | 15 | 7.6 | 1.8 | 3.0 | 1.2 | 3.4 | 3.8 | 0.4 | 6.8 | 3.8 | 3.8 | 4.0 | 3.8 | –0.2 |
| 7 | 14 | 14.3 | 1.5 | 1.0 | –0.5 | 1.5 | 3.4 | 1.9 | 0.5 | 3.8 | 3.8 | 1.0 | 3.5 | 2.5 |
| 8 | 16 | 7.8 | 1.0 | 0.5 | –0.5 | 2.5 | 2.0 | –0.5 | 0.5 | 0.5 | 0.5 | 2.0 | 1.0 | –1.0 |
| 9 | 15 | 11.0 | 3.2 | 1.8 | –1.4 | 3.4 | 0.5 | –2.9 | 1.3 | 3.8 | 3.8 | 1.8 | 3.0 | 1.2 |
| 10 | 15 | 9.6 | 2.0 | 1.8 | –0.2 | 2.0 | 2.0 | 0.0 | 0 | 0 | 0.0 | 1.0 | 1.5 | 0.5 |
| Mean | 15.30 | 10.8 | 2.40 | 2.35 | –0.50 | 2.99 | 2.71 | –0.28 | 2.04 | 3.45 | 1.41 | 2.46 | 3.33 | 0.87 |
| SD | 0.67 | 2.5 | 1.06 | 1.34 | 1.56 | 0.97 | 1.31 | 1.83 | 2.14 | 2.11 | 2.22 | 1.04 | 1.57 | 1.58 |
SD standard deviation
aIntensity level 1 = slightly increased with diffuse distribution
bIntensity level 2 = focal and mildly increased signal
cIntensity level 3 = focal and moderately to highly increased, up to fluid-like signal
Fig. 5a, b MRI, coronal T2-W Dixon water-only of the knee in an 8-year-old healthy and asymptomatic girl. Image (b) includes the segmentation mask performed by the model. The small foci of level-3 signal shown with arrows in image (a) are either missed (no color coding in image b) or incorrectly labelled with a lower intensity level by the model (coded with either blue or turquoise in image b)