| Literature DB >> 29487880 |
Barath Narayanan Narayanan1, Russell Craig Hardie1, Temesguen Messay Kebede1.
Abstract
We study the performance of a computer-aided detection (CAD) system for lung nodules in computed tomography (CT) as a function of slice thickness. In addition, we propose and compare three different training methodologies for utilizing nonhomogeneous thickness training data (i.e., composed of cases with different slice thicknesses). These methods are (1) aggregate training using the entire suite of data at their native thickness, (2) homogeneous subset training that uses only the subset of training data that matches each testing case, and (3) resampling all training and testing cases to a common thickness. We believe this study has important implications for how CT is acquired, processed, and stored. We make use of 192 CT cases acquired at a thickness of 1.25 mm and 283 cases at 2.5 mm. These data are from the publicly available Lung Nodule Analysis 2016 dataset. In our study, CAD performance at 2.5 mm is comparable with that at 1.25 mm and is much better than at higher thicknesses. Also, resampling all training and testing cases to 2.5 mm provides the best performance among the three training methods compared in terms of accuracy, memory consumption, and computational time.Entities:
Keywords: computed tomography; computer-aided detection; downsampling; lung nodules; slice thickness
Year: 2018 PMID: 29487880 PMCID: PMC5818068 DOI: 10.1117/1.JMI.5.1.014504
Source DB: PubMed Journal: J Med Imaging (Bellingham) ISSN: 2329-4302
Fig. 1Top-level block diagram of the CAD system adopted from Ref. 8.
Fig. 2Nodule image from the case “sub0_p73” at (a) native 1.25-mm thickness, (b) simulated 2.5-mm thickness, (c) simulated 5-mm thickness, and (d) simulated 10-mm thickness.
Fig. 3Nodule image from the case “sub0_p32” at (a) native 1.25-mm thickness, (b) simulated 2.5-mm thickness, (c) simulated 5-mm thickness, and (d) simulated 10-mm thickness.
Fig. 4SFS merit function for 1.25-mm LUNA16 training set at all thickness levels.
Feature selected using SFS for classification using 1.25-mm LUNA16 training dataset at all thickness levels.
| Feature name | 1.25 mm | 2.5 mm | 5 mm | 10 mm |
|---|---|---|---|---|
| Number of slices | X | — | X | — |
| Equivalent diameter | X | X | X | — |
| Periapsis | — | — | X | X |
| Circularity | — | X | — | — |
| Elongation | X | X | — | — |
| Minimum voxel LCE2 | — | — | — | X |
| Standard deviation inside LCE1 | — | — | X | — |
| Fisher ratio | — | X | — | — |
| Moment 1 | X | X | — | — |
| Moment 1 LCE2 | — | — | X | X |
| Radial-deviation mean outside | — | — | X | — |
| Radial-deviation standard deviation outside | — | — | — | X |
| Radial-gradient standard deviation outside LCE2 | — | — | — | X |
| Standard deviation inside | — | X | — | — |
| Fisher ratio 1 | — | — | — | X |
| Standard deviation separation 3 | X | X | X | X |
| Fisher ratio 3 | X | — | — | — |
| Fisher ratio LCE1 | — | X | — | X |
| Contrast | — | — | — | X |
| Fisher ratio | — | X | X | — |
| Gradient magnitude mean outside 1 | X | — | — | — |
| Radial-deviation mean outside 2 | — | — | X | — |
| Radial-deviation mean outside 3 | — | — | X | — |
| Radial-deviation standard deviation outside 2 | — | — | — | X |
| Radial-deviation standard deviation outside 3 | — | — | X | — |
| Radial-gradient perimeter standard deviation separation inside | — | X | — | — |
| Radial-gradient perimeter mean outside 1 | X | — | — | — |
| Radial-gradient perimeter mean outside 2 | X | — | — | — |
| Radial-gradient perimeter standard deviation outside 1 | X | — | — | — |
| Radial-deviation mean inside | — | X | — | — |
| Radial-gradient standard deviation outside 2 | — | — | — | X |
| Radial-deviation mean separation | — | X | — | — |
| Surface gradient LCE1 | X | — | — | — |
| Area outside | X | — | — | — |
| Distance to center projection | X | — | X | X |
| Standard deviation voxel below | — | X | — | — |
| Standard deviation voxel below LCE1 | — | — | — | X |
| Bottom shadow fraction | — | — | — | X |
| X-fraction global | — | X | X | — |
Fig. 5FROC curves comparing CAD performance at all thickness levels.
Overall CAD performance comparison at all thickness levels.
| Type of dataset (based on thickness) | Candidate detector sensitivity (before classification) | Number of features selected for classification | Overall CAD performance AUC (0 to 10 FPs) | 95% Confidence AUC (0 to 10 FPs) interval | ANODE score |
|---|---|---|---|---|---|
| Native 1.25 mm | 91.37 | 13 | 7.16 | 0.496 | |
| Simulated 2.5 mm | |||||
| Simulated 5 mm | 92.24 | 13 | 6.13 | 0.418 | |
| Simulated 10 mm | 85.34 | 14 | 4.34 | 0.277 |
Note: Bold values represent the best performance among the compared methods.
Training and testing dataset compositions for different methods of classification—experiment based on 1.25-mm testing dataset.
| Classification approach | Training dataset (number of cases) | Testing dataset (number of cases) | |||
|---|---|---|---|---|---|
| 1.25 mm | 2.5 mm | 1.25 to 2.5 mm | 1.25 mm | 1.25 to 2.5 mm | |
| Aggregate | 112 | 283 | 0 | 80 | 0 |
| Homogeneous thickness | 112 | 0 | 0 | 80 | 0 |
| Common thickness | 0 | 283 | 112 | 0 | 80 |
Fig. 6SFS merit function for different training methods.
Fig. 7FROC curves comparing overall CAD performance using different training methods for 1.25-mm testing dataset utilizing the composition provided in Table 3.
Overall CAD performance comparison using different training methods for 1.25-mm testing dataset.
| Training method | Candidate detector sensitivity (before classification) | Number of features selected for classification | Overall CAD performance AUC (0 to 10 FPs) | 95% Confidence AUC (0 to 10 FPs) interval | ANODE score |
|---|---|---|---|---|---|
| Aggregate | 91.37 | 11 | 7.36 | 0.530 | |
| Homogeneous thickness | 91.37 | 13 | 7.16 | 0.496 | |
| Common thickness |
Note: Bold values represent the best performance among the compared methods.
Training and testing dataset compositions for different methods of classification—experiment based on 2.5-mm testing dataset.
| Classification approach | Training dataset (number of cases) | Testing dataset 2.5-mm slice thickness (number of cases) | ||
|---|---|---|---|---|
| 1.25 mm | 2.5 mm | 1.25 to 2.5 mm | ||
| Aggregate | 192 | 183 | 0 | 100 |
| Homogeneous thickness | 0 | 183 | 0 | 100 |
| Common thickness | 0 | 183 | 192 | 100 |
Fig. 8FROC curves comparing overall CAD performance using different training methods for 2.5-mm testing dataset utilizing the composition provided in Table 5.
Overall CAD performance comparison using different training methods for 2.5-mm testing dataset.
| Training method | Candidate detector sensitivity (before classification) | Number of features selected for classification | Overall CAD performance AUC (0 to 10 FPs) | 95% Confidence AUC (0 to 10 FPs) interval | ANODE score |
|---|---|---|---|---|---|
| Aggregate | 96.49 | 13 | 8.57 | 0.705 | |
| Homogeneous thickness | 96.49 | 12 | 8.68 | 0.718 | |
| Common thickness |
Note: Bold values represent the best performance among the compared methods.
Fig. 9FROC curves comparing aggregate and common thickness training methods for the entire LUNA16 dataset.
Overall CAD performance comparison using different training methods for the entire LUNA16 dataset.
| Training method | Candidate detector sensitivity (before classification) | Overall CAD performance AUC (0 to 10 FPs) | 95% Confidence AUC (0 to 10 FPs) interval | ANODE score |
|---|---|---|---|---|
| Aggregate | 90.36 | 7.75 | 0.596 | |
| Common thickness |
Note: Bold values represent the best performance among the compared methods.