| Literature DB >> 32755895 |
Kun-Hsing Yu1,2,3, Tsung-Lu Michael Lee4, Ming-Hsuan Yen5,6, S C Kou2, Bruce Rosen7,8, Jung-Hsien Chiang6, Isaac S Kohane1,8.
Abstract
BACKGROUND: Chest computed tomography (CT) is crucial for the detection of lung cancer, and many automated CT evaluation methods have been proposed. Due to the divergent software dependencies of the reported approaches, the developed methods are rarely compared or reproduced.Entities:
Keywords: computed tomography, spiral; early detection of cancer; lung cancer; machine learning; reproducibility of results
Mesh:
Year: 2020 PMID: 32755895 PMCID: PMC7439139 DOI: 10.2196/16709
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1The log-loss score distribution of the top 250 teams in the Kaggle Data Science Bowl Competition. The log-loss scores of the public test set and the final test set of each team were plotted. The red horizontal line indicates the log-loss of outputting the cancer probability as 0.5 for each patient. The blue horizontal line shows the log-loss of outputting cancer probability of each patient as the prevalence of cancer (0.26) in the training set.
Figure 2A weak to moderate correlation between the log-loss scores of the public test set and the scores of the final test set. The red regression line shows the relation between the log-loss scores of the public test set and those of the final test set using a linear regression model. (A) The log-transformed scores of all participants who finished both stages of the Kaggle Data Science Bowl Competition were plotted. The Spearman correlation coefficient of the performance in the two test sets is .23. (B) The log-transformed scores of the top 10 teams defined by the final test set performance. The Spearman correlation coefficient among the top 10 teams is .39.
Figure 3A model of the informatics workflow used by most teams. In addition to the Kaggle training set, most teams obtained additional publicly available datasets with annotations. Lung segmentation, image rescaling, and nodule segmentation modules were commonly used before classification.
Comparisons of the top-performing solutions of the Kaggle Data Science Bowl.
| Rank | Team name | Additional datasets used | Data preprocessing | Nodule segmentation | Classification algorithms | Implementation | Final test set score |
| 1 | Grt123 | LUNA16a | Lung segmentation, intensity normalization | Variant of U-Net | Neural network with a max-pooling layer and two fully connected layers | Pytorch | 0.39975 |
| 2 | Julian de Wit and Daniel Hammack | LUNA16, LIDCb | Rescale to 1×1×1 | C3Dc, ResNet-like CNNd | C3D, ResNet-like CNN | Keras, Tensorflow, Theano | 0.40117 |
| 3 | Aidence | LUNA16 | Rescale to 2.5×0.512×0.512 (for nodule detection) and 1.25×0.5×0.5 (for classification) | ResNete | 3D DenseNetf multitask model (different loss functions depending on the input source) | Tensorflow | 0.40127 |
| 4 | qfpxfd | LUNA16, SPIE-AAPMg | Lung segmentation | Faster R-CNNh, with 3D CNN for false positive reduction | 3D CNN inspired by VGGNet | Keras, Tensorflow, Caffe | 0.40183 |
| 5 | Pierre Fillard (Therapixel) | LUNA16 | Rescale to 0.625×0.625×0.625, lung segmentation | 3D CNN inspired by VGGNet | 3D CNN inspired by VGGNet | Tensorflow | 0.40409 |
| 6 | MDai | None | Rescale to 1×1×1, normalize HUi | 2D and 3D ResNet | 3D ResNet + a Xgboost classifier incorporating CNN output, patient sex, # nodules, and other nodule features | Keras, Tensorflow, Xgboost | 0.41629 |
| 7 | DL Munich | LUNA16 | Rescale to 1×1×1, lung segmentation | U-Net | 2D and 3D residual neural network | Tensorflow | 0.42751 |
| 8 | Alex, Andre, Gilberto, and Shize | LUNA16 | Rescale to 2×2×2 | Variant of U-Net | CNN, tree-based classifiers (with better performance) | Keras, Theano, xgboost, extraTree | 0.43019 |
| 9 | Deep Breath | LUNA16, SPIE-AAPMj | Lung mask | Variant of SegNet | Inception-ResNet v2 | Theano and Lasagne | 0.43872 |
| 10 | Owkin Team | LUNA16 | Lung segmentation | U-Net, 3D VGGNet | Gradient boosting | Keras, Tensorflow, xgboost | 0.44068 |
aLUNA16: Lung Nodule Analysis 2016.
bLIDC: Lung Image Database Consortium.
cC3D: convolutional 3D.
dResNet-like CNN: residual net–like convolutional neural network.
eResNet: residual net.
fDenseNet: dense convolutional network.
gSPIE-AAPM: International Society for Optics and Photonics–American Association of Physicists in Medicine Lung CT Challenge.
hR-CNN: region-based convolutional neural networks.
iHU: Hounsfield unit.
jDataset has been evaluated but not used in building the final model.
A summary of the chest computed tomography datasets employed by the participants.
| Datasets | Number of CTa scan series | Data originated from multiple sites | Availability of nodule locations | Availability of nodule segmentations | Availability of patients’ diagnoses (benign versus malignant) |
| Kaggle Data Science Bowl (this competition) | Training: 1397; public test set: 198; final test set: 506 | Yes | No | No | Yes |
| Lung nodule analysis | 888 | Yes | Yes | Yes | Yes |
| SPIE-AAPMb Lung CT Challenge | 70 | No | Yes | No | Yes |
| Lung Image Database Consortium | 1398 | Yes | Yes | Yes | Yes |
aCT: computed tomography.
bSPIE-AAPM: International Society for Optics and Photonics–American Association of Physicists in Medicine.
Figure 4The most widely used dependencies by the top 10 teams. The packages are ordered by their prevalence among the top teams. For simplicity, dependencies used by only one team are omitted from the figure.