| Literature DB >> 35454914 |
Catharina Silvia Lisson1,2,3, Christoph Gerhard Lisson1, Marc Fabian Mezger1,3,4, Daniel Wolf1,3,4, Stefan Andreas Schmidt1,2, Wolfgang M Thaiss1,3,5, Eugen Tausch6,7, Ambros J Beer2,3,5,8,9, Stephan Stilgenbauer6,7, Meinrad Beer1,2,3,8,9, Michael Goetz1,3,10.
Abstract
Mantle cell lymphoma (MCL) is a rare lymphoid malignancy with a poor prognosis characterised by frequent relapse and short durations of treatment response. Most patients present with aggressive disease, but there exist indolent subtypes without the need for immediate intervention. The very heterogeneous behaviour of MCL is genetically characterised by the translocation t(11;14)(q13;q32), leading to Cyclin D1 overexpression with distinct clinical and biological characteristics and outcomes. There is still an unfulfilled need for precise MCL prognostication in real-time. Machine learning and deep learning neural networks are rapidly advancing technologies with promising results in numerous fields of application. This study develops and compares the performance of deep learning (DL) algorithms and radiomics-based machine learning (ML) models to predict MCL relapse on baseline CT scans. Five classification algorithms were used, including three deep learning models (3D SEResNet50, 3D DenseNet, and an optimised 3D CNN) and two machine learning models based on K-nearest Neighbor (KNN) and Random Forest (RF). The best performing method, our optimised 3D CNN, predicted MCL relapse with a 70% accuracy, better than the 3D SEResNet50 (62%) and the 3D DenseNet (59%). The second-best performing method was the KNN-based machine learning model (64%) after principal component analysis for improved accuracy. Our optimised CNN developed by ourselves correctly predicted MCL relapse in 70% of the patients on baseline CT imaging. Once prospectively tested in clinical trials with a larger sample size, our proposed 3D deep learning model could facilitate clinical management by precision imaging in MCL.Entities:
Keywords: deep learning; deep neural networks; machine learning; personalised oncology; precision imaging; radiomics
Year: 2022 PMID: 35454914 PMCID: PMC9028737 DOI: 10.3390/cancers14082008
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Recruitment pathway of the study.
Radiomic features used for model development.
| Radiomic Features of First Order: | Radiomic Features of Second Order: |
|---|---|
| Skewness | Angular Second Moment |
| Uniformity | Autocorrelation |
| Entropy | Correlation |
| Kurtosis | Contrast |
| Minimum Histogram Gradient | Energy |
| Maximum Histogram Gradient | Joint Average |
| Coefficient Variation | Sum Average |
| Quartile Coefficient Dispersion | Joint Maximum |
| P 10th (10th percentile) | Joint Entropy |
| P 25th (25th percentile) | Sum Entropy |
| P 50th (50th percentile) | Difference Entropy |
| P 90th (90th percentile) | Cluster Prominence |
| Interquartile Range | Cluster Shade |
| Minimum | Cluster Tendency |
| Maximum | Information Correlation |
| Mean | Information Correlation Difference |
| Mean Absolute Deviation | Inverse Difference |
| Median Absolute Deviation | Inverse Difference Normalised |
| Range | Inverse Difference Moment |
| Root Mean Square | Inverse Difference Moment Normalised |
| Standard Deviation | Difference Average |
| Variance | Difference Variance |
| Dissimilarity | |
| Inverse Variance | |
| Joint Variance | |
| Sum Variance |
Figure 2The schematic diagram for processing and analysis in the machine learning approach. Morphological features, including volume and textural features of first and second-order, were obtained. Details regarding the extraction settings are listed in Appendix A, Table A1.
Figure 3Correlation plot depicting the hierarchical clustering of all extracted features and their discriminatory power between the relapse and the remission group.
Figure 4The schematic diagram for data processing in deep neural networks.
Baseline demographic and clinical data.
| Characteristic | Number |
|---|---|
| Sex | |
| Male | 26 (86.7%) |
| Female | 4 (13.3%) |
| Average age (range) | 62.2 ± 9.7 years (42–76) |
| Ann Arbor Stage | |
| Stage I | 0 |
| Stage II | 2 (6.7%) |
| Stage III | 5 (16.7%) |
| Stage IV | 23 (76.7%) |
| Patients’ status in 5-years follow up | |
| Complete remission (CR) | 17 (57%) |
| Relapse of disease (RD) | 13 (43%) |
The outcome of the radiomics-based machine learning models.
| Machine Learning Models | Accuracy | Precision | Sensitivity | F1-Score | AUC |
|---|---|---|---|---|---|
| KNN | 0.63 ± 0.02 | 0.64 ± 0.01 | 0.63 ± 0.02 | 0.62 ± 0.02 | 0.62 ± 0.02 |
| PCA KNN | 0.64 ± 0.02 | 0.64 ± 0.01 | 0.63 ± 0.02 | 0.62 ± 0.02 | 0.62 ± 0.02 |
| RF | 0.47 ± 0.07 | 0.50 ± 0.09 | 0.47 ± 0.07 | 0.45 ± 0.08 | 0.49 ± 0.07 |
| PCA RF | 0.58 ± 0.02 | 0.61 ± 0.04 | 0.58 ± 0.02 | 0.58 ± 0.02 | 0.58 ± 0.02 |
Figure 5The receiver operating characteristics (ROC) curves of the KNN and PCA KNN algorithm yielded an area under the curve (AUC) of 0.62. In contrast, PCA RF yielded an AUC of 0.58 for classification performance of complete remission versus relapse of MCL.
Figure 6The decision curve analysis for the best run of PCA KNN shows that within a threshold probability from 25% to 55%, checking patients based on the classification model leads to a higher benefit than checking no or checking all lymphomas more frequently.
The outcome of the deep learning models.
| Deep Learning Models | Accuracy | Precision | Sensitivity | F1-Score | AUC |
|---|---|---|---|---|---|
| Own 3D Net | 0.70 ± 0.02 | 0.71 ± 0.02 | 0.70 ± 0.02 | 0.69 ± 0.01 | 0.70 ± 0.04 |
| 3D DenseNet | 0.59 ± 0.05 | 0.64 ± 0.07 | 0.59 ± 0.05 | 0.57 ± 0.06 | 0.58 ± 0.13 |
| SEResNet50 | 0.62 ± 0.04 | 0.65 ± 0.07 | 0.62 ± 0.05 | 0.60 ± 0.04 | 0.62 ± 0.06 |
Figure 7The deep learning models’ receiver operating characteristics (ROC) curve shows our 3D network mostly outperforming the 3D DenseNet and 3D SEResNet50.
Figure 8The decision curve analysis for the best run of the novel 3D CNN shows that within the threshold probability from 30% to 65% checking patients based on the classification model leads to a higher benefit than checking no or checking all lymphomas more frequently.
Figure 9The grouped bar chart shows that our own 3D CNN has the highest performance among all prediction models.
Settings of the feature extraction.
| Setting | Determination |
|---|---|
| Bin Method | FBS |
| Bin Amount | 1 |
| LoG Filter | 0 |
| LoG Sigma | 1 |
| Matrix Aggregation | 3D Average |
| Resample Filter | 0 |
| Resample Spacing X | 1 |
| Resample Spacing Y | 1 |
| Resample Spacing Z | 0 |
| Second-Order Distance | 1 |
| Threshold Filter | 0 |
| Threshold Filter Min | −1000 |
| Threshold Filter Max | 3000 |