| Literature DB >> 36263203 |
Amrita Guha1, Jayant S Goda1, Archya Dasgupta2, Abhishek Mahajan1, Soutik Halder3, Jeetendra Gawde3, Sanjay Talole3.
Abstract
Background: Glioblastoma (GBM) and primary central nervous system lymphoma (PCNSL) are common in elderly yet difficult to differentiate on MRI. Their management and prognosis are quite different. Recent surge of interest in predictive analytics, using machine learning (ML) from radiomic features and deep learning (DL) for diagnosing, predicting response and prognosticating disease has evinced interest among radiologists and clinicians. The objective of this systematic review and meta-analysis was to evaluate the deep learning & ML algorithms in classifying PCNSL from GBM.Entities:
Keywords: deep learning; glioblastoma; machine learning; magnetic resonance imaging; metaanalysis; predictive analytics; primary central nervous system (CNS) lymphoma; systematic review
Year: 2022 PMID: 36263203 PMCID: PMC9574102 DOI: 10.3389/fonc.2022.884173
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1PRISMA 2009 Flow Diagram.
Figure 2Studies included in the meta-analysis with the quality of diagnostic accuracy studies (QUADAS) scores.
Summary of the study profile & methodology of the reviewed studies.
| Sr. No | Author/Year/country | Patient cohort | Classifier/algorithm used | Internal validation set | External validation set | MRI used for Imaging | MRI sequences | Image Segmentation & Feature extraction tool | Reference Standard test | Type of study | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PCNSL | GBM | ||||||||||
| 1 | Chen ( | 30 | 66 | Convolutional neural network, | Yes, cross validation | Yes | 3 Tesla | T1 contrast enhanced | Scale Invariant Feature Transformation (SIFT) | Histopathology | Retrospective study |
| 2 | Xiao ( | 22 | 60 | Machine learning: Naïve Bayes (NB), SVM, LR, Random Forest model | Yes ,10-fold internal cross validation | No | 1.5 & 3 Tesla | T1W, T2W & T1W contrast enhanced | Pyradiomics | Histopathology | Retrospective study |
| 3 | Guoqing ( | 32 | 70 | Convolutional neural network | Yes, Independent set | No | 3 Tesla | Not reported | Patch based Sparse representation method | Histopathology | Retrospective study |
| 4 | Kim ( | 65 | 78 | Logistic regression, SVM, Random Forest model | No | Yes, Independent set | 3 Tesla | T1W contrast, DWI, T2W | Pyradiomics | Histopathology | Retrospective study |
| 5 | Shrot ( | 12 | 41 | Machine learning: | Yes, Leave one out cross validation | No | 1.5 & 3 Tesla | MR perfusion using DSC images & DTI | 3D slicer, intensity-based feature extraction from MR maps | Histopathology | Single Institutional retrospective study |
| 6 | Chen ( | 62 | 76 |
| Yes, Independent set | No | 3 Tesla | T1W contrast | lifeX | Histopathology | Single Institutional retrospective study |
| 7 | Park ( | 95 | 165 | Convolutional Neural network | Yes, Internal Validation set | Yes, Independent set | 3 Tesla | T1W contrast, T2 FLAIR, DSC | Segmentation done semiautomatically by two neuroradiologists | Histopathology | Retrospective study |
| 8 | Escoda ( | 47 | 48 | Logistic binary regression | Yes, Independent set | No | 1.5 & 3 Tesla | T1 Contrast, DSC-PWI | 3D Slicer, Time intensity curve normalization | Histopathology | Retrospective study |
| 9 | Bathla ( | 34 | 60 | Machine learning | Yes (5 fold cross validation) | No | Not Reported | T1 W contrast, & FLAIR, ADC | Pyradiomics | Histopathology | Single Institutional retrospective study |
| 10 | McAvoy ( | 135 | 113 | Convolutional Neural network | yes, independent | No | 3 Tesla | T1 Contrast | Not Reported | Histopathology | Single Institutional retrospective study |
FLAIR, Fluid Attenuation & Recovery; MLP, Multilayer Perception; SVM, Support Vector Machine; GBRM, Generalised Boosted regression Model; DTI, Diffusion tensor Imaging; DSC, Dynamic Susceptibility; ML , Machine learning; CNN, convolutional Neural network; RF, Random forest; LASSO, Least Absolute Shrinkage and Selection Operator; PCA, Principal Component Analysis; LR, Logistic Regression; SVM, Support Vector Machine; LOGISMOS, Layered Optimal Graph Image Segmentation for Multiple Objects and Services; MLP, Multilayer perceptron; SIFT, Scale invariant feature transform; mRMR, Minimum redundancy maximum relevance; CNN, Convolutional neural network; LOOCV, Leave One Out Cross Validation.
Summary of the results of the reviewed studies.
| Sr. No. | Author/Year/country | Diagnostic metrics of the best performing model from the validation set | Study limitations reported | Study strengths reported | |||
|---|---|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | AUC | ||||
| 1 | Chen ( | 0.906 | 0.8 | 0.955 | 0.982 | - Single MR sequence was used | Calculation methods are fast |
| 2 | Xiao ( | 0.82 | 0.78 | 0.91 | 0.9 | - non enhancing & multiple lesions were excluded | - Image pre-processing technique used |
| 3 | Guoqing ( | 0.945 | 0.9 | 0.96 | NA | Not reported | -Completely automated |
| 4 | Kim ( | 0.947 | 0.966 | 0.929 | 0.956 | - Retrospective study with patient selection bias | Not reported |
| 5 | Shrot ( | NA | 1.00 | 1.00 | NA | -ROI tracing was done manually leading to intra & interobserver variability | Not reported |
| 6 | Chen ( | 0.979 | 0.982 | 0.976 | 0.978 | -Isolated evaluation of T1C images | Not reported |
| 7 | Park ( | NA | 0.95 | 0.76 | 0.89 | -Diagnostic performance dropped in external data set due to overfitting | Not reported |
| 8 | Escoda ( | 0.93 | 0.93 | 0.92 | NA | -Retrospective nature of the study. | -Near Homogenous Imaging protocol. |
| 9 | Bathla ( | 0.934 | 0.97 | 0.871 | 0.977 | Small sample size | -Well documented Imaging protocol |
| 10 | McAvoy ( | 0.93 | 1 | 0.86 | 0.94 (GBM) | - Retrospective study with small number of patients | Not reported |
| 0.94 | 0.87 | 1 | 0.95(PCNSL) | ||||
Summary of the diagnostic metrics of all the studies included in the meta-analysis.
|
|
|
|
|
|
|
|
|
| Chen Y ( | 2018 | 96 | 90.6 | 80.0 | 95.5 | 87.8 | 98.2 |
| Xiao DD ( | 2018 | 82 | 82.0 | 78.0 | 91.0 | 84.5 | 90.0 |
| Wu G ( | 2018 | 102 | 94.5 | 90.0 | 96.0 | 93.0 | NA |
| Shrot S ( | 2019 | 53 | 93.6 | 100 | 100 | 100 | NA |
| Chen C ( | 2020 | 138 | 97.9 | 98.2 | 97.6 | 97.9 | 97.8 |
| Park JE ( | 2020 | 260 | NA | 95.0 | 76.0 | 85.5 | 89.0 |
| Escoda A ( | 2020 | 95 | 93.0 | 93.0 | 92.0 | 92.5 | NA |
| Bathla G ( | 2021 | 94 | 93.4 | 97.0 | 87.1 | 92.1 | 97.7 |
| McAvoy ( | 2021 | 248 | 94.0 | 87.0 | 100 | 93.5 | 95.0 |
| Kim Y ( | 2018 | 143 | 94.7 | 96.6 | 92.9 | 94.7 | 95.6 |
Summary of Radiomics Quality Score (RQS) of individual studies.
| Sr. No | Name | RQS score | % | RQS checkpoint 1 (image protocol quality) | RQS checkpoint 12 | RQS checkpoint 3 |
|---|---|---|---|---|---|---|
| 1 | Chen Y ( | 16 | 44.44% | 1 | 1 | 14 |
| 2 | Xiao DD ( | 16 | 44.44% | 1 | 1 | 14 |
| 3 | Wu G ( | 15 | 41.67% | 1 | 1 | 13 |
| 4 | Short S ( | 13 | 36.11% | 1 | 1 | 11 |
| 5 | Chen C ( | 17 | 47.22% | 1 | 1 | 15 |
| 6 | Park JE ( | 18 | 50.00% | 1 | 1 | 16 |
| 7 | Pons-Escoda A ( | 16 | 44.44% | 1 | 1 | 14 |
| 8 | Bathla G ( | 16 | 44.44% | 1 | 1 | 14 |
| 9 | McAvoy ( | 16 | 44.44% | 1 | 1 | 14 |
| 10 | Kim Y ( | 17 | 47.22% | 1 | 1 | 15 |
Figure 3Hierarchical Summary Receiver Operator Curve (HSROC) plot displaying the diagnostic performance of radiomic based ML tools & DL tools in differentiating PCNSL from GBM. Hierarchical Summary Receiver Operator Curve (HSROC) plot displaying the diagnostic performance of radiomic based ML tools & DL tools in differentiating PCNSL from GBM. Each coloured triangle represents each of the studies in the meta-analysis. The plotted curve is the regression line that summarizes the overall diagnostic accuracy. The pooled sensitivity and specificity estimate is based on the assumption of conditional independence and the use of perfect reference standards. The “TP”, “FP”, “FN”, “TN” rates for the two studies (Park JE 2020 and Shrot S 2019 studies) as the former study has no available accuracy value and the latter one has both sensitivity and specificity equal to one.
Figure 4(A–D) Performance evaluation of the ML and DL algorithms of all the studies in distinguishing PCNSL from GBM as represented by the random forest plots. (A) Forest plots of sensitivity. (B): Forest plots of accuracy. (C) Forest plot of balanced accuracy, (D) Forest plots of specificity.