| Literature DB >> 35599360 |
Abhishta Bhandari1,2, Ravi Marwah1, Justin Smith1,2, Duy Nguyen3, Asim Bhatti3, Chee Peng Lim3, Arian Lasocki4,5.
Abstract
INTRODUCTION: Chemotherapy and radiotherapy can produce treatment-related effects, which may mimic tumour progression. Advances in Artificial Intelligence (AI) offer the potential to provide a more consistent approach of diagnosis with improved accuracy. The aim of this study was to determine the efficacy of machine learning models to differentiate treatment-related effects (TRE), consisting of pseudoprogression (PsP) and radiation necrosis (RN), and true tumour progression (TTP).Entities:
Mesh:
Year: 2022 PMID: 35599360 PMCID: PMC9545346 DOI: 10.1111/1754-9485.13436
Source DB: PubMed Journal: J Med Imaging Radiat Oncol ISSN: 1754-9477 Impact factor: 1.667
Fig. A2Flow diagram of included studies.
Pipeline features in the differentiation of TRE and TTP using machine learning
| First Author, Year | WHO Grade | Number | Imaging sequences | Segmentation | Features and selection | Machine Learning Method | Result | Confirmation | |
|---|---|---|---|---|---|---|---|---|---|
| Glioma (All Grades): PsP versus TTP | |||||||||
| Jang 2020 | Glioblastoma (IV) |
PsP = 38 TTP = 66 | T1‐Gd | Unclear | CNN‐LSTM learning derived features | CNN | AUC = 0.86 | Based on expert opinions | |
| Liu 2020 | Glioblastoma (IV) |
PsP = 23 TTP = 61 | DTI | Pipeline performed on whole brain |
DL features Human‐selected | CNN | DenseNet produced an AUC = 0.98, accuracy of 88.02%, sensitivity of 99.24% with a specificity of 66.04% | Final diagnosis made from follow‐up image interpretation and physician evaluation | |
| Li 2020 | Glioblastoma (IV) |
PsP = 23 TTP = 61 | DTI | Pipeline performed on whole brain |
DL features using deep convolutional generative adversarial network (DCGAN) and AlexNet NA | SVM | Using the last 2 convolutional layers from AlexNet the accuracy= 92%, sensitivity = 97.6%, specificity = 88.3% | Follow‐up images and professional evaluation by experienced clinicians | |
| Kim 2019 | Glioblastoma (IV) |
PsP = 46 TTP = 49 |
T1 T1‐Gd FLAIR DWI: ADC DSC: CBV | Semi‐automated with segmentation threshold and region‐growing segmentation |
Shape Intensity Texture Wavelet‐ transformed LASSO logistic regression model | Generalized linear model | AUC of 0.96 (95% CI: 0.88–1.00), accuracy of 95.6%, sensitivity of 93.7% and specificity of 100%. |
Final diagnosis of PsP made with increasing contrast‐enhancing lesions on MRI that subsequently regressed or became stable without any changes in the treatment for at least 6 months after therapy. Final diagnosis of TTP made if enhancing lesions gradually increased on more than 2 subsequent follow‐up MRI studies performed at 2‐ to 3‐month intervals and required a prompt change in treatment. | |
| Elshafeey 2019 | Glioblastoma (IV) |
PsP = 22 TTP = 83 |
DCE: Ktrans DSC: rCBV | Semi‐automated |
Histogram features Haralick Maximum Relevance Minimum Redundancy (MRMR) | SVM | Using SVM model a AUC = 0.89, accuracy = 90.82%, sensitivity = 91.4% and specificity = 88.2% | Histopathological tissue evaluation | |
| Bani‐Sadr 2019 | Glioblastoma (IV) |
PsP = 23 TTP = 53 |
T1 T1‐Gd FLAIR | Manual |
Radiomic features Wilcoxon‐test–based method | Random forest | Radiomics and MGMT promoter status discriminated with an AUC = 0.85, accuracy = 79.2%, sensitivity = 80.0% (95% CI [56.3–94.3%]), and specificity= 75.0% (95% CI [19.4–99.3%]) | Histopathological tissue evaluation or radioclinical follow‐up by RANO criteria | |
| Jang 2018 | Glioblastoma (IV) |
PsP = 30 TTP = 48 |
T1 T1‐Gd | Pipeline performed on whole brain |
CNN‐LSTM DL structure and clinical features NA | CNN‐LSTM structure | An AUC = 0.83 was found using the model |
CE lesion on follow‐up MRI based on RANO, Histopathological tissue evaluation or significant uptake on PET scans | |
| Ismail 2018 | Glioblastoma (IV) |
PsP = 71 TTP = 34 |
T2 FLAIR | Manual |
Shape Local hand‐crafted features Sequential feed‐forward feature selection | SVM | An accuracy = 90.2% was found | Histologic analysis or follow‐up imaging | |
| Booth 2017 | Glioblastoma (IV) |
PsP = 9 TTP = 15 | T2 | Manual |
Shape Intensity Minkowski functionals t‐tests | SVM | Accuracy = 86%, Sensitivity = 100% (95%CI: 51–100%), Specificity = 67% (95%CI: 21–94%) |
RANO criteria Increasing enhancing lesion at 4 weeks, 4 months or 7 months following chemoradiotherapy | |
| Zhang 2016 | Glioblastoma (IV) |
PsP = 23 TTP = 56 | DTI | Pipeline performed on whole brain |
Deep (dictionary) learning Index‐based approach | SVM | An AUC = 0.87 was found using the model | On follow‐up by physicians clinical and imaging experience. Biopsy performed if necessary. | |
| Qian 2016 | Glioblastoma (IV) |
PsP = 13 TTP = 22 | DTI | Pipeline performed on whole brain |
Deep (dictionary) learning 28 top‐ranked features | SVM | Using locally linear embedding to extract 4 dimensions from the image an AUC = 0.875 (SD = 0.276) and accuracy of 77.0% (SD = 19%) was found | Follow‐up imaging with clinician experience or histopathological evaluation | |
| Hu 2011 | Glioblastoma (IV) |
PsP = 16 TTP = 15 |
T1 T2 FLAIR TTP DWI: ADC DSC: rCBF, rCBV, MTT | Pipeline performed on whole brain |
Intensity NA | One‐Class SVM | AUROC= 0.9439, sensitivity = 89.91%, specificity= 93.72% | Follow‐up MRI scans acquired every 2–3 months following chemoradiotherapy | |
| Akbari 2020 | Glioblastoma (IV) |
PsP = 20 TTP = 63 |
T1 T1‐Gd T2 T2‐FLAIR DTI DSC | Manual |
Intensity Shape Texture: GLCM, GLRLM Sequential feature selection | SVM | Radiomics produced the highest AUC = 0.919, accuracy of 87.3%, sensitivity of 80% with a specificity of 88.69% | Histopathological tissue evaluation | |
| Lohmann 2020 | Glioblastoma: |
PsP = 16 TTP = 18 | FET‐PET | Three different segmentations based on tumor brain ratio of the SUV |
Shape, intensity and texture: GLSZM SVM‐RFE | Random Forest |
AUC = 0.73 Accuracy = 70% Sensitivity = 100% Specificity = 40% | Histopathological tissue evaluation or clinicoradiolocal by RANO follow‐up | |
| Lee 2020 | Glioblastoma (IV) = 23; anaplastic astrocytoma (III) = 2; Grade II = 18; |
PsP = 36 TTP = 7 |
T1 T1‐Gd T2 T2‐FLAIR DWI: ADC T1‐post–T1 minus pre‐contrast T2 minus FLAIR | Pipeline performed on whole brain | CNN‐LTSM deep learning model | CNN | AUC = 0.81 (95%CI: 0.73–0.87) | Histopathological tissue evaluation | |
| Kebir 2020 | Glioblastoma (IV) |
PsP = 14 TTP = 30 | FET‐PET | NA | Handcrafted features | Linear discriminant analysis | AUC = 0.93 (95% CI: 0.78–1, sensitivity 100% specificity 80%) | Confirmatory MRI 4 weeks later | |
| Gliomas (all grades): RN versus TTP | |||||||||
| Prasanna 2016 | Glioblastoma (IV) |
RN= 24 TTP= 18 | T1 | Manual |
CoLlAGe features (Co‐occurrence of Local Anisotropic Gradient Orientations) NA | Random Forest | Accuracy= 83.79 ± 5.43% | Histopathological tissue evaluation | |
| Zhang 2019 |
High Grade Gliomas (III/IV)‐ 32 Low Grade Gliomas (I/II)‐ 19 |
TTP = 35 RN = 16 |
T1 T1‐Gd T2 FLAIR | Manual segmentation | Handcrafted Features plus Inception v3 (deep learning derived features) | Logistic regression |
Fusion Inception v3 deep learning achieved an AUC= 0.9988, Sensitivity = 99.07% Specificity = 97.93% | Histopathological confirmation or confirmed by imaging and clinical follow‐up by neuroradiologists (follow‐up time > 6 months) | |
| Wang 2020 |
Grade II Glioma = 72 Grade III Glioma = 45 Grade IV Glioma = 43 |
TTP = 118 RN = 42 |
FDG‐PET MET‐PET T1 FLAIR T1‐Gd | Manual segmentation |
Texture LASSO | Logistic regression |
FDG‐PET + MET PET: AUC = 0.891 (95% CI: 0.823–0.958) Accuracy = 79.2% Sensitivity = 75.0% Specificity = 91.7% | Clinicoradiological follow‐up MRI – minimum of 3 months | |
| Gao 2020a |
Grade II = 41 Grade III = 32 Grade IV = 56 Unknown = 17 |
TTP = 96 RN = 50 |
T1 T2 T1‐Gd | Pipeline performed on whole brain | Deep learning derived features from ERN‐Net (efficient radionecrosis neural network) | Deep neural network |
AUC (95% CI) = 0.92 (0.90–0.93) Accuracy (95% CI) = 81% (79–83%) Sensitivity (95% CI) = 82% (79–84%) Specificity (95% CI) = 79 (75–82%) | Histopathological tissue evaluation | |
| Gliomas (all grades): TRE (RN and PsP) versus TTP | |||||||||
| Gao 2020b |
Anaplastic astrocytoma (III) = 7 Glioblastoma (IV) = 32 |
TRE = 14 TTP = 25 |
T1 T2‐FLAIR | Manual |
Shape Intensity Texture: GLCM, GLSZM, GLRLM, NGTDM, GLDM RFE | SVM | A combination of both subtractions gave an AUC = 0.94 (95%CI = 0.7788–1.0000), accuracy of 93.33%, sensitivity of 100% and specificity of 90% |
Histopathological tissue evaluation | |
| Bacchi 2019 | High grade gliomas (3 Grade III, 52 Grade IV) |
TRE = 16 TTP = 39 |
T1‐Gd FLAIR DWI: ADC | Pipeline performed on whole brain |
CNN DL features NA | CNN | The highest AUC = 0.80 for DWI + FLAIR images; Accuracy = 82%, Sensitivity = 100%, Specificity = 60% | Histopathological tissue evaluation or on follow‐up MRI at >6 months. | |
| Artzi 2016 |
Glioblastoma (IV) = 18 Anaplastic Astrocytoma (III)= 2 |
TRE = 3 TTP = 12 Both = 1 | DCE: KTrans, ve, kep, vp, Bolus arrival time (BAT) | Semi‐automated |
KTrans, ve, kep, vp ANOVA | SVM |
Training Data sens = 98.31% spec = 96.97% | Follow‐up at 2–3 months or histopathology | |
| Metastasis: RN versus TTP | |||||||||
| Peng 2018 | Metastasis |
TTP = 52 RN = 30 |
T1‐Gd FLAIR | Semi‐automatic using deep learning |
Shape Texture | IsoSVM |
AUC = 0.81 Sensitivity = 65.38% Specificity = 86.67% | Based on neuroradiologist interpretation | |
| Zhang 2018 | Metastasis |
TTP = 73 RN = 24 |
T1 T1‐Gd T2 FLAIR | Semi‐automated |
Texture Concordance correlation coefficients | RUSBoost |
AUC = 0.73 Accuracy = 73.2% | Histopathological resection or imaging on follow‐up | |
ADC, apparent diffusion coefficient; AUC, area under the curve; CBF, cerebral blood flow; CBV, cerebral blood volume; CNN, convolution neural network; DCE, dynamic contrast‐enhanced; DNN, deep neural network; DSC, dynamic susceptibility contrast; DTI, diffusion tensor imaging; DWI, diffusion weighted imaging; FDG‐PET, fluorodeoxyglucose (FDG)‐positron emission tomography; FET‐PET, O‐(2‐ [18F]fluoroethyl)‐L‐tyrosine (18F‐FET) positron emission tomography; FLAIR, fluid‐attenuated inversion recovery; Gd, gadolinium; GLCM, gray level co‐occurrence matrix; GLDM, gray level dependence matrix; GLRLM, grey‐level run length matrix; GLSZM, gray level size zone matrix; LASSO, least absolute shrinkage and selection operator; LSTM, long short‐term memory; MET‐PET, C‐methionine positron emission tomography; MGMT, O‐6‐Methylguanine‐DNA Methyltransferase; MTT, mean transit time; NGTDM, neighbourhood gray‐tone difference matrix; PsP, pseudoprogression; RFE, recursive feature elimination; RN, radiation necrosis; SUV, standardized uptake value; SVM, support vector machine; TTP, true tumor progression.
Fig. 1Hierarchical summary receiver operator curve for PsP versus TTP in gliomas (All Grades). , , , , , , , , [Colour figure can be viewed at wileyonlinelibrary.com]
Fig. A1PROBAST quality scoring of included studies.
Checklist for artificial intelligence in medical imaging (CLAIM)
| Title/abstract | Gao 2020b | Liu 2020 | Akbari 2020 | Gao 2020a | Lohmann 2020 | Lee 2020 | Kebir 2020 | Li 2020 | Kim 2019 | Elshafeey 2019 | Bani‐Sadr 2019 | Bacchi 2019 | Jang 2018 | Ismail 2018 | Booth 2017 | Zhang 2016 | Artzi 2016 | Prasanna 2016 | Qian 2016 | Hu 2011 | Jang 2020 | Peng 2018 | Wang 2020 | Zhang 2019 | Zhang 2018 | Total Score by Quality Indicator |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Identification as a study of AI methodology, specifying the category of technology used (e.g., deep learning) | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 8 |
| Structured summary of study design, methods, results, and conclusions | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 19 |
| Introduction | ||||||||||||||||||||||||||
| Scientific and clinical background, including the intended use and clinical role of the AI approach | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 20 |
| Study objectives and hypotheses | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 18 |
| Methods | ||||||||||||||||||||||||||
| Study Design | ||||||||||||||||||||||||||
| Prospective or retrospective study | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 19 |
| Study goal, such as model creation, exploratory study, feasibility study, non‐inferiority trial | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 23 |
| Data | ||||||||||||||||||||||||||
| Data sources | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 21 |
| Eligibility criteria: how, where, and when potentially eligible participants or studies were identified (e.g., symptoms, results from previous tests, inclusion in registry, patient‐care setting, location, dates) | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 16 |
| Data pre‐processing steps | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 18 |
| Selection of data subsets, if applicable | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| Definitions of data elements, with references to Common Data Elements | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 14 |
| De‐identification methods | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| How missing data were handled | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Ground truth | ||||||||||||||||||||||||||
| Definition of ground truth reference standard, in sufficient detail to allow replication | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 18 |
| Rationale for choosing the reference standard (if alternatives exist) | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 7 |
| Source of ground‐truth annotations; qualifications and preparation of annotators | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 8 |
| Annotation tools | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 20 |
| Measurement of inter‐ and intrarater variability; methods to mitigate variability and/or resolve discrepancies | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 4 |
| Data partitions | ||||||||||||||||||||||||||
| Intended sample size and how it was determined | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| How data were assigned to partitions; specify proportions | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 12 |
| Level at which partitions are disjoint (e.g., image, study, patient, institution) | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 16 |
| Model | ||||||||||||||||||||||||||
| Detailed description of model, including inputs, outputs, all intermediate layers and connections | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 7 |
| Software libraries, frameworks, and packages | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 19 |
| Initialization of model parameters (e.g., randomization, transfer learning) | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Training | ||||||||||||||||||||||||||
| Details of training approach, including data augmentation, hyperparameters, number of models trained | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 8 |
| Method of selecting the final model | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 14 |
| Ensembling techniques, if applicable | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Evaluation | ||||||||||||||||||||||||||
| Metrics of model performance | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 23 |
| Statistical measures of significance and uncertainty (e.g., confidence intervals) | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 7 |
| Robustness or sensitivity analysis | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 17 |
| Methods for explainability or interpretability (e.g., saliency maps), and how they were validated | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 16 |
| Validation or testing on external data | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 4 |
| Results | ||||||||||||||||||||||||||
| Data | ||||||||||||||||||||||||||
| Flow of participants or cases, using a diagram to indicate inclusion and exclusion | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 4 |
| Demographic and clinical characteristics of cases in each partition | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 16 |
| Model performance | ||||||||||||||||||||||||||
| Performance metrics for optimal model(s) on all data partitions | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 15 | |
| Estimates of diagnostic accuracy and their precision (such as 95% confidence intervals) | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 10 |
| Failure analysis of incorrectly classified cases | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Discussion | ||||||||||||||||||||||||||
| Study limitations, including potential bias, statistical uncertainty, and generalizability | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 21 |
| Implications for practice, including the intended use and/or clinical role | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 16 |
| Other information | ||||||||||||||||||||||||||
| Registration number and name of registry | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Where the full study protocol can be accessed | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Sources of funding and other support; role of funders | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 22 |
| Total score by study | 26 | 20 | 26 | 30 | 18 | 24 | 22 | 14 | 27 | 22 | 24 | 13 | 22 | 14 | 22 | 10 | 15 | 15 | 16 | 13 | 17 | 18 | 23 | 19 | 15 |
Fig. 2Deeks' funnel plot for PsP versus TTP in gliomas (All Grades). , , , , , , , , [Colour figure can be viewed at wileyonlinelibrary.com]