| Literature DB >> 35174084 |
Thomas C Booth1,2, Mariusz Grzeda1, Alysha Chelliah1, Andrei Roman3,4, Ayisha Al Busaidi2, Carmen Dragos5, Haris Shuaib6,7, Aysha Luis2, Ayesha Mirchandani8, Burcu Alparslan2,9, Nina Mansoor2, Jose Lavrador10, Francesco Vergani10, Keyoumars Ashkan10, Marc Modat1, Sebastien Ourselin1.
Abstract
OBJECTIVE: Monitoring biomarkers using machine learning (ML) may determine glioblastoma treatment response. We systematically reviewed quality and performance accuracy of recently published studies.Entities:
Keywords: artificial intelligence; deep learning; glioblastoma; glioma; machine learning; meta-analysis; monitoring biomarkers; treatment response
Year: 2022 PMID: 35174084 PMCID: PMC8842649 DOI: 10.3389/fonc.2022.799662
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Longitudinal series of MRI images in two patients (A, B) with glioblastoma, IDH-wildtype. All images are axial T1-weighted after contrast administration. Images (Aa–Ad) demonstrate tumor progression. (Aa) Pre-operative MRI of a glioblastoma in the occipital lobe. (Ab) Post-operative MRI five days after resection; there is no contrast enhancement therefore no identifiable residual tumor. (Ac) The patient underwent a standard care regimen of radiotherapy and temozolomide. A new enhancing lesion at the inferior margin of the post-operative cavity was identified on MRI at three months after radiotherapy completion. (Ad) The enhancing lesion continued to increase in size three months later and was confirmed to represent tumor recurrence after repeat surgery. Images (Ba–Bd) demonstrate pseudoprogression. (Ba) Pre-operative MRI of a glioblastoma in the insula lobe. (Bb) Post-operative MRI at 24 hours after surgery; post-operative blood products are present but there is no contrast enhancement therefore no identifiable residual tumor. (Bc) The patient underwent a standard care regimen of radiotherapy and temozolomide. A new rim-enhancing lesion was present on MRI at five months after radiotherapy completion. (Bd) Follow-up MRI at monthly intervals showed a gradual reduction in the size of the rim-enhancing lesion without any change in the standard care regimen of radiotherapy and temozolomide or corticosteroid use. The image shown here is the MRI four months later.
Figure 2Flow diagram of search strategy.
Studies using machine learning in the development of glioblastoma monitoring biomarkers.
| Author | Target condition | Reference standard | Dataset(s) | Available demographic information | Methodology | Features selected | Test set performance |
|---|---|---|---|---|---|---|---|
|
| Early true progression or Early pseudoprogression | Mixture of histopathology and imaging follow up | Training = 61 | Training = | Retrospective | First-order, | Recall 0.71 |
| Kim J.Y. et al. ( | Early true progression or Early pseudoprogression | Mixture of histopathology and imaging follow up | Training = 59 | Training = | Retrospective | First-order, | Recall 0.80 |
| Bacchi S. et al. ( | True progression or PTRE (HGG) | Histopathology for progression and imaging follow up for pseudoprogression | Training = 44 | Combined = | Retrospective | CNN. | Recall 1.00 |
| Elshafeey N. et al. ( | True progression or | Histopathology | Training = 98 | Training = | Retrospective | Ktrans & CBV parameters | Insufficient published data to determine diagnostic performance |
| Verma G. et al. ( | True progression or Pseudoprogression | Mixture of histopathology and imaging follow up | Training = 27 | Training = | Retrospective | Cho/NAA & Cho/Cr | No test set |
| Ismail M. et al. ( | True progression or Pseudoprogression | Mixture of histopathology and imaging follow up | Training = 59 | Training = | Retrospective | Global & curvature shape | Recall 1.00 |
|
| True progression or Pseudoprogression | Mixture of histopathology and imaging follow up | Training = 52 | Combined = | Retrospective | Second-order features | Recall 0.94 (0.71 - 1.00) |
| Gao X.Y. et al. ( | True progression or PTRE (HGG) | Mixture of histopathology and imaging follow up | Training = 34 | Combined = | Retrospective | Recall 1.00 | |
| Jang B-S. et al. ( | True progression or Pseudoprogression | Mixture of histopathology and imaging follow up | Training = 59 Testing = 19 | Training = | Retrospective | CNN | Recall 0.64 |
| Li M. et al. ( | True progression or | Imaging follow up | Training = 84 | No demographic information | Retrospective. | CNN. DTI | No test set |
| Akbari H. et al. ( | True progression or Pseudoprogression | Histopathology | Training = 40 | Combined | Retrospective | First-order, second-order (texture). | Recall 0.70 |
| Li X. et al. ( | Early True progression or early pseudoprogression (HGG) | Mixture of histopathology and imaging follow up | Training = 362 | Training = age mean (range) 50 (19–70) | Retrospective | Sparse representations | No test set |
| Manning P et al. ( | True progression or pseudoprogression | Mixture of histopathology and imaging follow up | Training = 32 | Training = age mean ± SD | Retrospective | CBF and CBV parameters included. | No test set |
| Park J.E. et al., 2020 ( | Early True progression or early pseudoprogression | Mixture of histopathology and imaging follow up | Training = 53 | Training = age mean ± SD | Retrospective | First-order, volume/shape, Second-order (texture), wavelet parameters included. | Recall 0.61 |
| Lee J. et al. ( | True progression or | Histopathology | Training = 43 | Training =age mean ± SD (range) | Retrospective | CNN-LSTM parameters. | No test set |
| Kebir S. et al. ( | True progression or | Imaging follow up | Training = 30 | Combined = age mean ± SD (range) | Retrospective | TBRmean | Recall 1.00 |
| Cluceru J. et al. ( | Early True progression or early pseudoprogression (HGG) | Histopathology | Training = 139 | Training = age median (range) | Retrospective | Cho, Cho/Cr, Cho/NAA & CBV parameters. | No test set |
| Jang B.S. et al. ( | True progression or | Mixture of histopathology and imaging follow up (including PET) | (i) (trained model = 78) | Testing = age median (range) | Retrospective | CNN | (i) Insufficient published data to determine diagnostic performance |
Within publication some data appears mathematically discrepant.
Within publication discrepant or unclear information (e.g. interval after radiotherapy).
Unless otherwise stated, glioblastoma alone was analyzed.
PTRE, post-treatment related effects; HGG, high-grade glioma.
MRI sequences: T1 C, postcontrast T1-weighted; T2, T2-weighted; FLAIR, fluid-attenuated inversion recovery; DSC, dynamic susceptibility-weighted; DCE, dynamic contrast-enhanced; DWI, diffusion-weighted imaging; DTI, diffusor tensor imaging; ASL, arterial spin labelling; MRI parameters: ADC, apparent diffusion coefficient; FA, fractional anisotropy; TR, trace (DTI); CBV, cerebral blood volume; PH, peak height; Ktrans, volume transfer constant.
Magnetic resonance spectroscopy: 1H-MRS, 1H-magnetic resonance spectroscopy; 3D-EPSI, 3D echo planar spectroscopic imaging.
1H-MRS parameters: Cr, creatine; Cho, choline; NAA, N-acetyl aspartate.
Nuclear medicine: TBR, tumor-to-brain ratio; TTP, time-to-peak.
Molecular markers: MGMT, O6-methylguanine-DNA methyltransferase; IDH, isocitrate dehydrogenase.
Machine learning methodology: CV, cross validation; LOOCV, leave-one-out cross validation; SVM, support vector machine; CNN, convolutional neural network; LASSO, least absolute shrinkage and selection operator; LSTM, long short-term memory; mRMR, minimum redundancy and maximum relevance; VGG, Visual Geometry Group (algorithm); DCGAN, deep convolutional generative adversarial network; DC-AL GAN, DCGAN with AlexNet.
Statistical measures: CI, confidence intervals; BA, balanced accuracy; AUC, area under the receiver operator characteristic curve; AUPRC, area under the precision-recall curve.
Studies applying machine learning models to baseline MRI images (or genomic signatures) to operate as glioblastoma prognostic biomarkers to predict future treatment response.
| Author | Target condition | Reference standard | Dataset(s) | Available demographic information | Methodology | Features selected | Test set performance |
|---|---|---|---|---|---|---|---|
| Wang S. et al. ( | True progression or pseudoprogression (immunotherapy for EGFRvIII mutation) | Histopathology | model testing set = 10 DTI, DSC and 3D-EPSI | Testing = age mean (range) | Prospective. | CL, CBV, FA parameters | Insufficient data to determine per patient diagnostic performance (per lesion results only available: |
| Yang K. et al. ( | True progression or not (stable disease, partial & complete response & pseudoprogression) | Imaging follow up | Training = 49 | Training = | Retrospective. | Genomic alterations including | No test set |
| Lundemann M. et al. ( | Early recurrence or not (voxel-wise) | Mixture of histopathology and imaging follow up | Training = 10 | Training = | Prospective. | FET; FDG; MD, FA; F, Vb, Ve, Ki, and MTT parameters. | No test set |
EGFR, epidermal growth factor receptor; EGFRvIII, epidermal growth factor receptor variant III; CDKN2A, cyclin-dependent kinase Inhibitor 2A.
MRI sequences: T1 C, post-contrast T1-weighted; T2, T2-weighted; FLAIR, fluid-attenuated inversion recovery; DSC, dynamic susceptibility-weighted; DCE, dynamic contrast-enhanced; DTI, diffusor tensor imaging.
Other imaging techniques: 3D-EPSI, 3D echo planar spectroscopic imaging; PET/CT, positron emission tomography and computed tomography; PET/MRI, positron emission tomography and magnetic resonance imaging; 18F-FDG, [18F]-fluorodeoxyglucose; 18F-FET, [18F]-fluoroethyl-L-tyrosine.
MRI parameters: FA, fractional anisotropy; MD, mean diffusivity; CL, linear anisotropy; CBV, cerebral blood volume; MTT, mean transit time; F, blood flow; Ve, extra-vascular extra-cellular blood volume; Vb, vascular blood volume; Ki, vascular permeability.
Statistical and machine learning methodology: LOOCV, leave-one-out cross validation; AUC, area under the receiver operator characteristic curve; PFS, progression free survival.
Figure 3Forest plots showing sensitivity and specificity.
Figure 4Summary Receiver Operator Characteristic Curve (SROC) of diagnostic performance accuracy. The summary point estimate and surrounding 95% confidence region is shown.
Advantages and disadvantages of using ML-based monitoring biomarkers for glioblastoma treatment response assessment (61).
| Advantages | Disadvantages |
|---|---|
| Using ML requires less formal statistical training given the huge developments in software ( | The clinical context may not be represented with a decreased ability to perform holistic evaluations of patients, with loss of valuable and irreducible aspects of the human experience such as psychological, relational, social, and organizational issues ( |
| Wide data can be handled relatively easily ( | Linking the empirical data to a categorical analysis can neglect an intrinsic ambiguity in the observed phenomena ( |
| ML models have the ability to determine implicitly any complex nonlinear relationship between independent and dependent variables ( | Overreliance on the capabilities of automation can lead to the related phenomenon of radiologist deskilling ( |
| Algorithms may be unreliable due to several technical constraints: domain adaptation is currently limited, and more solutions are required to help algorithms extrapolate well to new centers. Ultimately models may require calibration or retraining. | |
| Robustness to unintended data, such as artifacts, is also a technical constraint that needs to be overcome. Finally, the presence of more than one pathology (e.g., stroke or abscess associated with a tumor following treatment) can also confound algorithms as these cases are scarce and often unlabeled. | |
| Accuracy-driven performance metrics have led to a trend towards increasingly opaque models ( |