| Literature DB >> 26355298 |
Petros-Pavlos Ypsilantis1, Musib Siddique2, Hyon-Mok Sohn2, Andrew Davies2, Gary Cook2, Vicky Goh2, Giovanni Montana1.
Abstract
Imaging of cancer with 18F-fluorodeoxyglucose positron emission tomography (18F-FDG PET) has become a standard component of diagnosis and staging in oncology, and is becoming more important as a quantitative monitor of individual response to therapy. In this article we investigate the challenging problem of predicting a patient's response to neoadjuvant chemotherapy from a single 18F-FDG PET scan taken prior to treatment. We take a "radiomics" approach whereby a large amount of quantitative features is automatically extracted from pretherapy PET images in order to build a comprehensive quantification of the tumor phenotype. While the dominant methodology relies on hand-crafted texture features, we explore the potential of automatically learning low- to high-level features directly from PET scans. We report on a study that compares the performance of two competing radiomics strategies: an approach based on state-of-the-art statistical classifiers using over 100 quantitative imaging descriptors, including texture features as well as standardized uptake values, and a convolutional neural network, 3S-CNN, trained directly from PET scans by taking sets of adjacent intra-tumor slices. Our experimental results, based on a sample of 107 patients with esophageal cancer, provide initial evidence that convolutional neural networks have the potential to extract PET imaging representations that are highly predictive of response to therapy. On this dataset, 3S-CNN achieves an average 80.7% sensitivity and 81.6% specificity in predicting non-responders, and outperforms other competing predictive models.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26355298 PMCID: PMC4565716 DOI: 10.1371/journal.pone.0137036
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Distributions of axial 18F-FDG PET intra slices extracted from the 3D tumor volume of non-responders and responders.
Fig 2Kaplan-Meier plot showing the survival rates of responders and non-responders.
Summary of second and high order texture features extracted from texture analysis.
| Texture Matrices | Texture Features |
|---|---|
|
| Energy, Autocorrelation, |
| Cluster Prominence, Cluster shade, | |
| Contrast, Correlation, Difference Entropy, | |
| Difference Variance, Dissimilarity, | |
| Entropy, Homogeneity, Difference Moment | |
| Information Measure Cor.1/Cor.2, | |
| Sum Average, Sum Entropy, Sum Variance | |
| Inverse Difference Moment Normalized, | |
| Inv. Difference. Normalized, Max. Probability, | |
|
| Short Run Emphasis, Long Run Emphasis, |
| Short Run Low/High Gray Level Intensity, | |
| Long Run High/Low Gray Level Intensity, | |
| Run Length Non-uniformity,Run Percentage, | |
| Intensity Variability, Run Length Variability | |
| High/Low Gray Level Run Emphasis, | |
|
| Short/Long Zone Emphasis, Zone Percentage, |
| Short Zone Low/High Emphasis, | |
| Long Zone High/Low Emphasis, | |
| Intensity Non-uniformity, | |
| Zone Length Non-uniformity, | |
| Low/High Intensity Zone Emphasis, | |
| Intensity Variability, Size zone Variability | |
|
| Mean, Entropy, Variance, Contrast |
|
| Mean, Standard Deviation, |
| Hurst Exponent, Lacunarity | |
|
| Coarseness, Contrast, Busyness, |
| Texture Strength, Complexity |
Fig 318F-FDG PET ROIs of a specific tumor i after segmentation embedded into larger square background of standard size of 100 × 100 pixels.
Each enlarged slice is denoted by x and each set of three spatially adjacent enlarged slides is denoted by z , where j and k represent the slices and triplets of the specific tumor i. In this example only 3 triplets, from the 5 available slices can be formed, so k = 1,2,3.
Fig 4CNN architecture for fusion of 3 adjacent 18F-FDG PET intra slices into a vector v.
The CNN architecture is composed from 4 convolutional and 4 max-pooling layers denoted by and . In the first convolutional layer U (1), different coloured arrows represent the usage of different learnable weight matrices for convolving each PET slice in the triplet. Colored dotted rectangles in the feature maps represent elements of the feature maps that enclose local spatial information of the previous layer in the architecture. In the Max-pooling layers 2 × 2 element windows represent non-overlapped grids from which we choose the maximum element to downsample the feature maps.
Classification results: each figure is the average of three independent experiments using different training and test datasets.
| Method | Sensitivity | Specificity | Accuracy |
|---|---|---|---|
|
| 80.7±11.5 | 81.6±9.2 | 73.4±5.3 |
|
| 77.9±12.9 | 58.3±4.2 | 66.4±5.9 |
|
| 70.5±6.0 | 63.8±6.1 | 66.7±5.2 |
|
| 68.1±7.9 | 46.8±16.2 | 66.8±6.0 |
|
| 61.0±8.6 | 36.4±18.4 | 57.3±7.8 |
|
| 65.8±7.5 | 52.0±28.9 | 65.7±5.6 |
|
| 66.9±8.5 | 38.4±19.2 | 55.9±8.1 |
|
| 67.4±10.3 | 50.9±5.0 | 60.5±8.0 |
|
| 60.4±6.2 | 38.3±7.3 | 51.4±3.0 |
|
| 58.9±4.9 | 38.9±12.5 | 48.4±8.0 |
|
| 33.0±33.0 | 35.2±10.2 | 41.0±4.5 |
|
| 81.5±1.5 | 53.0±13.0 | 67.7±4.2 |
Fig 5Ten most important texture features for prediction of the chemotherapy response using the GB algorithm.
Since these measures are relative, we assign the largest importance a value of 100% and then scale the others accordingly.
Fig 6Examples of feature maps in the first and last max-pooling layers V (1) , V (4) of the CNN architecture.
The feature maps illustrate how a specific triplet is represented in the first and last max-pooling layers.