| Literature DB >> 32967377 |
Sacheth Chandramouli1, Patrick Leo1, George Lee1, Robin Elliott2, Christine Davis3, Guangjing Zhu3, Pingfu Fu4, Jonathan I Epstein3, Robert Veltri5, Anant Madabhushi1,6.
Abstract
In this work, we assessed the ability of computerized features of nuclear morphology from diagnostic biopsy images to predict prostate cancer (CaP) progression in active surveillance (AS) patients. Improved risk characterization of AS patients could reduce over-testing of low-risk patients while directing high-risk patients to therapy. A total of 191 (125 progressors, 66 non-progressors) AS patients from a single site were identified using The Johns Hopkins University's (JHU) AS-eligibility criteria. Progression was determined by pathologists at JHU. 30 progressors and 30 non-progressors were randomly selected to create the training cohort D1 (n = 60). The remaining patients comprised the validation cohort D2 (n = 131). Digitized Hematoxylin & Eosin (H&E) biopsies were annotated by a pathologist for CaP regions. Nuclei within the cancer regions were segmented using a watershed method and 216 nuclear features describing position, shape, orientation, and clustering were extracted. Six features associated with disease progression were identified using D1 and then used to train a machine learning classifier. The classifier was validated on D2. The classifier was further compared on a subset of D2 (n = 47) against pro-PSA, an isoform of prostate specific antigen (PSA) more linked with CaP, in predicting progression. Performance was evaluated with area under the curve (AUC). A combination of nuclear spatial arrangement, shape, and disorder features were associated with progression. The classifier using these features yielded an AUC of 0.75 in D2. On the 47 patient subset with pro-PSA measurements, the classifier yielded an AUC of 0.79 compared to an AUC of 0.42 for pro-PSA. Nuclear morphometric features from digitized H&E biopsies predicted progression in AS patients. This may be useful for identifying AS-eligible patients who could benefit from immediate curative therapy. However, additional multi-site validation is needed.Entities:
Keywords: active surveillance; machine learning; pathology; prostate cancer
Year: 2020 PMID: 32967377 PMCID: PMC7563653 DOI: 10.3390/cancers12092708
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Feature maps of a patient that did not progresses (left) and one who progressed (right). Nuclear segmentation results representing the nuclear shape (a,b). Nuclear spatial arrangement represented through the Delaunay triangulation (c,e) Voronoi tessellation (d,f), and Minimum Spanning Trees (g,i). Disorder in localized nuclear orientation (h,j).
Summary of QH based features to describe tumor morphology.
| Feature Family | Description | Features |
|---|---|---|
| Graph | Voronoi, Delaunay, minimum spanning trees, k-NN graphs | 51 |
| Shape | Area, perimeter ratio, smoothness, distance, etc. | 100 |
| Nuclear Disorder | Orientation entropy, energy, contrast | 39 |
| Cluster Graphs | Clustering coefficient, edge length, connected components | 26 |
The six features identified from D1 and used in the final model.
| Feature |
|---|
| Voronoi: Min/max polygon perimeter |
| Shape: Min/max standard deviation of distance of contour point from centroid of nuclei |
| Shape: Min/max nuclei perimeter |
| Shape: Standard deviation of Fourier descriptor 6 |
| Orientation: Tensor contrast inverse moment range |
| Voronoi: Standard deviation of polygon area |
A confusion matrix created on D2 using the RF model using an operating point threshold of 0.69. The positive predictive value (PPV), negative predictive value (NPV), sensitivity and specificity was also calculated on D2 using the same operating point.
| Predicted | ||||
|---|---|---|---|---|
|
| Non-Progressor | Progressor | ||
| Non-Progressor | 22 | 14 | NPV: 44% | |
| Progressor | 28 | 67 | PPV: 83% | |
| Specificity: 61% | Sensitivity: 71% | |||
Figure 2The 6 features used to create a clustergram on D1. Features and patients were clustered hierarchically. The shading of each cell shows the relative over- or under-expression of that feature in a patient. From left to right, the features are: (1) standard deviation of minimum/maximum nuclear radius, (2) minimum/maximum nuclear perimeter, (3) minimum/maximum ratio of Voronoi polygon perimeter, (4) standard deviation of nuclear shape’s Fourier descriptor 6, (5) tensor contrast inverse moment range, and (6) standard deviation of Voronoi polygon area.
The average AUC, accuracy (ACC), sensitivity (SENS), and specificity (SPEC) across 100 iterations for each classification scheme on D1 using the 6 discriminating features from Experiment 1. These values were calculated using 3-fold cross validation. Each fold had an equal number of progressors (n = 10) and non-progressors (n = 10).
| Model Type | AUC | ACC | SENS | SPEC |
|---|---|---|---|---|
| LDA | 0.68 ± 0.12 | 0.63 ± 0.11 | 0.53 ± 0.16 | 0.26 ± 0.13 |
| QDA | 0.69 ± 0.11 | 0.62 ± 0.10 | 0.58 ± 0.17 | 0.67 ± 0.16 |
| RF | 0.73 ± 0.10 | 0.75 ± 0.09 | 0.70 ± 0.17 | 0.81 ± 0.16 |