| Literature DB >> 27258119 |
Georgina Cosma1, Giovanni Acampora1, David Brown1, Robert C Rees2, Masood Khan3, A Graham Pockley2.
Abstract
The prediction of cancer staging in prostate cancer is a process for estimating the likelihood that the cancer has spread before treatment is given to the patient. Although important for determining the most suitable treatment and optimal management strategy for patients, staging continues to present significant challenges to clinicians. Clinical test results such as the pre-treatment Prostate-Specific Antigen (PSA) level, the biopsy most common tumor pattern (Primary Gleason pattern) and the second most common tumor pattern (Secondary Gleason pattern) in tissue biopsies, and the clinical T stage can be used by clinicians to predict the pathological stage of cancer. However, not every patient will return abnormal results in all tests. This significantly influences the capacity to effectively predict the stage of prostate cancer. Herein we have developed a neuro-fuzzy computational intelligence model for classifying and predicting the likelihood of a patient having Organ-Confined Disease (OCD) or Extra-Prostatic Disease (ED) using a prostate cancer patient dataset obtained from The Cancer Genome Atlas (TCGA) Research Network. The system input consisted of the following variables: Primary and Secondary Gleason biopsy patterns, PSA levels, age at diagnosis, and clinical T stage. The performance of the neuro-fuzzy system was compared to other computational intelligence based approaches, namely the Artificial Neural Network, Fuzzy C-Means, Support Vector Machine, the Naive Bayes classifiers, and also the AJCC pTNM Staging Nomogram which is commonly used by clinicians. A comparison of the optimal Receiver Operating Characteristic (ROC) points that were identified using these approaches, revealed that the neuro-fuzzy system, at its optimal point, returns the largest Area Under the ROC Curve (AUC), with a low number of false positives (FPR = 0.274, TPR = 0.789, AUC = 0.812). The proposed approach is also an improvement over the AJCC pTNM Staging Nomogram (FPR = 0.032, TPR = 0.197, AUC = 0.582).Entities:
Mesh:
Substances:
Year: 2016 PMID: 27258119 PMCID: PMC4892614 DOI: 10.1371/journal.pone.0155856
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Definitions of clinical TNM according AJCC 2010 [21].
| TX | Primary tumor cannot be assessed |
| T0 | No evidence of primary tumor |
| T1a | Tumor incidental histologic finding in ≤ 5% of tissue resected |
| T1b | Tumor incidental histologic finding in > 5% of tissue resected |
| T1c | Tumor identified by needle biopsy (e.g. because of elevated PSA) |
| T2a | Tumor involves one-half of one lobe or less |
| T2b | Tumor involves more than one-half of one lobe but not both lobes |
| T2c | Tumor involves both lobes |
| T3a | Extracapsular extension (unilateral or bilateral) |
| T3b | Tumor invades seminal vesicle(s) |
| T4 | Tumor is fixed or invades adjacent structures other than seminal vesicles such as external sphincter, rectum, bladder, levator muscles, and/or pelvic wall |
| NX | Regional lymph nodes were not assessed |
| N0 | No regional lymph node metastasis |
| N1 | Metastasis in regional lymph node(s) |
| M0 | No distant metastasis |
| M1 | Distant metastasis |
| M1a | Non-regional lymph node(s) |
| M1b | Bone(s) |
| M1c | Other site(s) with or without bone disease |
Pathological TNM according AJCC 2010 [21]. There is no pT1 classification.
| pT2a | Unilateral, one-half of one side or less |
| pT2b | Unilateral, involving more than one-half of one side, but not both sides |
| pT2c | Bilateral disease |
| pT3a | Extraprostatic extension or microscopic bladder neck invasion |
| pT3b | Seminal vesicle invasion |
| pT4 | Invasion of rectum levator muscles, and/or pelvic wall |
| pNX | Regional lymph nodes not sampled |
| pN0 | No positive regional lymph nodes |
| pN1 | Metastasis in regional lymph node(s) |
| pM1 | Distant metastasis |
| pM1a | Non-regional lymph node(s) |
| pM1b | Bone(s) |
| pM1c | Other site(s) with or without bone disease |
Anatomic stage/prognostic groups (from AJCC 2010) [21].
| Group | T | N | M | PSA | Gleason score (GS) |
|---|---|---|---|---|---|
| I | T1a–c | N0 | M0 | ||
| T2a | N0 | M0 | |||
| T1–2a | N0 | M0 | PSA X | GS X | |
| IIA | T1a–c | N0 | M0 | GS 7 | |
| T1a–c | N0 | M0 | |||
| T2a | N0 | M0 | |||
| T2b | N0 | M0 | |||
| T2b | N0 | M0 | PSA X | GS X | |
| IIB | T2c | N0 | M0 | Any PSA | Any GS |
| T1–2 | N0 | M0 | Any GS | ||
| T1–2 | N0 | M0 | Any PSA | ||
| III | T3a–b | N0 | M0 | Any PSA | Any GS |
| IV | T4 | N0 | M0 | Any PSA | Any GS |
| Any T | N1 | M0 | Any PSA | Any GS | |
| Any T | Any N | M1 | Any PSA | Any GS |
Fig 1Neuro-Fuzzy Prostate Cancer Pathological Stage Predictor.
Dataset Statistics.
| Statistics of variables before categorisation | ||||
|---|---|---|---|---|
| Minimum | Maximum | Mean | Standard deviation | |
| 3 | 5 | 3.54 | 0.60 | |
| 3 | 5 | 3.74 | 0.69 | |
| 0.70 | 107.00 | 9.84 | 11.25 | |
| 41.10 | 78.00 | 59.88 | 6.92 | |
| 1.00 | 5.00 | 2.19 | 1.45 | |
| 1.00 | 2.00 | 1.55 | 0.50 | |
Primary and Secondary Gleason pattern groups.
| 205 | 51.4 | |
| 173 | 43.4 | |
| 21 | 5.3 | |
| 399 | 100.0 | |
| 159 | 39.8 | |
| 185 | 46.4 | |
| 55 | 13.8 | |
| 399 | 100.0 |
PSA groups.
| PSA group | PSA range | Frequency count | Proportion of patients (%) |
|---|---|---|---|
| 0–2.5 ng/mL | 16 | 4.01 | |
| 2.6–4.0 ng/mL | 33 | 8.27 | |
| 4.1–6.0 ng/mL | 124 | 31.08 | |
| 6.1–9.9 ng/mL | 124 | 31.08 | |
| 10–19 ng/mL | 67 | 16.79 | |
| ≥ 20 ng/mL | 35 | 8.77 |
Age groups.
| Age group | Age range | Frequency count | Proportion of patients (%) |
|---|---|---|---|
| < 25 | 0 | 0 | |
| 25–29 | 0 | 0 | |
| 30–34 | 0 | 0 | |
| 35–39 | 0 | 0 | |
| 40–44 | 5 | 1.25 | |
| 45–49 | 22 | 5.51 | |
| 50–54 | 68 | 17.04 | |
| 55–59 | 97 | 24.31 | |
| 60–64 | 100 | 25.06 | |
| 65–69 | 76 | 19.05 | |
| > 70 | 31 | 7.77 |
Fig 2Histogram of grouped PSA values.
Clinical T stage groups.
| Clinical T group | Clinical T stage | Frequency count | Proportion of patients (%) |
|---|---|---|---|
| T1(a-c) | 204 | 51.13 | |
| T2a | 53 | 13.28 | |
| T2b | 53 | 13.28 | |
| T2c | 42 | 10.53 | |
| T3a | 29 | 7.27 | |
| T3b | 16 | 4.01 | |
| T4 | 2 | 0.50 | |
| 399 | 100.00 |
Pathological T (pT) stage groups.
| pT group | Pathological T (pT) stage | Frequency count | Proportion of patients (%) | OCD or ED |
|---|---|---|---|---|
| T2(unknown if a or b) | 1 | 0.25 | OCD | |
| T2a | 14 | 3.51 | OCD | |
| T2b | 47 | 11.78 | OCD | |
| T2c | 117 | 29.32 | OCD | |
| T3a | 142 | 35.59 | ED | |
| T3b | 72 | 18.05 | ED | |
| T4 | 6 | 1.50 | ED | |
| 399 | 100.00 |
Before data normalisation.
| Case No. | Primary Gleason Pattern | Secondary Gleason Pattern | PSA | Age | Clinical T stage | Pathological (pT) stage |
|---|---|---|---|---|---|---|
| 3 | 3 | 1.00 | 51.6 | T2b | T2a | |
| 3 | 3 | 1.70 | 77.0 | T2b | T2c | |
| 3 | 3 | 2.05 | 55.2 | T2a | pT2b | |
| 3 | 3 | 2.09 | 61.1 | T1c | pT2b | |
| 3 | 3 | 2.20 | 57.0 | T1c | T3a | |
| … | … | … | … | … | … |
After data normalisation.
| Case No. | Primary Gleason Pattern | Secondary Gleason Pattern | PSA group | Age group | Clinical T group | Pathological (pT) group |
|---|---|---|---|---|---|---|
| 3 | 3 | 1 | 7 | 3 | 1 | |
| 3 | 3 | 1 | 11 | 3 | 1 | |
| 3 | 3 | 1 | 8 | 2 | 1 | |
| 3 | 3 | 1 | 9 | 1 | 1 | |
| 3 | 3 | 1 | 8 | 1 | 2 | |
| … | … | … | … | … | … |
Fig 3Histogram of grouped age values.
PSA levels categorised by age group.
| Age group | Patient count | PSA mean | Standard deviation of PSA values |
|---|---|---|---|
| 5 | 4.40 | 1.14 | |
| 22 | 4.14 | 0.89 | |
| 68 | 3.54 | 1.23 | |
| 97 | 3.75 | 1.20 | |
| 100 | 3.74 | 1.14 | |
| 76 | 3.75 | 1.21 | |
| 31 | 3.81 | 1.56 | |
| 399 | 3.75 | 1.21 |
Mean and Standard deviation values for Organ-Confined Disease (OCD) and Extra-Prostatic Disease (ED) groups diagnosed at the Pathological stage.
| Groups | |||
|---|---|---|---|
| Variables | OCD | ED | |
| n = 179 | n = 220 | ||
| 3.45 ± 0.52 | 3.61 ± 0.64 | 0.005 | |
| 3.65 ± 0.64 | 3.81 ± 0.71 | 0.016 | |
| 3.76 ± 1.26 | 3.74 ± 1.17 | 0.848 | |
| 8.49 ± 1.37 | 8.60 ± 1.40 | 0.434 | |
| 2.01 ± 1.36 | 2.33 ± 1.51 | 0.025 | |
Fig 4Neuro-Fuzzy System Membership Functions: Gleason 1 is Primary Gleason Pattern; Gleason 2 is Secondary Gleason pattern; PSA is Prostate Specific Antigen; Age represents the Age group; and clinical T stage is the result of the Digital Rectal Examination.
OCD is Organ-Confined Disease and ED is Extra-Prostatic Disease.
Performance evaluation.
| Performances based on ROC evaluation measurements | ||||||
|---|---|---|---|---|---|---|
| Neuro-Fuzzy (Our approach) | FCM | Quadratic-SVM | ANN | GB-NB | AJCC pTNM Nomogram | |
| 0.812 | 0.809 | 0.738 | 0.699 | 0.750 | 0.582 | |
| 0.274 | 0.403 | 0.242 | 0.303 | 0.274 | 0.032 | |
| 0.789 | 0.901 | 0.718 | 0.701 | 0.775 | 0.197 | |
| 1.000 | 0.868 | 0.499 | 1.000 | 1.000 | 0.000 | |
Fig 5Performance Comparison.
Fig 6ROC Curves: Performance Comparison.
Support Vector Machine(SVM) performance evaluation when applying various kernel functions.
| Kernel Function | ||||
|---|---|---|---|---|
| Evaluation Measure | Linear | Quadratic | GRB | MP |
| 0.758 | 0.758 | 0.661 | 0.597 | |
| 0.704 | 0.718 | 0.747 | 0.690 | |
| 0.731 | 0.738 | 0.704 | 0.644 | |
| 0.242 | 0.242 | 0.339 | 0.403 | |
| 0.704 | 0.718 | 0.747 | 0.690 | |
Naive Bayes(NB) performance evaluation using the Gaussian distribution and Kernel Density Estimation functions.
| Type of function | ||
|---|---|---|
| Evaluation Measure | GD-NB | KDE-NB |
| 0.726 | 0.645 | |
| 0.745 | 0.747 | |
| 0.750 | 0.696 | |
| 0.274 | 0.355 | |
| 0.775 | 0.747 | |