| Literature DB >> 25866762 |
Enrique Redondo-Gonzalez1, Leandro Nunes de Castro2, Jesús Moreno-Sierra1, María Luisa Maestro de las Casas3, Vicente Vera-Gonzalez4, Daniel Gomes Ferrari2, Juan Manuel Corchado5.
Abstract
Bladder cancer occurs in the epithelial lining of the urinary bladder and is amongst the most common types of cancer in humans, killing thousands of people a year. This paper is based on the hypothesis that the use of clinical and histopathological data together with information about the concentration of various molecular markers in patients is useful for the prediction of outcomes and the design of treatments of nonmuscle invasive bladder carcinoma (NMIBC). A population of 45 patients with a new diagnosis of NMIBC was selected. Patients with benign prostatic hyperplasia (BPH), muscle invasive bladder carcinoma (MIBC), carcinoma in situ (CIS), and NMIBC recurrent tumors were not included due to their different clinical behavior. Clinical history was obtained by means of anamnesis and physical examination, and preoperative imaging and urine cytology were carried out for all patients. Then, patients underwent conventional transurethral resection (TURBT) and some proteomic analyses quantified the biomarkers (p53, neu, and EGFR). A postoperative follow-up was performed to detect relapse and progression. Clusterings were performed to find groups with clinical, molecular markers, histopathological prognostic factors, and statistics about recurrence, progression, and overall survival of patients with NMIBC. Four groups were found according to tumor sizes, risk of relapse or progression, and biological behavior. Outlier patients were also detected and categorized according to their clinical characters and biological behavior.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25866762 PMCID: PMC4383273 DOI: 10.1155/2015/168682
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Variables measured and available in the dataset.
| Name | Description | Values | Significance |
|---|---|---|---|
| Type | Type of sample | 1/2/3 | NMIBC/MIBC/control |
| Age | Diagnosis age | Numeric | Years |
| N History | Identification number | Numeric | — |
| Gender | Gender | 1/2 | Male/female |
| Fdiagn | Diagnostic data | Date | DD/MM/YYYY |
| Tobacco | Tobacco smoking | 0/1 | No/yes |
| Alcohol | Alcohol consumption | 0/1 | No/yes |
| Af | Family history of BC | 0/1 | No/yes |
| Mfum | More than 20 cigarettes a day | 0/1 | No/yes |
| Otrosf | Other risk factors of BC | Text | Not analyzable |
| Hematuri | Haematuria | 0/1 | No/yes |
| Irritat | Irritative syndrome | 0/1 | No/yes |
| Dolorsup | Suprapubic pain | 0/1 | No/yes |
| Otros | Other symptoms | 0/1 | No/yes |
| Diagn | Diagnostic type | 1/2 | Symptomatic/incidental |
| Tumor | Number of tumors | Numeric | Numeric |
| Creat | Creatinine | Numeric | mg/dL |
| Got | GOT | Numeric | U/L |
| Gpt | GPT | Numeric | U/L |
| Hem | Number of red blood cells | Numeric | E6/uL |
| Hb | Haemoglobin | Numeric | g/dL |
| Hcto | Hematocrit | Numeric | % |
| Ca | Calcium | Numeric | mg/dL |
| P | Phosphorum | Numeric | mg/dL |
| Falc | Alcaline phosphatase | Numeric | U/L |
| Citesp, Citarr; eco, UIV; CT, cistosc | Diagnosis test performed | Text | Not analyzable |
| Multiple | Multiplicity | 1/2 | Single/multiple |
| Tam | Size (cm) | numeric | cm |
| TAM3CM | Size ≥3 cm | 1/2 | No/yes |
| Aspect | Endoscopic aspect | 1/2/3 | 1 Superficial/2 infiltrative/3 intermediate |
| ASPESUP | Superficial aspect | 1/2 | Yes/no |
| Tto | Type of adjuvant therapy | Text | Not analyzable |
| ADYUV | Adjuvant therapy | 1/2 | Yes/no |
| Jewett | Histologic staging | 1/2/3 | A/B/C-D |
| G | Grade | 1/2/3 | G1/G2/G3 |
| G23 | Grades 2 or 3 | 1/2 | No/yes |
| Tnm | TNM | 1/2 | Ta/T1 |
| Gries | EORTC risk group | 1/2 | Low-intermediate/high |
| Grx | Millan risk group | 1/2 | Low-intermediate/high |
| AP, tipoAP | Type of BC | 1/2/9 | Not analyzable TCC/SC/other |
| p53iha | P53 immunohistochemistry | 1/2/3 | +/++/+++ |
| p53ria | P53 quantified | Numeric | ng/ml |
| Neu | Prot p185 quantified | Numeric | HNU (0.05 fmol/mg)/ml |
| p16 | Prot p16 immunohistochemistry | 1/2/3 | +/++/+++ |
| Recid | Relapse | 1/2 | Yes/no |
| Fechare | First relapse data | Date | DD/MM/YYYY |
| narecid | Number of relapses | Numeric | Number |
| narecidp | Number of relapses till progression | Numeric | Number |
| Prog | Progression | 1/2 | Yes/no |
| Fprog | Progression date | Date | DD/MM/YYYY |
| Metas | Metastatic disease | 1/2 | Yes/no |
| Muerte | Death | 1/2 | Yes/no |
| Fechmuerte | Date of death | Date | DD/MM/YYYY |
| Mporca | Cancer specific mortality | 1/2 | Yes/no |
| Recm | Number of relapses till death | Numeric | Number |
| fechaultre | Last Revision Date | Date | DD/MM/YYYY |
| Egfr | EGFR quantified | Numeric | EGFR fmol/protein mg |
| Logneu | Neu logarithm | Numeric | Number of months |
| Super | Survival (months) | Numeric | Number of months |
| Ile | Relapse-free survival | Months | Number of months |
| Tprogre | Progression-free survival | Months | Number of months |
| Tmetas | Metastatic disease-free survival | Months | Number of months |
| Np53ria | p53 RIATertile | 1/2/3 | Tertile 1/tertile 2/tertile 3 |
| Nneu | NeuTertile | 1/2/3 | Tertile 1/tertile 2/tertile 3 |
| Negfr | EGFRTertile | 1/2/3 | Tertile 1/tertile 2/tertile 3 |
| Filtro | NMIBC | ||
| edad70 | Older than 70 years | 1/2 | Yes/no |
Variables removed from the dataset during the cleansing process.
| Variable | Explanation |
|---|---|
| Type | Constant value |
| Nhistori | Identifier |
| Otrosf | 87% of missing values (additional medical information) |
| Creat | Empty |
| p16 | Empty |
| Otrosm | 93% of missing values (additional medical information) |
| Fdiagn | Date |
| Fecharec | Date |
| Fechaprog | Date |
| Fechametas | Date |
| Fechmuerte | Date |
| Fechaultre | Date |
| filter_$ | Constant value |
Figure 1Hierarchical clustering of the NMIBC patients removing variables with missing values. (a) Clustering of the whole dataset. (b) Clustering of the dataset after removing the anomalous patients 10, 13, and 28.
Replacement values for the variables with missing values.
| Variable | Value |
|---|---|
| Got | 25 |
| Gpt | 24 |
| Ca | 9.32 |
| P | 3.22 |
| Falc | 87 |
| h_c | 23 |
| Egfr | 10.05 |
| Ile | 61 |
| Negfr | 3 |
Figure 2Hierarchical clustering of the NMIBC patients using all variables. (a) Clustering of the whole dataset. (b) Clustering of the dataset after removing the anomalous patients 13 and 28.
Figure 3Hierarchical clustering of the NMIBC patients using the selected variables. (a) Clustering of the whole dataset. (b) Clustering of the dataset after removing patients 13, 26, 30, 35, 37, 38, 44, and 45.
Clinical and molecular characteristics of the different clusters.
| Attribute | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 |
|---|---|---|---|---|
| Age* | 61 years; 18 | 67 years; 9 | 70 years; 9 | 82 years; 9 |
| (23–73) | (52–79) | (60–82) | (72–93) | |
|
| ||||
| Multiplicity | No: 100% | No: 20% | No: 40% | No: 50% |
| Yes 0% | Yes 80% | Yes 60% | Yes 50% | |
|
| ||||
| TM > 3 cm | No: 71% | No: 50% | No: 60% | No: 0% |
| Yes: 29% | Yes: 50% | Yes: 40% | Yes: 100% | |
|
| ||||
| Grade | G1: 83.3% | G1: 0% | G1: 20% | G1: 0% |
| G2: 16.6% | G2: 62.5% | G2: 80% | G2: 50% | |
| G3: 0% | G3: 37.5% | G3: 0% | G3: 50% | |
|
| ||||
| TNM | Ta: 100% | Ta: 0% | Ta: 80% | Ta: 0% |
| T1: 0% | T1: 100% | T1: 20% | T1: 100% | |
|
| ||||
| Risk group [ | Low-Int: 100% | Low-Int: 0% | Low-Int: 80% | Low-Int: 0% |
| High: 0% | High: 100% | High: 20% | High: 100% | |
|
| ||||
| Risk group [ | Low-Int: 100% | Low-Int: 12.5% | Low-Int: 100% | Low-Int: 0% |
| High: 0% | High: 87.5% | High: 20% | High: 100% | |
|
| ||||
| p53 (ng/mL)* | 0.1; 0.2 | 0.5; 1.2 | 0; 0 | 0; 0 |
| (0–0.6) | (0–3.40) | (0-0) | (0-0) | |
|
| ||||
| neu (HNU/mL)* | 748.5; 415.6 | 775.4; 544.2 | 1379.9; 184.7 | 854.4; 497.7 |
| (328–1596.1) | (76–1749.1) | (1253.0–1698.1) | (330.9–1527.2) | |
|
| ||||
| EGRF (fmol/mg)* | 6.9; 4.0 | 12.5; 12.8 | 8.5; 4.7 | 22.8; 17.2 |
| (0.2–11.4) | (2.2–16.6) | (3.0–15.1) | (7.1–39.5) | |
|
| ||||
| GS (months)* | 104; 37 (47–128) | 93; 46 (3–135) | 84; 47 (23–133) | 17; 10 (5–28) |
|
| ||||
| RFS (months)* | 104; 37 (47–128) | 81; 56 (3–135) | 9; 4 (4–13) | 13; 10 (5–27) |
|
| ||||
| PFS (months)* | 104; 37 (47–128) | 93; 46 (3–135) | 84; 47 (23–133) | 13; 10 (5–27) |
*Mean; SD (range).
Clinical and molecular characteristics of the different outliers (patient number).
| Attribute | Outlier 1 (13) | Outlier 2 (26) | Outlier 3 (30) | Outlier 4 (35) | Outlier 5 (37) | Outlier 6 (38) | Outlier 7 (44) | Outlier 8 (45) |
|---|---|---|---|---|---|---|---|---|
| Age | 77 | 73 | 69 | 72 | 41 | 71 | 80 | 60 |
| Multiplicity | Yes | Yes | Yes | No | No | No | No | Yes |
| TM > 3 cm | Yes | No | No | No | No | No | Yes | Yes |
| Grade | G3 | G2 | G2 | G2 | G2 | G2 | G3 | G2 |
| TNM | T1 | Ta | Ta | Ta | T1 | T1 | T1 | Ta |
| Risk group | High | Low-Int | Low-Int | Low-Int | High | High | High | Low-Int |
| p53 | 0 |
|
| 0 |
| 0 |
| 0 |
| Neu |
|
|
|
| 994 | 700 | 385.7 |
|
| Egfr |
| 6.3 |
| 8.6 | 3.2 |
| 16.4 |
|
| GS (months) | 15 | 35 | 133 | 111 | 131 | 11 | 9 | 106 |
| RFS (months) | 15 | 11 | 5 | 11 | 59 | 1 | 3 | 18 |
| PFS (months) | 15 | 35 | 133 | 111 | 131 | 1 | 3 | 91 |
|
| ||||||||
| Cluster | 4 | 3 | 3 | 3 | 2 | 4 | 4 | 3 |
Bold: reason for exclusion by the algorithm.