| Literature DB >> 34462648 |
Chia-Lun Lo1, Ya-Hui Yang1, Hsiao-Ting Tseng2.
Abstract
Bladder cancer, the ninth most common cancer worldwide, requires fast diagnosis and treatment to prevent disease progression and improve patient survival. However, patients with bladder cancer often experience considerable delays in diagnosis. One reason for such delays is that hematuria, a major symptom of bladder cancer, has a high probability of also being a warning sign for urinary tract diseases. Another reason is that the sensitivity of the body parts affected by bladder cancer deters patients from undergoing cystoscopy and influences patients' "physician shopping" behavior. In this study, the analytic hierarchy process was used to determine critical variables influencing delayed diagnosis; moreover, the variables were used to construct models for predicting delayed diagnosis in patients with hematuria by using several machine learning techniques. Furthermore, the critical variables associated with delayed diagnosis of bladder cancer in patients with hematuria were evaluated using GainRatio technology. The study sample was selected from a population-based database. The model evaluation results indicated that the prediction model established using decision tree algorithms outperformed the other models. The critical risk factors for delayed diagnosis of bladder cancer were as follows: (1) cystoscopy performed 6 months after hematuria diagnosis and (2) physician shopping.Entities:
Mesh:
Year: 2021 PMID: 34462648 PMCID: PMC8403036 DOI: 10.1155/2021/3831453
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
The importance of variable ranking by AHP.
| Rank | Dimension |
|---|---|
| 1 | Characteristics of patients |
| 2 | Cystoscopy after hematuria record for half year |
| 3 | Characteristics of hospital |
| 4 | Visiting behavior |
| 5 | Characteristics of physicians |
Figure 1Flowchart showing derivation of the study sample.
Parameter setting in Weka.
| Method | Parameters | Range | Value setting |
|---|---|---|---|
| C4.5 | Confidence factor | 0.1–0.5 | 0.25 |
| Minimum number of instances per leaf | 2–20 | 20 | |
|
| |||
| Random forest | Number of trees | 5–250 | 10 |
| Number of attributes to be used in random selection | 2–8 | 4 | |
|
| |||
| Support vector machines | Kernel | PolyKernel | |
|
| |||
| Multilayer perceptron | Number of hidden nodes | 5–10 | 7 |
| Learning rate | 0.1–0.6 | 0.3 | |
| Momentum factor | 0–0.9 | 0.2 | |
| Maximum number of epochs | 300–900 | 500 | |
Characteristics of bladder cancer patients with delay diagnosed and control subjects.
| Variables | Value | Delay diagnosed group ( | Nondelayed diagnosed group ( |
|---|---|---|---|
| Age | Patient | 67.3 (31–93) | 67.4 (16–98) |
| Physician | 54.7 (47–63) | 56 (43–63) | |
|
| |||
| Seniority (physician) | 11.7 (4–20) | 13 (5–20 | |
| Gender (patient) | Male | 142 (67.6) | 227 (68.8) |
| Female | 68 (32.4) | 98 (30.2) | |
|
| |||
| Gender (physician) | Male | 201 (95.7) | 313 (96.3) |
| Female | 9(4) | 12 (3.7) | |
|
| |||
| Hospital level | Medical center | 60 (28.6) | 113 (34.8) |
| Regional hospital | 81 (38.6) | 135 (41.5) | |
| District hospital | 36 (17.1) | 43 (13.2) | |
| Clinic | 33 (15.7) | 32 (10.5) | |
|
| |||
| Visit behavior | Surgery | 52 (24.7) | 19 (5.8) |
| Gynecology | 38 (18.1) | 13 (4) | |
| Chinese medicine | 48 (22.3) | 3 (0.9) | |
| Gastroenterology | 61 (29.0) | 6 (1.8) | |
| Nephrology | 53 (25.2) | 30 (9.2) | |
|
| |||
| Cystoscopy after hematuria record for half year | 205 (97.6) | 120 (36.9) | |
| Visit times | Surgery | 1.69 | 0.03 |
| Gynecology | 1.69 | 0.05 | |
| Chinese medicine | 1.73 | 0.01 | |
| Gastroenterology | 0 | 0.02 | |
| Nephrology | 0 | 0.18 | |
|
| |||
| Location (level of urbanization) | |||
| City | 82 (39.0) | 114 (44.3) | |
| Commuting zone | 54 (25.7) | 80 (24.6) | |
| Towns and semidense areas | 10 (4.8) | 12 (3.7) | |
| Rural areas | 64 (30.5) | 89 (27.4) | |
n (%), the others are μ(σ).
Performance results of classifiers.
| Classifier | Algorithms | Accuracy, | Sensitivity, | Specificity, | AUC, |
|---|---|---|---|---|---|
| Without AdaBoost | C4.5 | 0.859/0.014 | 0.843/0.003 | 0.858/0.016 | 0.871/0.042 |
| RF | 0.879/0.071 | 0.875/0.045 | 0.8720.081 | 0.942/0.048 | |
| SVM | 0.746/0.007 | 0.752/0.005 | 0.769/0.040 | 0.705/0.008 | |
| LGR | 0.788/0.011 | 0.799/0.013 | 0.802/0.021 | 0.854/0.011 | |
| MLP | 0.742/0.079 | 0.720/0.048 | 0.709/0.127 | 0.775/0.104 | |
|
| |||||
| With AdaBoost | C4.5 | 0.856/0.088 | 0.825/0.109 | 0.856/0.088 | 0.915/0.064 |
| RF | 0.881/0.066 | 0.857/0.062 | 0.881/0.066 | 0.943/0.045 | |
| SVM | 0.743/0.011 | 0.722/0.008 | 0.743/0.001 | 0.762/0.010 | |
| LGR | 0.791/0.002 | 0.802/0.025 | 0.791/0.003 | 0.828/0.008 | |
| MLP | 0.751/0.088 | 0.791/0.029 | 0.752/0.087 | 0.786/0.111 | |
Variable importance ranking by GainRatioAttributeEval.
| Rank | Variable | GainRatio |
|---|---|---|
| 1 | Cystoscopy after hematuria record for half year | 0.3030443 |
| 2 | Gastroenterology visiting times | 0.122365 |
| 3 | Gastroenterology visiting | 0.122365 |
| 4 | Chinese medicine visiting | 0.103643 |
| 5 | Chinese medicine visiting times | 0.0636328 |
| 6 | Surgery visiting times | 0.073374 |
| 7 | Gynecology visiting times | 0.063343 |
| 8 | Nephrology visiting times | 0.035427 |
| 9 | Hospital level | 0.007911 |
| 10 | Location | 0.004625 |