| Literature DB >> 35116609 |
Chao Yang1, Ruihua Yu1, Hui Ji1,2, Haosheng Jiang3, Wanli Yang1, Feng Jiang1.
Abstract
BACKGROUND: As the number of patients with cancer rises, home care for patients with advanced disease is becoming increasingly important. To provide guidance for home medical services and hospice care, we investigated the basic information and medical service information of patients with advanced cancer receiving home care by using a data mining algorithm to predict the patients' survival and medical expenses.Entities:
Keywords: Data mining; back-propagation neural network (BP neural network); patients with advanced cancer; random forest; support vector machine (SVM)
Year: 2021 PMID: 35116609 PMCID: PMC8798724 DOI: 10.21037/tcr-21-896
Source DB: PubMed Journal: Transl Cancer Res ISSN: 2218-676X Impact factor: 1.241
Types of categorical variables
| Categorical variables | Attributes | No. |
|---|---|---|
| Survival (days) | {<30, 30–90, >90} | 3 |
| Address | {Chenjia town, Chengqiao town, …} | 17 |
| Sex | {Male or female} | 2 |
| Living situation | {Living alone, living in a nursing home, living with family} | 3 |
| Household monthly income per capita (Yuan) | {<300, 300–600, >600} | 3 |
| Education | {Illiteracy, elementary school, junior high school, high school and above} | 4 |
| Primary disease diagnosis | {Liver cancer, lung cancer, stomach cancer...} | 21 |
| Transfer status | {Yes, No} | 2 |
| Radiotherapy | {Yes, No} | 2 |
| Chemotherapy | {Yes, No} | 2 |
| Surgery | {Yes, No} | 2 |
| Past medical history | {High blood pressure, diabetes, heart disease...} | 12 |
| Pain duration | {1 month, 1–6 months, 6 months–1 year, > 1 year} | 4 |
| Physical pain | {Yes, No} | 2 |
| Visceral pain | {Yes, No} | 2 |
| Pain medicine | {NSAIDs, weak opioids, strong opioids, other} | 4 |
| Anticonvulsant drug use | {Yes, No} | 2 |
| Anti-anxiety drug use | {Yes, No} | 2 |
| Glucocorticoid use | {Yes, No} | 2 |
| Constipation | {Yes, No} | 2 |
| Disgusting vomits | {Yes, No} | 2 |
| Vomiting | {Yes, No} | 2 |
| Dizziness | {Yes, No} | 2 |
| Sweating | {Yes, No} | 2 |
| Difficulty urinating | {Yes, No} | 2 |
| Drowsiness | {Yes, No} | 2 |
| Other negative symptoms | {Yes, No} | 2 |
NSAIDs, non-steroidal anti-inflammatory drugs.
Numeric variables
| Numeric variables | Mean | Range |
|---|---|---|
| Medical expenditure (Yuan) | 1,982.03 | 0–84,384.78 |
| Age | 70.58 | 23–91 |
| Analgesic dosage (mg) | 26.64 | 0–360 |
| NRS | 3.78 | 0–9 |
| QOL | 49.84 | 0–100 |
| KPS | 33.54 | 0–60 |
NRS, Numeric Rating Scale; QOL, quality of life; KPS, Karnofsky Performance Scale.
Performance of 3 algorithms in predicting the survival time of patients
| Algorithm | Correct rate (mean ± SD) | Error rate (mean ± SD) |
|---|---|---|
| Random forest | 81.94%±6.12% | 18.06%±6.12% |
| SVM | 74.61%±7.01% | 25.39%±7.01% |
| BP network | 72.90%±8.08% | 27.10%±8.08% |
BP, back-propagation; SVM, support vector machine.
Figure 1Accuracy of the different algorithms. BP, back-propagation; SVM, support vector machine.
Results of regression prediction of patient expenses using the 3 algorithms
| Algorithm | NMSE (mean ± SD) |
|---|---|
| Random forest | 0.4194±0.2393 |
| SVM | 1.1222±0.0648 |
| BP network | 1.2986±0.1762 |
BP, back-propagation; SVM, support vector machine; NMSE, normalized mean square error.
Figure 2The NMSE of the different algorithms. NMSE, normalized mean square error; BP, back-propagation; SVM, support vector machine.