| Literature DB >> 28669163 |
Peyman Rezaei Hachesu1, Nazila Moftian, Mahsa Dehghani, Taha Samad Soltani.
Abstract
Background: Data mining, a new concept introduced in the mid-1990s, can help researchers to gain new, profound insights and facilitate access to unanticipated knowledge sources in biomedical datasets. Many issues in the medical field are concerned with the diagnosis of diseases based on tests conducted on individuals at risk. Early diagnosis and treatment can provide a better outcome regarding the survival of lung cancer patients. Researchers can use data mining techniques to create effective diagnostic models. The aim of this study was to evaluate patterns existing in risk factor data of for mortality one year after thoracic surgery for lung cancer.Entities:
Keywords: Data mining; lung neoplasms; cancer survival; informatics; knowledge
Year: 2017 PMID: 28669163 PMCID: PMC6373791 DOI: 10.22034/APJCP.2017.18.6.1531
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
Thoracic Surgery Data Set
| Features (16 inputs and one outcome) |
|---|
| Feature 1: specific combination of ICD-10 codes for primary and secondary as well multiple tumors if any(DGN3,DGN2,DGN4,DGN6,DGN5,DGN8,DGN1) |
| Feature 2: FVC |
| Feature 3:FEV1 |
| Feature 4: Performance status - Zubrod scale (PRZ2,PRZ1,PRZ0) |
| Feature 5: Pain before surgery (T,F) |
| Feature 6: Hemoptysis before surgery (T,F) |
| Feature 7: Dyspnea before surgery (T,F) |
| Feature 8: Cough before surgery (T,F) |
| Feature 9: Weakness before surgery (T,F) |
| Feature 10: T in clinical TNM - size of the original tumor, from OC11 (smallest) to OC14 (largest) |
| Feature 11: Type 2 Diabetes mellitus |
| Feature 12: MI up to 6 months (T,F) |
| Feature 14: Smoking (T,F) |
| Feature 15: Asthma (T,F) |
| Feature 16: Age at surgery (numeric) |
| Features 17: Risk1Y: (T (True or Died (N: 70 patients)), F (False or live(N: 400 patients)) |
Analysis Results of the Key Influencers on The Thoracic Surgery Dataset
| Column | value | Status of Cancer | Relative impact |
|---|---|---|---|
| Type 2 Diabetes mellitus | False | False | 36 |
| Dyspnea before surgery | False | False | 33 |
| T in clinical TNM size of the original tumor | OC14 | True | 100 |
| Type 2 Diabetes mellitus | True | True | 36 |
| Dyspnea before surgery | True | True | 33 |
Analysis Results of Categories Detection for the Thoracic Surgery Dataset
| Category name | No of records | Important factors | Factor value | Relative impact |
|---|---|---|---|---|
| Medium FEV1 | 194 | FEV1 | Medium | 100 |
| FVC | High | 20 | ||
| Low FEV1 with high age | 153 | FEV1 | Low | 100 |
| Age | VeryHigh or, >69 years | 39 | ||
| Low FEV1 with low age | 105 | FEV1 | Low | 100 |
| Age | Low 48-55 years | 19 | ||
| High and Very high FEV1 | 18 | FEV1 | High | 100 |
| FEV1 | VeryHigh | 100 |
Results of Detection Analysis of Exceptions
| Factor | Frequency |
|---|---|
| DGN | 1 |
| Hemoptysis before surgery | 1 |
| Total | 2 |
Figure 1The Cost of Incidence of False Positives and False Negatives and the Threshold Value Based On Repetitive Sampling On the Dataset (Horizontal Axis Indicating the Calculated Threshold Value and Vertical Axis Indicating the Number of Samples); The Threshold
The Proposed Questionnaire To Determine The Risk Of Death According To The Variables Of The Studied Dataset. In this questionnaire, the first column presents the title of features, their ascending values and points of each option. If the sum of the values is higher than 482, the patient is at the risk of one-year postoperative mortality
| DGN | DGN1 | 194 |
| DGN2 | 14 | |
| DGN3 | 49 | |
| DGN4 | 74 | |
| DGN5 | 52 | |
| DGN6 | 0 | |
| DGN8 | 126 | |
| FVC | < 2.59 | 30 |
| 2.59 - 3.18 | 69 | |
| 3.19 - 3.91 | 17 | |
| 3.92 - 4.66 | 33 | |
| >= 4.67 | 0 | |
| FEV1 | < 1.88 | 34 |
| 1.88 - 2.24 | 31 | |
| 2.24 - 2.64 | 0 | |
| 2.64 - 3.24 | 52 | |
| >= 3.24 | 5 | |
| Performance state | PRZ0 | 0 |
| PRZ1 | 8 | |
| PRZ2 | 106 | |
| Pain before surgery | False | 0 |
| True | 33 | |
| Hemoptysis before surgery | False | 0 |
| True | 13 | |
| Dyspnea before surgery | False | 0 |
| True | 42 | |
| Cough before surgery | False | 0 |
| True | 3 | |
| Weakness before surgery | False | 25 |
| True | 0 | |
| T in clinical TNM | OC11 | 0 |
| OC12 | 28 | |
| OC13 | 200 | |
| OC14 | 206 | |
| type 2 Diabetes Mellitus | False | 0 |
| True | 25 | |
| MI up to 6 montds | False | 60 |
| True | 0 | |
| peripheral arterial diseases | False | 41 |
| True | 0 | |
| Smoking | False | 0 |
| True | 21 | |
| Astdma | False | 63 |
| True | 0 | |
| Age | < 48 | 36 |
| 48 - 55 | 12 | |
| 55 - 62 | 45 | |
| 62 - 69 | 0 | |
| >= 69 | 9 | |
| Total |