| Literature DB >> 35924264 |
N Junath1, Alok Bharadwaj2, Sachin Tyagi3, Kalpana Sengar4, Mohammad Najmus Saquib Hasan5, M Jayasudha6.
Abstract
The diagnosis and treatment of patients in the healthcare industry are greatly aided by data analytics. Massive amounts of data should be handled using machine learning approaches to provide tools for prediction and categorization to support practitioner decision-making. Based on the kind of tumor, disorders like breast cancer can be categorized. The difficulties associated with evaluating vast amounts of data should be overcome by discovering an efficient method for categorization. Based on the Bayesian method, we analyzed the influence of clinic pathological indicators on the prognosis and survival rate of breast cancer patients and compared the local resection value directly using the lymph node ratio (LNR) and the overall value using the LNR differences in effect between estimates. Logistic regression was used to estimate the overall LNR of patients. After that, a probabilistic Bayesian classifier-based dynamic regression model for prognosis analysis is built to capture the dynamic effect of multiple clinic pathological markers on patient prognosis. The dynamic regression model employing the total estimated value of LNR had the best fitting impact on the data, according to the simulation findings. In comparison to other models, this model has the greatest overall survival forecast accuracy. These prognostic techniques shed light on the nodal survival and status particular to the patient. Additionally, the framework is flexible and may be used with various cancer types and datasets.Entities:
Mesh:
Year: 2022 PMID: 35924264 PMCID: PMC9343185 DOI: 10.1155/2022/1859222
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Figure 1Overall flowchart.
Dataset sample characteristics.
| Feature name | Value | Number of samples (percentage) | Mean |
|---|---|---|---|
| Track time | 0~59 | 4405 (100) | 37.89 |
| State | Die | 160 (3.8) | |
| Survive | 4250 (95.69) | ||
| Total number of lymph nodes | 1~84 | 4405 (100) | 12.68 |
| Number of positive lymph nodes | 1~82 | 4405 (100) | 4.71 |
| Age at diagnosis | 20~80 | 4405 (100) | 57.12 |
| [20,30) | 48 (1.3) | ||
| [30,40) | 334 (8.6) | ||
| [40,50) | 951 (26.34) | ||
| [50,60) | 1064 (28.26) | ||
| [60,70) | 975 (25.18) | ||
| [70,80] | 520 (13.25) | ||
| T stage | T0 | 78 (1.9) | |
| T1 | 1028 (32.8) | ||
| T2 | 1766 (40.4) | ||
| T3 | 600 (13.8) | ||
| T4 | 235 (5.6) | ||
| TX adjusted | 58 (1.4) | ||
| Others | 240 (5.5) | ||
| M stage | M0 | 4112 (94.12) | |
| M1 | 240 (5.6) | ||
| M2 | 53 (1.4) | ||
| N stage | N1 | 2870 (65.4) | |
| N2 | 889 (20.11) | ||
| N3 | 588 (13.6) | ||
| NX adjusted | 58 (1.2) |
Figure 2Prerelationship between parameters.
Coefficients of some predictors.
| Predictor variable | Estimated value | Standard deviation |
|---|---|---|
| Intercept | 0.227 | 0.084 |
| Total number of lymph nodes | -0.183 | 0.015 |
| Number of positive lymph nodes | 0.367 | 0.029 |
| M1 | 0.589 | 0.248 |
| MX | -0.148 | 1.245 |
| N2 | 0.505 | 0.141 |
| N3 | 0.531 | 0.248 |
| NX adjusted | 0.298 | 0.173 |
Figure 3Predicted training and test set standard deviation coefficients.
Model features.
| Name | LNR data | Survival analysis model | LPML |
|---|---|---|---|
| Model_1 | Local cut value | Standard model | -719.45 |
| Model_2 | Overall estimate | Standard model | -705.81 |
| Model_3 | Local cut value | Dynamic model | -703.11 |
| Model_4 | Overall estimate | Dynamic model | -694.43 |
Figure 4Predicted training and test set survival-time curves.