| Literature DB >> 33907111 |
Hanxu Guo1, Xianjie Jia2, Hao Liu3.
Abstract
ABSTRACT: To explore the influencing factors of prostate cancer occurrence, set up risk prediction model, require reference for the preliminary diagnosis of clinical doctors, this model searched database through the data of prostate cancer patients and prostate hyperplasia patients National Clinical Medical Science Data Center.With the help of Stata SE 12.0 and SPSS 25.0 software, the biases between groups were balanced by propensity score matching. Based on the matched data, the relevant factors were further screened by stepwise logistic regression analysis, the key variable and artificial neural network model are established. The prediction accuracy of the model is evaluated by combining the probability of test set with the area under receiver operating characteristic curve (ROC).After 1:2 PSM, 339 pairs were matched successfully. There are 159 cases in testing groups and 407 cases in training groups. And the regression model was P = 1 / (1 + e (0.122 ∗ age + 0.083 ∗ Apo lipoprotein C3 + 0.371 ∗ total prostate specific antigen (tPSA) -0.227 ∗ Apo lipoprotein C2-6.093 ∗ free calcium (iCa) + 0.428 ∗ Apo lipoprotein E-1.246 ∗ triglyceride-1.919 ∗ HDL cholesterol + 0.083 ∗ creatine kinase isoenzyme [CKMB])). The logistic regression model performed very well (ROC, 0.963; 95% confidence interval, 0.951 to 0.978) and artificial neural network model (ROC, 0.983; 95% confidence interval, 0.964 to 0.997). High degree of Apo lipoprotein E (Apo E) (Odds Ratio, [OR], 1.535) in blood test is a risk factor and high triglyceride (TG) (OR, 0.288) is a protective factor.It takes the biochemical examination of the case as variables to establish a risk prediction model, which can initially reflect the risk of prostate cancer and bring some references for diagnosis and treatment.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33907111 PMCID: PMC8084031 DOI: 10.1097/MD.0000000000025602
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Basic demographics.
| Before PSM | Optimization degree (%) | After PSM | Optimization degree (%) | ||||||
| Variable | Non-PCa n = 458 | PCa n = 229 | Relative bias | Non-PCa n = 458 | PCa n = 229 | Relative bias | Bias (%) | ||
| AGE | 54.733 | 66.279 | .000 | 119.3 | 66.899 | 66.274 | .504 | −6.5 | 94.6 |
| WEIGHT | 77.019 | 72.611 | .000 | −41.5 | 72.367 | 72.686 | .758 | 3.0 | 92.8 |
| BMI | 25.882 | 24.876 | .000 | −31.9 | 24.901 | 24.906 | .987 | 0.2 | 99.5 |
| Apo C3 | 11.801 | 10.956 | .109 | −8.8 | 10.343 | 10.975 | .484 | 6.6 | 25.3 |
| Apo A2 | 27.925 | 26.724 | .001 | −23.9 | 26.674 | 26.761 | .851 | 1.7 | 92.8 |
| Apo C2 | 5.1609 | 4.0804 | .000 | −37.8 | 4.0751 | 4.0728 | .847 | −0.1 | 99.8 |
| Apo E | 4.2864 | 4.7976 | .000 | 30.1 | 4.9889 | 4.8034 | .365 | −10.9 | 63.7 |
| ALB | 43.553 | 41.414 | .000 | −54.7 | 41.168 | 41.395 | .642 | 5.8 | 89.4 |
| CKMB | 12.48 | 16.034 | .000 | 36.7 | 16.548 | 16.095 | .682 | −0.3 | 87.3 |
| fPSA | 0.4132 | 3.9925 | .000 | 14.8 | 1.1049 | 1.538 | .136 | 1.8 | 87.9 |
| tPSA | 1.9376 | 25.824 | .000 | 31.8 | 8.9587 | 14.662 | .061 | 5.8 | 76.1 |
| Ca | 2.2898 | 2.2544 | .000 | −32.1 | 2.2439 | 2.2541 | .387 | 9.2 | 71.2 |
| CL | 105.05 | 103.22 | .000 | −58.5 | 102.93 | 103.23 | .341 | 9.6 | 83.7 |
| IP | 1.1793 | 1.1541 | .064 | −13.2 | 1.1378 | 1.155 | .340 | 9.1 | 31.5 |
| iCa | 1.2019 | 1.1589 | .000 | −73.8 | 1.1521 | 1.1586 | .217 | 11.2 | 84.8 |
| LDH | 150.65 | 158 | .000 | 22.3 | 159.28 | 158.13 | .722 | −3.5 | 84.3 |
| CK | 103.83 | 98.169 | .443 | −6.4 | 101.64 | 98.779 | .670 | −3.2 | 49.5 |
| Cre | 80.962 | 84.258 | .318 | 6.8 | 85.771 | 84.32 | .722 | −3.0 | 56.0 |
| TG | 1.9869 | 1.4027 | .000 | −42.2 | 1.411 | 1.4054 | .330 | −0.4 | 99.0 |
| HDL-C | 1.1229 | 1.217 | .000 | 31.2 | 1.2167 | 1.223 | .839 | −2.1 | 93.4 |
| Apo A1 | 1.1679 | 1.3294 | .000 | 67.0 | 1.3451 | 1.3301 | .566 | −6.2 | 90.7 |
| K | 4.0508 | 4.0311 | .409 | −6.1 | 4.0239 | 4.0322 | .799 | 2.6 | 58.0 |
| Apo B | 0.8912 | 0.96598 | .000 | 31.8 | 0.99848 | 0.96743 | .218 | −13.2 | 58.5 |
| Na | 141.9 | 141.94 | .851 | 1.2 | 141.79 | 141.91 | .661 | 4.7 | −300.7 |
| ALP | 68.444 | 71.784 | .039 | 9.1 | 66.53 | 71.433 | .159 | 13.3 | −46.8 |
| LDL-C | 2.8569 | 2.9132 | .322 | 6.8 | 2.919 | 3.0009 | .330 | −10.0 | −45.5 |
Multivariable regression analysis.
| Step | Relative variable | β | S.E | Wald | OR | |
| Step 1 | AGE | 0.168 | 0.014 | 152.430 | .000 | 1.183 |
| constant | −10.338 | 0.824 | 157.255 | .000 | 0.000 | |
| Step 2 | AGE | 0.161 | 0.014 | 130.246 | .000 | 1.174 |
| iCa | −11.848 | 2.025 | 34.238 | .000 | 0.000 | |
| constant | 4.139 | 2.467 | 2.816 | .093 | 62.758 | |
| Step 3 | AGE | 0.132 | 0.015 | 78.299 | .000 | 1.141 |
| iCa | −13.053 | 2.160 | 36.536 | .000 | 0.000 | |
| TG | −0.593 | 0.126 | 22.060 | .000 | 0.553 | |
| constant | 8.492 | 2.716 | 9.774 | .002 | 4877.526 | |
| Step 4 | AGE | 0.142 | 0.016 | 81.421 | .000 | 1.153 |
| iCa | −13.595 | 2.253 | 36.418 | .000 | 0.000 | |
| HDLC | −2.209 | 0.450 | 24.129 | .000 | 0.110 | |
| TG | −0.882 | 0.151 | 33.932 | .000 | 0.414 | |
| constant | 11.697 | 2.880 | 16.493 | .000 | 120193.950 | |
| Step 5 | AGE | 0.137 | 0.016 | 72.925 | .000 | 1.147 |
| iCa | −13.295 | 2.323 | 32.764 | .000 | 0.000 | |
| HDLC | −2.679 | 0.481 | 31.005 | .000 | 0.069 | |
| TG | −1.328 | 0.211 | 39.586 | .000 | 0.265 | |
| Apo E | 0.349 | 0.102 | 11.776 | .001 | 1.418 | |
| constant | 11.310 | 2.977 | 14.432 | .000 | 81646.190 | |
| Step 6 | AGE | 0.132 | 0.016 | 67.985 | .000 | 1.141 |
| iCa | −13.338 | 2.373 | 31.592 | .000 | 0.000 | |
| CKMB | 0.080 | 0.029 | 7.537 | .006 | 1.083 | |
| HDLC | −2.652 | 0.497 | 28.505 | .000 | 0.070 | |
| TG | −1.321 | 0.217 | 36.990 | .000 | 0.267 | |
| Apo E | 0.344 | 0.101 | 11.531 | .001 | 1.410 | |
| constant | 10.493 | 3.007 | 12.179 | .000 | 36072.605 | |
| Step 7 | AGE | 0.111 | 0.017 | 40.784 | .000 | 1.117 |
| iCa | −13.081 | 2.674 | 23.926 | .000 | 0.000 | |
| CKMB | 0.085 | 0.032 | 7.327 | .007 | 1.089 | |
| tpsa | 0.352 | 0.066 | 28.124 | .000 | 1.422 | |
| HDLC | −2.138 | 0.563 | 14.404 | .000 | 0.118 | |
| TG | −1.294 | 0.250 | 26.736 | .000 | 0.274 | |
| Apo E | 0.408 | 0.106 | 14.836 | .000 | 1.504 | |
| constant | 9.310 | 3.361 | 7.670 | .006 | 11042.987 | |
| Step 8 | AGE | 0.112 | 0.018 | 40.924 | .000 | 1.119 |
| Apo C3 | 0.062 | 0.026 | 5.549 | .018 | 1.064 | |
| iCa | −13.831 | 2.741 | 25.457 | .000 | 0.000 | |
| CKMB | 0.083 | 0.032 | 6.834 | .009 | 1.086 | |
| tpsa | 0.363 | 0.069 | 27.811 | .000 | 1.438 | |
| HDLC | −2.432 | 0.589 | 17.071 | .000 | 0.088 | |
| TG | −1.687 | 0.310 | 29.538 | .000 | 0.185 | |
| Apo E | 0.387 | 0.112 | 11.907 | .001 | 1.472 | |
| constant | 10.484 | 3.455 | 9.210 | .002 | 35751.766 | |
| Step 9 | AGE | 0.108 | 0.018 | 38.036 | .000 | 1.114 |
| Apo C2 | −0.211 | 0.100 | 4.441 | .035 | 0.810 | |
| Apo C3 | 0.089 | 0.035 | 6.544 | .011 | 1.093 | |
| iCa | −13.158 | 2.755 | 22.810 | .000 | 0.000 | |
| CKMB | 0.078 | 0.031 | 6.234 | .013 | 1.082 | |
| tpsa | 0.368 | 0.071 | 27.160 | .000 | 1.445 | |
| HDLC | −2.269 | 0.596 | 14.521 | .000 | 0.103 | |
| TG | −1.405 | 0.324 | 18.847 | .000 | 0.245 | |
| Apo E | 0.412 | 0.108 | 14.596 | .000 | 1.510 | |
| constant | 9.927 | 3.470 | 8.183 | .004 | 20466.604 |
Figure 1Comparison of ROC curves. ROC curves showed that the new model that based on the multivariable logistic regression has higher prediction efficiency than the model that only be built on PSA.
Figure 2Artificial neural network. Artificial neural network shows that the different importance of independent variables, which similar to the model that based on logistic regression.