| Literature DB >> 32993576 |
Mengjie Chen1, He Wang1, Yuejuan Liang1, Mingmiao Hu1, Li Li2.
Abstract
BACKGROUND: To study the risk factors involved in the occurrence and progression of cervical intraepithelial neoplasia (CIN) and to establish predictive models.Entities:
Keywords: Bioinformatics; Cervical cancer; Cervical intraepithelial neoplasia; Random forest model
Mesh:
Substances:
Year: 2020 PMID: 32993576 PMCID: PMC7523359 DOI: 10.1186/s12885-020-07265-7
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1the weight of factors in model3. the ROC curve of model3
The clinical features and genes expressed in the CIN and SCC groups
| CIN | SCC | Z/X2 | P | |
|---|---|---|---|---|
| 41.71 ± 9.31 | 49.87 ± 10.03 | −3.575 | ||
| gravidity | 3.46 ± 1.84 | 4.00 ± 1.44 | −1.720 | 0.085 |
| 1.67 ± 1.34 | 2.30 ± 1.34 | −2.377 | ||
| HPV infection | ||||
| positive | 47 | 28 | / | 1.000 |
| negative | 5 | 2 | ||
| 13.610 | ||||
| yes | 9 | 17 | ||
| no | 43 | 13 | ||
| TCT | 0.109 | |||
| CINII-(Normal/ASCUS) | 16 | 4 | / | |
| CINII+(LSIL/HSIL/SCC) | 36 | 26 | ||
| CTNNB1 | 0.88 ± 0.24 | 4.23 ± 2.47 | −1.637 | 0.102 |
| 665.56 ± 357.49 | 15,156.14 ± 9065.91 | −3.023 | ||
| 0.50 ± 0.30 | 0.10 ± 0.05 | −3.110 | ||
| 0.56 ± 0.11 | 5.49 ± 2.66 | −4.130 | ||
| 135.69 ± 81.39 | 4062.52 ± 1658.02 | −3.783 | ||
| TP73 | 75.39 ± 21.31 | 408.60 ± 145.24 | −1.714 | 0.087 |
| MUC2 | 1.89 ± 1.26 | 1.35 ± 0.70 | −1.271 | 0.204 |
| PIK3CA | 7.28 ± 1.98 | 8.60 ± 3.22 | −0.053 | 0.958 |
| CCND2 | 1.40 ± 0.72 | 0.53 ± 0.25 | −1.078 | 0.281 |
| 16.59 ± 6.14 | 599.16 ± 375.43 | −2.089 | ||
The clinical features and genes expressed in the CIN and normal groups
| normal | CIN | Z/X2 | P | |
|---|---|---|---|---|
| 44.08 ± 10.00 | 41.71 ± 9.31 | −1.480 | 0.139 | |
| gravidity | 3.00 ± 1.76 | 3.46 ± 1.84 | −1.536 | 0.124 |
| 1.66 ± 0.85 | 1.67 ± 1.34 | −0.561 | 0.561 | |
| HPV infection | 26.552 | |||
| positive | 15 | 47 | ||
| negative | 23 | 5 | ||
| yes | 9 | 9 | 0.558 | 0.455 |
| no | 29 | 43 | ||
| TCT | ||||
| CINII-(Normal/ASCUS) | 28 | 10 | / | |
| CINII+(LSIL/HSIL/SCC) | 16 | 36 | ||
| CTNNB1 | 0.85 ± 0.15 | 0.88 ± 0.24 | −1.626 | 0.104 |
| 1613.35 ± 369.73 | 665.56 ± 357.49 | −2.916 | ||
| 0.04 ± 0.01 | 0.50 ± 0.30 | −2.728 | ||
| PRKCI | 0.65 ± 0.22 | 0.56 ± 0.11 | −1.675 | 0.094 |
| CSKN1A1 | 211.45 ± 120.00 | 135.69 ± 81.39 | −0.270 | 0.787 |
| TP73 | 150.80 ± 44.11 | 75.39 ± 21.31 | −1.805 | 0.071 |
| MUC2 | 0.92 ± 0.25 | 1.89 ± 1.26 | −0.343 | 0.732 |
| PIK3CA | 4.91 ± 0.87 | 7.28 ± 1.98 | −1.005 | 0.315 |
| CCND2 | 0.47 ± 0.19 | 1.40 ± 0.72 | −0.556 | 0.579 |
| 40.18 ± 11.28 | 16.59 ± 6.14 | −2.091 | ||
Logistic regression analysis of risk factors for CIN progression
| Univariate logistic analysis | Multivariate logistic analysis | |||||
|---|---|---|---|---|---|---|
| HR | 95% CI | P | HR | 95% CI | P | |
| 1.089 | 1.034–1.146 | 0.986 | 0.886–1.097 | 0.794 | ||
| 6.248 | 2.256–17.303 | 11.36 | 1.175–117.976 | |||
| parity | 1.416 | 0.990–2.026 | 1.064 | 0.599–1.888 | 0.833 | |
| TGFBR2 | 1.000 | 1.000–1.000 | 0.174 | / | / | |
| FOXO1 | 0.404 | 0.079–2.053 | 0.274 | / | / | |
| 2.113 | 1.212–3.683 | 3.363 | 1.153–9.810 | |||
| 1.001 | 1.000–1.001 | 1.001 | 1.000–1.002 | |||
| CTBP2 | 1.006 | 0.999–1.012 | 1.005 | 0.988–1.023 | 0.534 | |
Logistic regression analysis of risk factors for CIN occurrence
| Univariate logistic analysis | Multivariate logistic analysis | |||||
|---|---|---|---|---|---|---|
| HR | 95% CI | P | HR | 95% CI | P | |
| 14.413 | 4.664–44.545 | 18.984 | 4.368–82.504 | |||
| 6.300 | 2.481–15.995 | 9.785 | 2.525–37.921 | |||
| TGFBR2 | 1.000 | 1.000–1.000 | 0.106 | / | / | |
| 207.63 | 1.063–40,539.222 | 22.660 | 0.136–3789.024 | 0.233 | ||
| CTBP2 | 0.992 | 0.983–1.001 | 0.987 | 0.977–0.997 | ||
Random forest models for predicting CIN progression
| NO. | Indicators | Factors | accuracy | AUC | OOB |
|---|---|---|---|---|---|
| 1 | All clinical features | age + menopause+HPV + gravidity+parity+TCT | 65.85 | 67.75 | 36.59 |
| 2 | Significant genes | TGFBR2 + CSKN1A1 + PRKCI+FOXO1 + CTBP2 | 73.17 | 86.75 | 29.27 |
| Significant genes + significant clinical features | TGFBR2 + CSKN1A1 + PRKCI+FOXO1 + CTBP2+ menopause+parity+age | ||||
| 4 | Genes as the risk factors in unvariable logistic analysis | CSKN1A1 + PRKCI+CTBP2 | 68.29 | 72.75 | 24.39 |
| 5 | Genes as the risk factors in unvariable logistic analysis + Significant genes | CSKN1A1 + PRKCI+CTBP2+ menopause+parity+age | 68.29 | 78.75 | 26.83 |
| 6 | Genes as the independent factors in multivariable logistic analysis | CSKN1A1 + PRKCI | 70.73 | 68.25 | 21.95 |
| 7 | Genes as the independent factors in multivariable logistic analysis | CSKN1A1 + PRKCI+ menopause+parity+age | 68.29 | 76.75 | 26.83 |
Random forest models for predicting CIN occurrence
| NO. | Indicators | Factors | accuracy | AUC | OOB |
|---|---|---|---|---|---|
| 8 | All clinical features | age + menopause+HPV + gravidity+parity+TCT | 60.00 | 75.20 | 28.89 |
| 9 | Significant genes | TGFBR2+ FOXO1 + CTBP2 | 71.11 | 70.54 | 26.67 |
| 10 | Significant genes + significant clinical features | TGFBR2 + CTBP2 + FOXO1 + HPV + TCT | 77.78 | 92.09 | 26.67 |
| 11 | Genes as the risk factors in unvariable logistic analysis | CTBP2 + FOXO1 | 73.33 | 74.51 | 40.00 |
| Genes as the risk factors in unvariable logistic analysis + Significant genes | |||||
| 13 | Genes as the independent factors in multivariable logistic analysis | CTBP2 | / | / | / |
| 14 | Genes as the independent factors in multivariable logistic analysis | CTBP2 + HPV + TCT | 75.56 | 85.38 | 26.67 |
Fig. 2the weight of factors in model12. the ROC curve of model12
Fig. 3the roles of CSNK1A1 in Wnt signaling pathway
Fig. 4the roles and locations of PIK3CA and FOXO1 in HPV infection pathway
Fig. 5the roles of TGFBR2 and CTBP2 in TGFB signaling pathway
Fig. 6the role and location of PRKCI in HPV infection pathway
Fig. 7the roles and locations of CCND2 and CTNNB1 in Hippo signaling pathway and HPV infection pathway