| Literature DB >> 31888439 |
Mingzhao Wang1,2, Linglong Ding3,4, Meng Xu3, Juanying Xie5, Shengli Wu6, Shengquan Xu7, Yingmin Yao3, Qingguang Liu8.
Abstract
BACKGROUND: Portal vein system thrombosis (PVST) is potentially fatal for patients if the diagnosis is not timely or the treatment is not proper. There hasn't been any available technique to detect clinic risk factors to predict PVST after splenectomy in cirrhotic patients. The aim of this study is to detect the clinic risk factors of PVST for splenectomy and cardia devascularization patients for liver cirrhosis and portal hypertension, and build an efficient predictive model to PVST via the detected risk factors, by introducing the machine learning method. We collected 92 clinic indexes of splenectomy plus cardia devascularization patients for cirrhosis and portal hypertension, and proposed a novel algorithm named as RFA-PVST (Risk Factor Analysis for PVST) to detect clinic risk indexes of PVST, then built a SVM (support vector machine) predictive model via the detected risk factors. The accuracy, sensitivity, specificity, precision, F-measure, FPR (false positive rate), FNR (false negative rate), FDR (false discovery rate), AUC (area under ROC curve) and MCC (Matthews correlation coefficient) were adopted to value the predictive power of the detected risk factors. The proposed RFA-PVST algorithm was compared to mRMR, SVM-RFE, Relief, S-weight and LLEScore. The statistic test was done to verify the significance of our RFA-PVST.Entities:
Keywords: Cardia devascularization; Discernibility; Feature selection; Independence; Liver cirrhosis; Portal hypertension; Portal vein system thrombosis (PVST); Risk degree; SVM; Splenectomy
Mesh:
Year: 2019 PMID: 31888439 PMCID: PMC6936084 DOI: 10.1186/s12859-019-3233-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Scatter plots of clinic indexes of 5-fold cross validation experiments
the clinic indexes ranked in descending order in their risk degrees of 5-flod cross validation experiments
| Folds | Clinic index |
|---|---|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 |
The underlined bold fonts mean the detected risk factors
Performance of PVST predictive models on different sets of risk indicators of 5-fold cross validation experiments
| Fold ( | Acc | AUC | Sensitivity | Specificity | Precision | F-measure | FNR | FPR | FDR | MCC | # selected features |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 (2,0.0625) | 0.68 | 0.64 | 0.78 | 0.70 | 0.36 | 0.22 | 0.38 | 1 | |||
| 0.74 | 0.89 | 0.91 | 0.50 | 0.71 | 0.80 | 0.09 | 0.50 | 0.29 | 0.46 | 2 | |
| 2 (0.25,0.25) | 0.74 | 0.84 | 0.38 | 0.69 | 0.81 | 0.62 | 0.31 | 0.51 | 2 | ||
| 0.84 | 8 | ||||||||||
| 3 (0.125,16) | 0.61 | 0.85 | 0.70 | 0.50 | 0.64 | 0.67 | 0.30 | 0.50 | 0.36 | 0.20 | 2 |
| 0.72 | 0.59 | 0.90 | 0.50 | 0.69 | 0.78 | 0.10 | 0.50 | 0.31 | 0.44 | 4 | |
| 0.61 | 0.76 | 0.80 | 0.38 | 0.62 | 0.70 | 0.20 | 0.62 | 0.38 | 0.19 | 6 | |
| 4 (0.25,0.0625) | 0.33 | 0.55 | 0.40 | 0.25 | 0.40 | 0.40 | 0.60 | 0.75 | 0.60 | −0.35 | 1 |
| 0.33 | 0.44 | 0.40 | 0.25 | 0.40 | 0.40 | 0.60 | 0.75 | 0.60 | −0.35 | 2 | |
| 0.33 | 0.23 | 0.50 | 0.13 | 0.42 | 0.45 | 0.50 | 0.87 | 0.58 | − 0.40 | 4 | |
| 5 (0.5,0.125) | 0.56 | 0.80 | 0.60 | 0.50 | 0.60 | 0.60 | 0.40 | 0.50 | 0.40 | 0.10 | 1 |
| 0.56 | 0.70 | 0.50 | 0.63 | 0.63 | 0.56 | 0.50 | 0.37 | 0.37 | 0.13 | 3 | |
| 0.61 | 0.65 | 0.80 | 0.38 | 0.62 | 0.70 | 0.20 | 0.62 | 0.38 | 0.19 | 5 | |
| Average | 0.59 | 0.70 | 0.70 | 0.45 | 0.62 | 0.65 | 0.30 | 0.55 | 0.38 | 0.18 | – |
The underlined bold fonts mean the best results
Experimental results of algorithms of 5-fold cross validation experiments
| Algorithms | Acc | AUC | Sensitivity | Specificity | Precision | F-measure | FNR | FPR | FDR | MCC |
|---|---|---|---|---|---|---|---|---|---|---|
| RFA-PVST | 0.70 | 0.65 | 0.30 | 0.18 | ||||||
| mRMR | 0.53 | 0.53 | 0.78 | 0.20 | 0.55 | 0.64 | 0.22 | 0.80 | 0.45 | NaN |
| SVM-RFE | 0.56 | 0.44 | 0 | 0.56 | 1.00 | 0.44 | NaN | |||
| Relief | 0.56 | 0.56 | 0 | 0.56 | 1.00 | 0.44 | NaN | |||
| S-weight | 0.54 | 0.52 | 0.95 | 0.02 | 0.55 | 0.69 | 0.05 | 0.98 | 0.45 | NaN |
| LLEScore | 0.53 | 0.53 | 0.66 | 0.37 | 0.57 | 0.60 | 0.34 | 0.63 | 0.43 | NaN |
The underlined bold fonts mean the best results
The Friedman’s test results with α = 0.05 of our RFA-PVST and mRMR, SVM-RFE, Relief, S-weight and LLEScore
| Acc | AUC | Sensitivity | Specificity | Precision | |
|---|---|---|---|---|---|
| 11.7790 | 12.5166 | 37.4303 | 46.7109 | 17.0809 | |
| 5 | 5 | 5 | 5 | 5 | |
| 0.0379 | 0.0284 | 4.9e-07 | 6.507e-09 | 0.0043 |
Paired rank comparison of algorithms in Acc, AUC, sensitivity, specificity, and precision of predictive model built on clinic risk indicators to PVST detected by algorithms
| RFA-PVST | mRMR | SVM-RFE | Relief | S-weight | LLEScore | |
|---|---|---|---|---|---|---|
| RFA-PVST | 1.7308 | 1.1923 | 1.1923 | 1.5769 | 1.0000 | |
| mRMR | * | −0.5385 | −0.5385 | − 0.1538 | − 0.7308 | |
| SVM-RFE | 0 | 0.3846 | −0.1923 | |||
| Relief | 0.3846 | −0.1923 | ||||
| S-weight | −0.5769 | |||||
| LLEScore | ||||||
| RFA-PVST | mRMR | SVM-RFE | Relief | S-weight | LLEScore | |
| RFA-PVST | 1.7308 | 2.4231 | 1.7308 | 1.8077 | 1.7692 | |
| mRMR | 0.6923 | 0 | 0.0769 | 0.0385 | ||
| SVM-RFE | * | −0.6923 | −0.6154 | − 0.6538 | ||
| Relief | 0.0769 | 0.0385 | ||||
| S-weight | −0.0385 | |||||
| LLEScore | ||||||
| RFA-PVST | mRMR | SVM-RFE | Relief | S-weight | LLEScore | |
| RFA-PVST | −0.5769 | −2.2692 | −2.2692 | −1.8077 | 0.4615 | |
| mRMR | −1.6923 | −1.6923 | −1.2308 | 1.0385 | ||
| SVM-RFE | * | 0 | 0.4615 | 2.7308 | ||
| Relief | * | 0.4615 | 2.7308 | |||
| S-weight | * | 2.2692 | ||||
| LLEScore | * | * | * | |||
| RFA-PVST | mRMR | SVM-RFE | Relief | S-weight | LLEScore | |
| RFA-PVST | 1.3077 | 2.9615 | 2.9615 | 2.6923 | 0.2308 | |
| mRMR | 1.6538 | 1.6538 | 1.3846 | −1.0769 | ||
| SVM-RFE | * | 0 | −0.2692 | −2.7308 | ||
| Relief | * | −0.2692 | −2.7308 | |||
| S-weight | * | −2.4615 | ||||
| LLEScore | * | * | * | |||
| RFA-PVST | mRMR | SVM-RFE | Relief | S-weight | LLEScore | |
| RFA-PVST | 1.8462 | 1.8846 | 1.8846 | 2.3077 | 1.0769 | |
| mRMR | * | 0.0385 | 0.0385 | 0.4615 | −0.7692 | |
| SVM-RFE | * | 0 | 0.4231 | −0.8077 | ||
| Relief | * | 0.4231 | −0.8077 | |||
| S-weight | * | −1.2308 | ||||
| LLEScore |
Data information
| Male | Female | Age range ( | |
|---|---|---|---|
| PVST | 30 | 22 | 20~71 (47 ± 10) |
| non-PVST | 22 | 18 | 27~77 (47.9 ± 10.8) |
Clinic indexes of splenectomy with cardia devascularization for cirrhotic and portal hypertension patients
| ID | Index name | ID | Index name | ID | Index name |
|---|---|---|---|---|---|
| 1 | Age | 12 | BUN (blood urea nitrogen) | 23 | NE1 (neutrophil count of 1st test) |
| 2 | Gender | 13 | CRE (creatinine) | 24 | NE2 (neutrophil count of 2nd test) |
| 3 | Weight | 14 | GLU (glucose) | 25 | PLT (Platelets) |
| 4 | BV (bleeding volume) | 15 | Na (Natrium) | 26 | PT (prothrombin time) |
| 5 | AST (aspartate aminotransferase) | 16 | K (Kalium) | 27 | INR (International normalized ratio) |
| 6 | ALT (alanine transaminase) | 17 | Ca (calcium) | 28 | APTT (activated partial thromboplastin time) |
| 7 | CHOL (cholesterol) | 18 | RBC (Red blood cell) | 29 | TT (thrombin time) |
| 8 | TBIL (total bilirubin) | 19 | HGB (hemoglobin) | 30 | FIB (fibrinogen) |
| 9 | DBIL (direct bilirubin) | 20 | WBC (White blood cell) | 31 | D-D (D dimer) |
| 10 | TP (total protein) | 21 | LY1 (lymphocyte count of 1st test) | 32 | Anticoagulant therapy, |
| 11 | ALB (albumin) | 22 | LY2 (lymphocyte count of 2nd test) | 33 | Antiplatelet aggregation therapy |
Confusion matrix
| Predictive Positive class (PVST patients) | Predictive Negative class (non-PVST patients) | |
|---|---|---|
| True Positive class (PVST patients) | True positive (TP) | False negative (FN) |
| True Negative class (non-PVST patients) | False positive (FP) | True negative (TN) |