| Literature DB >> 29158886 |
Mindaugas Marozas1, Romanas Zykus2, Andrius Sakalauskas1, Limas Kupčinskas2, Arūnas Lukoševičius1.
Abstract
Portal hypertension (PHT) is a key event in the evolution of different chronic liver diseases and leads to the morbidity and mortality of patients. The traditional reliable PHT evaluation method is a hepatic venous pressure gradient (HVPG) measurement, which is invasive and not always available or acceptable to patients. The HVPG measurement is relatively expensive and depends on the experience of the physician. There are many potential noninvasive methods to predict PHT, of which liver transient elastography is determined to be the most accurate; however, even transient elastography lacks the accuracy to be a perfect noninvasive diagnostic method of PHT. In this research, we are focusing on noninvasive PHT assessment methods that rely on selected best-supervised learning algorithms which use a wide set of noninvasively obtained data, including demographical, clinical, laboratory, instrumental, and transient elastography measurements. In order to build the best performing classification meta-algorithm, a set of 21 classification algorithms have been tested. The problem was expanded by selecting the best performing clinical attributes using algorithm-specific filtering methods that give the lowest error rate to predict clinically significant PHT. The suggested meta-algorithm objectively outperforms other methods found in literature and can be a good substitute for invasive PHT evaluation methods.Entities:
Mesh:
Year: 2017 PMID: 29158886 PMCID: PMC5660781 DOI: 10.1155/2017/6183714
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Contents of data collected during clinical examination.
| Demographical data | Age, gender |
| Clinical data | Height, weight, cause of chronic liver disease |
| Laboratory data | PLT, bilirubin, albumin, prothrombin time, ALT, AST |
| Instrumental data | Spleen width, spleen thickness, spleen length, HVPG |
| TE data | Liver stiffness, spleen stiffness |
| Blood stream data | Presence of umbilical or pararenal shunt, portal vein width, portal blood flow mean peak velocity, portal blood flow mean velocity, portal blood flow type (hepatopetal versus hepatofugal), hepatic vein blood flow type (triphasic, biphasic, or monophasic), hepatic vein damping index, splenic vein width |
Figure 1Selecting important attributes without missing data.
Algorithms used in research.
| BayesNet, Naive Bayes |
| Logistic, Multilayer Perceptron, SGD, Simple Logistic, SMO, Voted Perceptron |
| LazyIBk, lazy.Kstar |
| DecisionTable, JRip, OneR, Part |
| Decision Stump, Hoeffding Tree, J48, LMT, Random Forest, Random Tree, RepTree |
Figure 2The meta-algorithm selects the best performing classifiers and optimal sets of attributes from predefined groups. Each best classifier is then trained and results are combined into a final prediction.
Best performing classification algorithms and attributes from five different groups.
| Algorithm | Records | Increase | AUC | Best performing attributes |
|---|---|---|---|---|
| Naïve Bayes | 91 | 40% | 0.95 | ALT, albumin, pararenal shunt, liver stiffness, spleen stiffness |
| Logistic regression | 98 | 50.77% | 0.96 | Albumin, pararenal shunt, liver stiffness |
| lazy.Kstar | 82 | 26.15% | 0.97 | Umbilical shunt, pararenal shunt, hepatic venous blood flow type, splenic vein width, liver stiffness |
| Decision Table | 103 | 58.46% | 0.89 | Albumin, liver stiffness |
| Random Forest | 107 | 64.62% | 0.96 | Age, albumin, pararenal shunt, liver stiffness, spleen stiffness |
Figure 3Averaged performance of classification algorithms on data without missing values with standard error bars.
Figure 4Performance of classification algorithms on data with unprocessed missing values.
Figure 5Ranked attributes using relief filter.
Figure 6Performance of classification algorithms on data with missing values removed from top-ranked attributes.
Figure 7Performance of classification algorithms on data with missing values imputed using KNN method.
The results of all four algorithms test scenarios.
| Scenario I | Scenario II | Scenario III | Scenario IV | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Algorithm | AUC | STDERR |
| AUC | STDERR |
| AUC | STDERR |
| AUC | STDERR |
|
| Bayes Net | 0.88 | 0.04 | Base | 0.90 | 0.03 | Base | 0.90 | 0.04 | Base | 0.89 | 0.03 | Base |
| Naive Bayes | 0.95 | 0.02 | 0 | 0.95 | 0.03 | 0 | 0.94 | 0.04 | 0 | 0.92 | 0.02 | 0 |
| Logistic | 0.96 | 0.02 | Base | 0.94 | 0.02 | Base | 0.96 | 0.02 | Base | 0.93 | 0.02 | Base |
| Multilayer Perceptron | 0.90 | 0.03 | 0 | 0.95 | 0.02 | 0 | 0.95 | 0.02 | 0 | 0.91 | 0.02 | 0 |
| SGD | 0.82 | 0.05 | 0 | 0.89 | 0.03 | −1 | 0.87 | 0.04 | −1 | 0.82 | 0.03 | −1 |
| Simple Logistic | 0.95 | 0.02 | 0 | 0.95 | 0.02 | 0 | 0.95 | 0.03 | 0 | 0.93 | 0.02 | 0 |
| SMO | 0.87 | 0.05 | 0 | 0.86 | 0.04 | −1 | 0.83 | 0.04 | −1 | 0.75 | 0.04 | −1 |
| Voted Perceptron | 0.95 | 0.04 | 0 | 0.91 | 0.03 | 0 | 0.93 | 0.03 | 0 | 0.79 | 0.03 | 0 |
| LazyIBk | 0.91 | 0.03 | Base | 0.88 | 0.04 | Base | 0.93 | 0.04 | Base | 0.88 | 0.03 | Base |
| lazy.Kstar | 0.97 | 0.02 | 1 | 0.97 | 0.02 | 1 | 0.94 | 0.03 | 0 | 0.84 | 0.02 | 0 |
| Decision Table | 0.89 | 0.03 | Base | 0.88 | 0.03 | Base | 0.88 | 0.04 | Base | 0.85 | 0.04 | Base |
| JRip | 0.82 | 0.04 | 0 | 0.81 | 0.04 | 0 | 0.80 | 0.05 | 0 | 0.80 | 0.04 | 0 |
| OneR | 0.79 | 0.04 | 0 | 0.74 | 0.04 | 0 | 0.79 | 0.05 | −1 | 0.80 | 0.04 | 0 |
| Part | 0.84 | 0.04 | 0 | 0.86 | 0.03 | 0 | 0.91 | 0.04 | 0 | 0.84 | 0.05 | 0 |
| Decision Stump | 0.82 | 0.03 | Base | 0.82 | 0.04 | Base | 0.81 | 0.04 | Base | 0.82 | 0.03 | Base |
| Hoeffding Tree | 0.93 | 0.05 | 1 | 0.90 | 0.02 | 0 | 0.91 | 0.02 | 0 | 0.88 | 0.04 | 0 |
| J48 | 0.86 | 0.05 | 0 | 0.83 | 0.04 | 0 | 0.93 | 0.04 | 0 | 0.78 | 0.04 | 0 |
| LMT | 0.95 | 0.02 | 1 | 0.91 | 0.02 | 1 | 0.95 | 0.03 | 1 | 0.93 | 0.02 | 1 |
| Random Forest | 0.96 | 0.02 | 1 | 0.94 | 0.02 | 1 | 0.91 | 0.02 | 1 | 0.89 | 0.02 | 1 |
| Random Tree | 0.77 | 0.05 | 0 | 0.80 | 0.04 | 0 | 0.81 | 0.05 | 0 | 0.80 | 0.04 | 0 |
| Rep Tree | 0.87 | 0.04 | 0 | 0.88 | 0.04 | 0 | 0.84 | 0.05 | 0 | 0.84 | 0.04 | 0 |
Figure 8Mean AUC and standard error bars of all four scenarios and combining rules of meta-algorithm.
Comparison of best meta-algorithm results from each scenario.
| Scenario | Combining rule | Acc, % | Sn | Sp | Attributes | AUC | Standard error |
|---|---|---|---|---|---|---|---|
| I | Avg of probabilities | 88.46 | 0.75 | 0.92 | 9 | 0.94 | 0.05 |
| II | Min probability | 89.72 | 0.83 | 0.92 | 16 | 0.96 | 0.04 |
| III | Max probability | 86.87 | 0.79 | 0.89 | 16 | 0.94 | 0.04 |
| IV | Max probability | 88.92 | 0.80 | 0.81 | 15 | 0.96 | 0.02 |
Comparison of classification results.
| Attributes | Acc, % | Sn | Sp | F1 | AUC | Classifier |
|---|---|---|---|---|---|---|
| Demographic, laboratory data, liver/spleen TE. Our method | 89.72 | 0.83 | 0.92 | 0.87 | 0.96 | Meta voting |
| Measuring hyaluronan and laminin in serum [ | — | 0.83 | 0.82 | 0.83 | — | Logistic regression |
| Albumin, INR, ALT [ | 77.00 | 0.93 | 0.37 | 0.53 | — | Predictive model |
| Demographic, laboratory data [ | — | 0.82 | 0.76 | 0.79 | 0.82 | Regression |
| Serum, liver TE [ | 80.00 | 0.82 | 0.83 | 0.82 | 0.84 | Neural network |
| Liver TE [ | 88.70 | 0.88 | 0.88 | 0.88 | 0.95 | Statistical |