| Literature DB >> 26880886 |
Somaya Hashem1, Gamal Esmat2, Wafaa Elakel2, Shahira Habashy3, Safaa Abdel Raouf1, Samar Darweesh4, Mohamad Soliman5, Mohamed Elhefnawi6, Mohamed El-Adawy3, Mahmoud ElHefnawi7.
Abstract
Background/Aim. Respectively with the prevalence of chronic hepatitis C in the world, using noninvasive methods as an alternative method in staging chronic liver diseases for avoiding the drawbacks of biopsy is significantly increasing. The aim of this study is to combine the serum biomarkers and clinical information to develop a classification model that can predict advanced liver fibrosis. Methods. 39,567 patients with chronic hepatitis C were included and randomly divided into two separate sets. Liver fibrosis was assessed via METAVIR score; patients were categorized as mild to moderate (F0-F2) or advanced (F3-F4) fibrosis stages. Two models were developed using alternating decision tree algorithm. Model 1 uses six parameters, while model 2 uses four, which are similar to FIB-4 features except alpha-fetoprotein instead of alanine aminotransferase. Sensitivity and receiver operating characteristic curve were performed to evaluate the performance of the proposed models. Results. The best model achieved 86.2% negative predictive value and 0.78 ROC with 84.8% accuracy which is better than FIB-4. Conclusions. The risk of advanced liver fibrosis, due to chronic hepatitis C, could be predicted with high accuracy using decision tree learning algorithm that could be used to reduce the need to assess the liver biopsy.Entities:
Year: 2016 PMID: 26880886 PMCID: PMC4736594 DOI: 10.1155/2016/2636390
Source DB: PubMed Journal: Gastroenterol Res Pract ISSN: 1687-6121 Impact factor: 2.260
Fibrosis records and strata in the data sets.
| Fibrosis stage | Training dataset ( | Test dataset ( |
|---|---|---|
| 0 | 34 | 42 |
| 1 | 11808 | 8337 |
| 2 | 7507 | 5821 |
| 3 | 3170 | 2513 |
| 4 | 171 | 164 |
| Total | 22690 | 16877 |
| Fibrosis strata | ||
| Mild to moderate (0–2) | 19349 | 14200 |
| Advanced (3-4) | 3341 | 2677 |
Characteristics of variables in coherent dataset.
| Characteristics | Training dataset 22690 | Validation dataset 16877 | Pearson correlation coefficients |
|
|---|---|---|---|---|
| Age (yrs) | 40 ± 11 | 40 ± 10 | 0.26 | <0.0001 |
| Gender | ||||
| Female | 6186 (27.3%) | 4555 (26.9%) | −0.03 | 0.008 |
| Male | 16504 (72.7%) | 12322 (73.1%) | ||
| BMI | 26.70 ± 3.79 | 26.79 ± 3.84 | 0.10 | <0.0001 |
| AFP (U/L) | 7.26 ± 26.61 | 7.69 ± 28.49 | 0.10 | <0.0001 |
| ALP (U/L) | 105.41 ± 65.17 | 105.41 ± 65.17 | 0.02 | 0.008 |
| AST (U/L) | 57.27 ± 33.73 | 56.78 ± 34.61 | 0.12 | <0.0001 |
| ALT (U/L) | 61.84 ± 36.89 | 61.84 ± 38.19 | 0.06 | 0.008 |
| Platelet count ( | 212.48 ± 60.64 | 211.55 ± 60.86 | −0.18 | <0.0001 |
| Albumin (g/dL) | 4.39 ± 0.42 | 4.40 ± 0.42 | −0.14 | <0.0001 |
| Indirect bilirubin (mg/dL) | 0.57 ± 1.77 | 0.60 ± 2.24 | −0.00 | 0.088 |
| Total bilirubin (mg/dL) | 0.76 ± 0.28 | 0.76 ± 0.28 | 0.05 | <0.0001 |
| Glucose (mg/dL) | 96.57 ± 19.41 | 96.69 ± 20.71 | 0.08 | <0.0001 |
| Hemoglobin (Hb) | 14.03 ± 1.47 | 14.03 ± 1.62 | −0.00 | 0.0005 |
| WBC (109/L) | 6.44 ± 1.90 | 6.44 ± 1.94 | −0.02 | 0.0001 |
Figure 1Decision tree diagrams. (a) Model 1. (b) Model 2. Advanced fibrosis is considered as the positive, referred to by symbol (adv), while moderate or mild fibrosis is considered as negative and referred to by symbol (m). The liver fibrosis of the patient is scored by summing all of the prediction nodes through which it passes.
Figure 2Flowchart of model 2. S represents the fibrosis score of the patient. If final S ≥ 0, then the patient has an advanced fibrosis and vice versa.
Accuracy and Roc analysis of model 1, model 2, and FIB-4 for predicting advanced fibrosis with criteria value zero.
| Model | Sensitivity % | Specificity % | PPV % | NPV % | ROC | Accuracy % |
|---|---|---|---|---|---|---|
| Model 1 | 15.1 | 97.9 | 55.2 | 87 | 0.78 | 85.7 |
| Model 2 | 17.5 | 97.5 | 54.8 | 87.3 | 0.78 | 85.7 |
| Model 1 | 14.4 | 97.9 | 56.6 | 85.9 | 0.78 | 84.7 |
| Model 2 | 17.4 | 97.5 | 56.6 | 86.2 | 0.78 | 84.8 |
| FIB-4 | 17.9 | 95.51 | 42.8 | 86.1 | 0.73 | 83.2 |
PPV: positive predictive value; NPP: negative predictive value; ROC: receiver operating characteristic curve.
Applying the model on the training set.
Applying the model on the test set.
Figure 3Comparison between the ROC curves of proposed ADTree Model 2 and FIB 4. It shows an improved AUROC for model 2.