| Literature DB >> 35765099 |
Jian-An Jia1,2, Shuqin Zhang3, Xin Bai4, Meng Fang1, Shipeng Chen1, Xiaotao Liang5, Shanfeng Zhu5,6, Danny Ka-Ho Wong7, Anye Zhang8, Jianfeng Feng6,9, Fengzhu Sun10, Chunfang Gao11,12.
Abstract
BACKGROUND: Chronic infection with hepatitis B virus (HBV) has been proved highly associated with the development of hepatocellular carcinoma (HCC). AIMS: The purpose of the study is to investigate the association between HBV preS region quasispecies and HCC development, as well as to develop HCC diagnosis model using HBV preS region quasispecies.Entities:
Keywords: HBV quasispecies; Hepatocellular carcinoma; Next generation sequencing (NGS); PreS region; Sparse logistic regression (SLR)
Mesh:
Substances:
Year: 2022 PMID: 35765099 PMCID: PMC9238101 DOI: 10.1186/s12985-022-01836-9
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 5.913
Demographics and baseline laboratory markers of training cohort and test cohort
| Training cohort | Test cohort | |||
|---|---|---|---|---|
| CHB (n = 63) | HCC (n = 46) | CHB (n = 41) | HCC (n = 71) | |
| Female | 21 (33.3%) | 6 (13.6%) | 18 (43.9%) | 13 (18.3%) |
| Male | 42 (66.7%) | 38 (86.4%) | 22 (53.7%) | 58 (81.7%) |
| Age | 36.6 ± 13.8 | 52.4 ± 8.3 | 40.1 ± 11.9 | 51.1 ± 10.2 |
| Serum HBV DNA | 7.48 | 5.02 | 6.24 | 5.32 |
| (log10 IU/ml) | (6.50–7.83) | (4.43–6.25) | (4.89–7.65) | (4.80–5.85) |
| TBIL (μmol/L) | 13 | 14.2 | 11 | 14.5 |
| (10.0–17.0) | (11.1–19.8) | (8.7–15.7) | (11.5–19.4) | |
| DBIL (μmol/L) | 3 | 5.5 | 4 | 5.7 |
| (3.0–5.0) | (4.0–7.7) | (3.0–6.0) | (4.4–7.5) | |
| TP (g/L) | 74 | 68.8 | 75 | 67.8 |
| (71.5–79.0) | (64.4–71.1) | (70.0–78.0) | (64.1–72.7) | |
| ALB (g/L) | 45.5 | 39.8 | 45 | 40.9 |
| (41.0–49.7) | (37.0–42.5) | (43.0–48.5) | (38.0–43.9) | |
| ALT (U/L) | 38 | 43.4 | 51 | 40.5 |
| (19.0–66.0) | (24.7–78.4) | (29.5–77.0) | (33.0–71.0) | |
| AST (U/L) | 30 | 46 | 33 | 40 |
| (21.0–45.0) | (25.0–84.5) | (26.5–54.2) | (31.0–64.0) | |
| GGT (U/L) | 22 | 81 | 24 | 78 |
| (15.0–51.0) | (41.3–156.0) | (18.0–43.2) | (50.0–121.5) | |
| ALP (U/L) | 71.5 | 92 | 72 | 92 |
| (59.299.7) | (69.25–131.5) | (56.0–82.5) | (73.5–112.0) | |
| Negative | 17 (40.5%) | 36 (85.7%) | 38 (97.4%) | 64 (95.5%) |
| Postive | 25 (59.5%) | 6 (14.3%) | 1 (2.6%) | 3 (4.5%) |
| Negative | 31 (73.8%) | 25 (59.5%) | 8 (20.5%) | 41 (61.2%) |
| Postive | 11 (26.2%) | 17 (40.5%) | 31 (79.5%) | 26 (38.8%) |
| Negative | 33 (78.6%) | 10 (23.8%) | 31 (79.5%) | 18 (26.9%) |
| Postive | 9 (21.4%) | 32 (76.2%) | 8 (20.5%) | 49 (73.1%) |
| B | 26(41.2%) | 8(17.4%) | 15(36.6%) | 12(16.9%) |
| C | 37(58.7%) | 36(78.3%) | 26(63.4%) | 59(83.1%) |
| Tumor size(cm) | 5.6(3.32–9.40) | 11(7.8–14.5) | ||
| Intact | 5 (11.4%) | 10 (15.6%) | ||
| None | 5 (11.4%) | 11 (17.2%) | ||
| Partial | 34 (77.3%) | 43 (67.2%) | ||
| 1 | 32 (72.7%) | 58 (90.6%) | ||
| 2 | 0 (0.0%) | 3 (4.7%) | ||
| 3 | 12 (27.3%) | 3 (4.7%) | ||
| None | 23 (52.3%) | 45(63.3%) | ||
| Yes | 21 (47.7%) | 19 (29.7%) | ||
| AFP(ng/ml) | 565 (29.1–15,375.0) | 187 (7.83–1210.0) | ||
AFP Alpha-fetoprotein, ALB Albumin, ALP Alkaline phosphatase, ALT Alanine aminotransferase AST Aspartate aminotransferase CHB Chronic hepatitis B, DBIL Direct bilirubin, GGT γ-glutamyltransferase; HBV Hepatitis B virus, HBeAg Hepatitis B e antigen, HBeAb Hepatitis B e antibody, HBsAb Hepatitis B antibody, HCC Hepatocellular carcinoma, TBIL Total bilirubin, TP Total protein, PVTT Portal veint umor thrombus
Fig. 1HBV preS region nucleotide entropy of CHB and HCC patients. A. Heat map show the nucleotide entropy in preS region of CHB and HCC patients. B. Comparison of nucleotide entropy in preS, preS1 and preS2 region between in CHB and HCC patients. C. Nucleotide points in preS region with different entropy between in CHB and HCC patients were described by volcano plot filtering. Entropy of all the nucleotide points in preS region were compared between in CHB and HCC patients. After logarithm, the p-values were presented in the y-lab direction. In the x-lab direction, the relative entropy ratios of CHB and HCC patients were also log-transformed and presented. Red spots represented nucleotide points with higher entropy in CHB patients, of which p-values and fold changes were upon specific threshhold. The green spots mean nucleotide points with opposite conditions
Fig. 2Classification results for HCC/CHB patients using SLR with different parameters. Classification results (accuracy, AUC, sensitivity and specificity) for all HCC/CHB patients as a function of penalization parameter λ in sparse logistic regression. The vertical lines show one SD in the CV studies
Classification results for HCC/CHB using LR, SLR, SVM and SSVM
| LR | SLR (λ = 0.53) | SVM | SSVM (λ = 0.55) | ||
|---|---|---|---|---|---|
| Test set | Training set | Test set | Test set | Test set | |
| Accuracy | 0.688 | 0.861 (0.032) | 0.794 | 0.777 | 0.679 |
| Sensitivity | 0.718 | 0.809 (0.051) | 0.803 | 0.775 | 0.662 |
| Specificity | 0.634 | 0.898 (0.019) | 0.780 | 0.780 | 0.707 |
| AUC | 0.644 | 0.883 (0.043) | 0.795 | 0.836 | 0.685 |
LR: We applied the trained model in training set to the test set (Test set). SLR: We did cross validation within the training set (Training set), and applied the trained model to the test set (Test set). SVM: We applied the trained model in training set to the test set (Test set). SSVM: We did cross validation within the training set, and applied the trained model to the test set (Test set)
AUC Area under the receiver operating characteristic curve; CHB Chronic hepatitis B; HCC Hepatocellular carcinoma; LR Logistic regression; SLR Sparse logistic regression; SVM Support vector machine; SSVM Sparse support vector machine
Classification results for HCC/CHB from Hong Kong dataset using SLR
| Training results | Test results | |
|---|---|---|
| Accuracy | 0.822 (0.031) | 0.694 |
| Sensitivity | 0.717 (0.071) | 0.500 |
| Specificity | 0.931 (0.053) | 0.913 |
| AUC | 0.724 (0.054) | 0.607 |
Training results: we did cross validation within the Hong Kong dataset using SLR with λ = 0.56. Test results: we applied the model trained in Shanghai dataset using sparse logistic regression and took Hong Kong dataset as the test set
AUC Area under the receiver operating characteristic curve; CHB Chronic hepatitis B; HCC Hepatocellular carcinoma; SLR Sparse logistic regression
Fig. 3Comparison of entropy of the ten nucleotide points of HBV preS region selected by SLR model. Ten nucleotide points of HBV preS region were selected by SLR model for CHB/HCC classification and entropy between in CHB and HCC patients were compared. All of the ten nucleotide points showed significant divergence between in CHB and HCC patients
Fig. 4Classification results for HCC/CHB patients using SLR with different sample size. The vertical lines show one SD in the cross validation studies
Classification results for HCC/CHB patients of different HBV genotypes using SLR
| Genotype B (λ = 0.58) | Genotype C | Genotype C (λ = 0.53) | Genotype B | |||
|---|---|---|---|---|---|---|
| Training set | Test set* | Test set# | Training set | Test set* | Test set# | |
| Accuracy | 0.924(0.028) | 0.778 | 0.741 | 0.870 (0.023) | 0.776 | 0.741 |
| Sensitivity | 0.678(0.122) | 0.583 | 0.915 | 0.840 (0.064) | 0.864 | 0.500 |
| Specificity | 0.999(0.006) | 0.933 | 0.346 | 0.890 (0.036) | 0.577 | 0.933 |
| AUC | 0.861(0.061) | 0.706 | 0.754 | 0.843 (0.027) | 0.767 | 0.722 |
*Test results were produced using SLR model trained by patients with the same genotype of HBV
#Test results were produced using SLR model trained by patients with the other genotype of HBV
AUC Area under the receiver operating characteristic curve; CHB Chronic hepatitis B; HCC Hepatocellular carcinoma; SLR Sparse logistic regression
Classification results for clinical parameters using SLR in HCC patients
| HBeAg | HBV DNA | ALP | ||||
|---|---|---|---|---|---|---|
| Training set | Test set | Training set | Test set | Training set | Test set | |
| Accuracy | 0.882 (0.026) | 0.672 | 0.791(0.038) | 0.676 | 0.782 (0.037) | 0.634 |
| Sensitivity | 0.929 (0.042) | 0.805 | 0.757(0.084) | 0.667 | 0.712 (0.082) | 0.703 |
| Specificity | 0.920 (0.056) | 0.462 | 0.833(0.074) | 0.690 | 0.868 (0.058) | 0.559 |
| AUC | 0.882 (0.051) | 0.602 | 0.697(0.052) | 0.675 | 0.688 (0.048) | 0.648 |
AUC Area under the receiver operating characteristic curve; HCC Hepatocellular carcinoma; SLR Sparse logistic regression; ALP Alkaline phosphatase; HBV Hepatitis B virus; HBeAg Hepatitis B e antigen
Fig. 5Nucleotide and amino acid mapping for the identified mutation points. Functions of ten nucleotide points of HBV preS region selected by SLR model for CHB/HCC classification are also denoted