| Literature DB >> 25888140 |
Bu-Yeo Kim1, Dong Wook Choi2,3, Seon Rang Woo4, Eun-Ran Park5,6, Je-Geun Lee7, Su-Hyeon Kim8, Imhoi Koo9, Sun-Hoo Park10, Chul Ju Han11,12, Sang Bum Kim13,14, Young Il Yeom15, Suk-Jin Yang16, Ami Yu17,18, Jae Won Lee19, Ja June Jang20, Myung-Haing Cho21, Won Kyung Jeon22, Young Nyun Park23, Kyung-Suk Suh24, Kee-Ho Lee25,26.
Abstract
BACKGROUND: Despite the recent identification of several prognostic gene signatures, the lack of common genes among experimental cohorts has posed a considerable challenge in uncovering the molecular basis underlying hepatocellular carcinoma (HCC) recurrence for application in clinical purposes. To overcome the limitations of individual gene-based analysis, we applied a pathway-based approach for analysis of HCC recurrence.Entities:
Mesh:
Year: 2015 PMID: 25888140 PMCID: PMC4448317 DOI: 10.1186/s12864-015-1472-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic illustration of the analysis strategy. The initial dataset was randomly divided into training and test sets. In a training set, genes below the threshold p-value in a pathway were subjected to PCA. Two models were constructed, specifically, one principal component model using the most significantly associated principal component and weighted model using multiple principal components, and evaluated in the test set. This procedure was repeated 1,000 times with random training and test sets. Finally, median values of statistics from test sets were measured to select significant pathways.
Figure 2Cluster analysis of recurrence-related genes. (A) Dendrogram of clustering pattern measured from the matrix of 209 recurrence-associated genes (Cox regression p-value < 0.01) from our HBV-HCC. Samples were classified into two subgroups: low-risk and high-risk based on recurrence outcome. The black bar indicates patients with early recurrence within 2 years after surgery. Columns represent individual samples, and rows genes. Red and green colors reflect high and low expression levels, respectively, as indicated by scale bars. (B) Kaplan-Meier plots for recurrence rates of the low- and high-risk subgroups. P-values were obtained using the log-rank test. (C) Overlap of recurrence-associated genes (Cox regression p-value < 0.01) among datasets.
Figure 3Recurrence rates in the HBV-HCC dataset. (A) Cumulative recurrence rate of HCC over time. (B) Recurrence rate of HCC per month over time.
Figure 4Determination of threshold p-value and comparison of recurrence-related features among the three HCC datasets. Starting with gene of the lowest univariate Cox p-value, permutation-based pathway analysis was applied by gradually increasing the p-value. The maximum average number of pathways below p-value of 0.01 in the test set was measured at threshold p-values of 0.11, 0.07 and 0.12 from (A) HBV-HCC, (B) public HBV-HCC and (C) public HCV-HCC, respectively, as indicated with arrows. (D) Overlap of recurrence-associated pathways obtained at threshold p-values among datasets. (E) Overlap of recurrence-associated genes at threshold p-values among datasets. (F) Distribution of genes below the threshold p-value from each dataset in 16 common significant pathways.
Figure 5Cluster analysis of recurrence-related pathways. (A) Dendrogram of the clustering pattern measured from the matrix of principal components of 64 recurrence-associated pathways (p-value < 0.01) from HBV-HCC. Samples were classified into two subgroups: low-risk and high-risk. The black bar indicates patients with early recurrence within 2 years after surgery. Columns represent individual samples, and rows pathways. Red and green colors reflect high and low levels of optimal principal component scores, respectively, as indicated by scale bars. (B) Kaplan-Meier plots for recurrence rates of low- and high-risk subgroups. P-values were obtained using the log-rank test. (C) Cross-validation of the low- and high-risk subgroups using six different algorithms: compound covariate (CC), diagonal linear discriminant (DLD), 1-nearest neighbor (1-NN), 3-nearest neighbor (3-NN), nearest centroid (NC) and support vector machine (SVM), implemented in BRB ArrayTools. (D) ROC curve computed with the compound covariate validation algorithm for subgroup classification. (E and F) Dendrogram of the clustering pattern measured from the matrix of principal components of public HBV-HCC (E) or public HCV-HCC (F) on the 64 pathways associated with recurrence in our HBV-HCC dataset.
Relationships between recurrence-associated pathways and clinicopathological variables in HBV-HCC
|
|
| ||||
|---|---|---|---|---|---|
|
| |||||
|
|
|
|
|
|
|
|
| |||||
| Gender (Male/Female) | 52/14 | 65/11 | 0.406 | 0.563 (0.320 0.991) |
|
| Age (<50 y/≥50 y) | 26/40 | 27/49 | 0.763 | 1.056 (0.706 1.577) | 0.790 |
| Platelet (<170 × 109/L/≥170 × 109/L) | 33/25 | 28/36 | 0.204 | 1.019 (0.675 1.538) | 0.928 |
| 4AST (<40 IU/L/≥40 IU/L) | 40/26 | 31/45 |
| 1.096 (0.745 1.613) | 0.639 |
| 4ALT (<40 IU/L/≥40 IU/L) | 41/25 | 33/43 |
| 1.003 (0.682 1.477) | 0.984 |
| Bilirubin (<1 mg/dL/≥1 mg/dL) | 44/22 | 42/34 | 0.224 | 1.050 (0.708 1.556) | 0.806 |
| 4AFP (<300 ng/mL /≥300 ng/mL) | 41/25 | 52/23 | 0.469 | 0.976 (0.644 1.480) | 0.912 |
| Child-Pugh (A/B,C) | 63/3 | 65/10 | 0.131 | 1.813 (0.967 3.401) | 0.0596 |
| UICC (I,II/III,IV) | 39/25 | 50/22 | 0.389 | 1.519 (1.015 2.273) |
|
| Size (<3 cm/≥3 cm) | 14/52 | 12/64 | 0.538 | 2.178 (1.216 3.901) |
|
| Number (Single/Multiple) | 56/10 | 64/10 | 0.972 | 1.874 (1.135 3.094) |
|
| Necrosis (No/Yes) | 33/33 | 25/50 | 0.0664 | 1.565 (1.048 2.339) |
|
| Edmondson’s grade (I,II/III,IV) | 45/19 | 45/27 | 0.435 | 1.273 (0.974 1.665) | 0.0764 |
| Vein invasion (Absent/Present) | 49/12 | 39/31 |
| 1.787 (1.175 2.718) |
|
| Lobular activity (No/Yes) | 39/25 | 18/56 |
| 1.770 (1.166 2.685) |
|
| Capsule (Absent/Present) | 47/19 | 64/11 | 0.0660 | 0.617 (0.374 1.017) | 0.05603 |
| Margin (<2 cm/≥2 cm) | 35/31 | 51/23 | 0.0794 | 0.836 (0.557 1.256) | 0.389 |
| Cirrhosis (Absent/Present) | 37/29 | 41/34 | 0.997 | 1.257 (0.853 1.852) | 0.245 |
| Recurrence-associated pathways | - | - | - | 2.843 (1.881 4.295) |
|
|
| |||||
| Recurrence-associated pathways | 3.220 (2.076 4.994) |
| |||
| Number (Single/Multiple) | 1.839 (1.095 3.089) |
| |||
| Size (<3 cm/≥3 cm) | 2.489 (1.288 4.808) |
| |||
| Edmondson’s grade (I,II/III,IV) | 1.769 (1.159 2.701) |
| |||
1 p-values were calculated using χ2 test.
2CI represents confidence interval.
3 p-values were calculated using the Cox Proportional Hazards Model.
4AST, aspartate aminotransferase; ALT, alanine aminotransferase; AFP, α-fetoprotein.
Bold data indicate statistically significant values (p<0.05).
Figure 6Stratification of small tumor size patients by recurrence-associated pathways. (A) Kaplan-Meier plot for recurrence rates in two subgroups of patients based on tumor size (< and >3 cm) in our HBV-HCC dataset. P-values were obtained using the log-rank test. (B-D) Application of recurrence-associated pathways to patients with small tumors (<3 cm) (B), Edmonson Steiner’s grade (I and II) (C) or single nodules (D) led to further stratification into two subgroups with different recurrence rates.