| Literature DB >> 26414287 |
Faryal Mehwish Awan1, Anam Naz1, Ayesha Obaid1, Amjad Ali1, Jamil Ahmad2, Sadia Anjum1, Hussnain Ahmed Janjua1.
Abstract
Hepatocellular carcinoma (HCC) is the world's third most widespread cancer. Currently available circulating biomarkers for this silently progressing malignancy are not sufficiently specific and sensitive to meet all clinical needs. There is an imminent and pressing need for the identification of novel circulating biomarkers to increase disease-free survival rate. In order to facilitate the selection of the most promising circulating protein biomarkers, we attempted to define an objective method likely to have a significant impact on the analysis of vast data generated from cutting-edge technologies. Current study exploits data available in seven publicly accessible gene and protein databases, unveiling 731 liver-specific proteins through initial enrichment analysis. Verification of expression profiles followed by integration of proteomic datasets, enriched for the cancer secretome, filtered out 20 proteins including 6 previously characterized circulating HCC biomarkers. Finally, interactome analysis of these proteins with midkine (MDK), dickkopf-1 (DKK-1), current standard HCC biomarker alpha-fetoprotein (AFP), its interacting partners in conjunction with HCC-specific circulating and liver deregulated miRNAs target filtration highlighted seven novel statistically significant putative biomarkers including complement component 8, alpha (C8A), mannose binding lectin (MBL2), antithrombin III (SERPINC1), 11β-hydroxysteroid dehydrogenase type 1 (HSD11B1), alcohol dehydrogenase 6 (ADH6), beta-ureidopropionase (UPB1) and cytochrome P450, family 2, subfamily A, polypeptide 6 (CYP2A6). Our proposed methodology provides a swift assortment process for biomarker prioritization that eventually reduces the economic burden of experimental evaluation. Further dedicated validation studies of potential putative biomarkers on HCC patient blood samples are warranted. We hope that the use of such integrative secretome, interactome and miRNAs target filtration approach will accelerate the selection of high-priority biomarkers for other diseases as well, that are more amenable to downstream clinical validation experiments.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26414287 PMCID: PMC4586137 DOI: 10.1371/journal.pone.0138913
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic outline of multi-step HCC circulating biomarkers prioritization process.
Liver-specific proteins extracted from various databases were screened using SignalP 4.1, SecretomeP 2.0, ExoCarta, TargetP 1.1 and TMHMM v. 2.0 servers to assess their secretory nature. Liver-specific secreted proteins once verified for their expression in liver (HPA and BioGPS) and blood (Plasma Proteome Database) were further prioritized depending upon their presence in secretome proteome of HCC patients, HCC cell lines and primary human hepatocytes. To infer possible involvement of prioritized proteins in HCC pathogenesis, their interactome analysis was done with AFP (as a standard biomarker for the diagnosis of HCC). Interacting proteins were then analysed for their interaction with HCC specific liver deregulated and circulating miRNA. Results were then statistically verified using SurvExpress validation tool to finally prioritize putative circulating biomarkers for HCC.
Fig 2Identification of liver-specific secreted proteins.
Liver-specific secreted proteins identified using seven publicly available gene and protein databases. Databases based on microarray data (TiSGeD, BioGPS and VeryGene) unveiled 845; ESTs data (TiGER, UniGene and C-It) revealed 473 and HPA database based on immunohistochemistry data revealed 69 liver-specific proteins. A total of 272 proteins were identified in two or more than two databases and thus selected for further analysis.
Total number of liver-specific proteins identified in gene and protein databases.
| Parameters | Liver Specific |
|---|---|
| Total number of proteins identified | 731 |
| (in ≥ two databases) | 272 |
|
| |
| One database | 459 |
| Two databases | 77 |
| Three databases | 81 |
| Four databases | 64 |
| Five databases | 26 |
| Six databases | 24 |
Liver-specific secreted/shed proteins identified by each database utilized in this study.
| Gene | TiSGeD | TiGER | UniGene | C-it | VeryGene | BioGPS | HPA | Reference |
|---|---|---|---|---|---|---|---|---|
| ADH6 | √ | √ | √ | Secretome of primary human hepatocytes [ | ||||
| ANG | √ | √ | √ | √ | Previously studied as biomarker [ | |||
| APOA5 | √ | √ | √ | √ | Secretome of HCC cell line [ | |||
| APOC3 | √ | √ | √ | √ | Secretome of primary human hepatocytes [ | |||
| APOC4 | √ | √ | √ | √ | √ | √ | ||
| APOF | √ | √ | √ | √ | √ | |||
| ASL | √ | √ | Secretome of primary human hepatocytes [ | |||||
| C4A | √ | √ | √ | Previously studied as biomarker [ | ||||
| C8A | √ | √ | √ | √ | √ | √ | Secretome of HCC (Hep3B, serum) [ | |
| CFHR4 | √ | √ | √ | √ | Previously studied as biomarker [ | |||
| CYP2A6 | √ | √ | √ | √ | √ | √ | Secretome of primary human hepatocytes [ | |
| CYP2A7 | √ | √ | √ | √ | ||||
| CYP2C18 | √ | √ | √ | |||||
| CYP2E1 | √ | √ | √ | √ | √ | √ | ||
| CYP4A22 | √ | √ | √ | |||||
| F10 | √ | √ | √ | Secretome of HCC cell lines [ | ||||
| F9 | √ | √ | √ | √ | √ | |||
| FCN2 | √ | √ | √ | |||||
| GCKR | √ | √ | √ | |||||
| GSTM1 | √ | √ | √ | Normal and HCC liver tissue [ | ||||
| HAMP | √ | √ | √ | |||||
| HP | √ | √ | √ | √ | √ | √ | Previously studied as biomarker [ | |
| HRG | √ | √ | √ | √ | √ | √ | Previously studied as biomarker [ | |
| HSD11B1 | √ | √ | √ | Secretome of primary human hepatocytes [ | ||||
| ITIH4 | √ | √ | √ | √ | √ | Previously studied as biomarker [ | ||
| MBL2 | √ | √ | √ | √ | Secretome of HCC cell lines (Hep3B, HepG2) [ | |||
| NR0B2 | √ | √ | √ | |||||
| PLGLB2 | √ | √ | √ | √ | √ | |||
| PON3 | √ | √ | √ | √ | ||||
| RDH16 | √ | √ | √ | √ | √ | Secretome of primary human hepatocytes [ | ||
| SERPINC1 | √ | √ | √ | √ | √ | √ | Secretome of HCC serum [ | |
| SLC25A47 | √ | √ | √ | √ | √ | |||
| SLC27A5 | √ | √ | √ | √ | ||||
| SPP2 | √ | √ | √ | √ | √ | √ | ||
| TFR2 | √ | √ | √ | Secretome of HCC (Cell line Hep3B) [ | ||||
| TMPRSS6 | √ | √ | √ | |||||
| UPB1 | √ | √ | √ | √ | Secretome of primary human hepatocytes [ | |||
| VTN | √ | √ | √ | √ | Previously studied as biomarker [ |
Fig 3Performance and accuracy evaluation (%) of databases.
3A. Graphical representation of databases performance has been shown in percentages. BioGPS database revealed 97%, VeryGene database 92%, TiSGeD database 76%, TiGER database 66%, UniGene database 37%, C-It database 5% and the HPA unveiling 45% performance for the identification of liver-specific protein biomarkers. Performance % was calculated by dividing number of proteins identified by each database to total number of proteins that passed the filtering criteria. 3B. Graphical representation of accuracy of the initial protein identifications with HPA database showing the highest accuracy of 25%, VeryGene database showing 8% accuracy, TiSGeD database showing 15%, TiGER database showing 8%, UniGene database showing 19%, C-It showing 2% and BioGPS database showing 20% accuracy. The accuracy was calculated by dividing number of proteins that had passed the filtering criteria by each database to the total number of proteins each database initially identified.
Fig 4Interactome network analysis (protein-protein).
Interactome analysis of candidate proteins with current standard HCC biomarker AFP was retrieved by tools: GeneMANIA (A & B), STRING (C & D). Interacting partners of AFP (TP53, FOXA1, FOXA3, GPC3, IGFBP1, NR3C1, F2, AHSG, ACTL6A and JUN) along with DKK1 and MDK are also main elements of the interactome. The size of the gray nodes in Fig 4A and 4B represents the degree of association with the input genes (i.e., smaller size represents less association).
Fig 5Interactome network analysis (miRNA-gene).
Interactome network analysis of HCC-specific circulating miRNAs and genes encoding candidate protein biomarkers (retrieved using miRWalk, miRTarBase, TargetScan and microRNA.org) were visualized using Cytoscape software. The red colored circles represent seven final prioritized candidate marker proteins in our study.
Seven statistically significant putative HCC specific biomarkers prioritized through integrated in-silico approach.
| Gene | Biological functions | HCC-specific deregulated miRNAs (Liver) (Circulating) | |
|---|---|---|---|
| ADH6 | Metabolism of xenobiotics by cytochrome P450, drug metabolism, glycolysis/gluconeogenesis, tyrosine metabolism, metabolic pathways, retinol metabolism and fatty acid metabolism. | hsa-miR-182, hsa-miR-185, hsa-miR-203, hsa-miR-199a-5p, hsa-miR-199b-5p, hsa-miR-146a, hsa-miR-211, hsa-miR-150 | hsa-miR-199a-5p, hsa-miR-146a, hsa-miR-150 |
| UPB1 | Beta alanine metabolism, metabolic pathway, pantotheanate and CoA biosynthesis, drug metabolism-other enzymes and pyrimidine metabolism. | hsa-miR-216a, hsa-miR-181c, hsa-miR-181a, hsa-miR-181b, hsa-miR-134, hsa-let-7e, hsa-let-7b, hsa-let-7a, hsa-let-7c, hsa-let-7f, hsa-let-7g, hsa-let-7d, hsa-miR-224 | hsa-let-7f, hsa-let-7c, hsa-miR-224 |
| C8A | Complement and coagulation cascades, prion disease, systemic lupus erythematosus and amoebiasis. | hsa-miR-212, hsa-miR-132, hsa-miR-93, hsa-miR-106a, hsa-miR-106b, hsa-miR-17, hsa-miR-20a, hsa-miR-302b, hsa-miR-26a, hsa-miR-26b, hsa-miR-145, hsa-miR-148a, hsa-miR-148b, hsa-miR-152, hsa-miR-186, hsa-miR-129-5p | hsa-miR-93, hsa-miR-17, hsa-miR-520a-3p, hsa-miR-520b, hsa-miR-26a, |
| HSD11B1 | Steroid hormone biosynthesis, metabolic pathways, and in aldosterone-regulated sodium reabsorption. | hsa-miR-181c, hsa-miR-181a, hsa-miR-181b, hsa-miR-374a, hsa-miR-374b, hsa-miR-192, hsa-miR-215, hsa-miR-23a, hsa-miR-23b, hsa-miR-132, hsa-miR-212, hsa-miR-26a, hsa-miR-26b, hsa-mir-122, hsa-mir-125a, hsa-mir-125b-1, hsa-mir-125b-2, hsa-mir-145, hsa-mir-222 | hsa-miR-192, hsa-miR-215, hsa-miR-23a, hsa-miR-23b, hsa-miR-26a, hsa-mir-122, hsa-mir-222 |
| MBL2 | Complement and coagulation cascades, acute-phase response, classical pathway, negative regulation of viral process, opsonization, positive regulation of phagocytosis. | hsa-miR-320c, hsa-miR-374b, hsa-miR-374a, hsa-miR-186, hsa-miR-200b, hsa-miR-301a, hsa-miR-301b, hsa-miR-137, hsa-miR-23a, hsa-miR-23b, hsa-miR-206, hsa-miR-216b, hsa-miR-30a, hsa-miR-30e, hsa-miR-30c, hsa-miR-146a, hsa-miR-190b, hsa-miR-190, hsa-miR-130a, hsa-miR-130b, hsa-miR-148a, hsa-miR-148b, hsa-miR-152, hsa-miR-145, hsa-miR-196a, hsa-miR-216a, hsa-miR-1, hsa-let-7a-2, hsa-let-7a-3, hsa-let-7a-1, hsa-let-7b, hsa-let-7c, hsa-let-7d, hsa-let-7e, hsa-let-7f-2, hsa-let-7g, hsa-mir-1-2, hsa-mir-1-1, hsa-mir-10a, hsa-mir-125b-1, hsa-mir-125b-2, hsa-mir-15a, hsa-mir-16-1, hsa-mir-16-2, hsa-mir-7-1, hsa-mir-7-2, hsa-mir-7-3, hsa-mir-99a | hsa-miR-23a, hsa-miR-23b, hsa-miR-30c, hsa-miR-146a, hsa-miR-130b, hsa-miR-1, hsa-miR-206, hsa-let-7f-2, has-let-7c, hsa-mir-16-1, hsa-mir-16-2 |
| SERPINC1 | Complement and coagulation cascades, neuroactive ligand receptor interaction and regulation of actin cytoskeleton. | hsa-miR-186, hsa-miR-19a, hsa-miR-19b, hsa-miR-143, hsa-miR-7 | hsa-miR-7 |
| CYP2A6 | Drug metabolic process, epoxygenase P450 pathway, exogenous drug catabolic process, oxidation-reduction process, small molecule metabolic process, steroid metabolic process, xenobiotic metabolic process. | hsa-mir-101-1, hsa-mir-101-2, hsa-mir-126, hsa-mir-199a-1, hsa-mir-199a-2, hsa-mir-199b, hsa-mir-34a | hsa-mir-199a-1, hsa-mir-199a-2 |
Fig 6Comparison of Kaplan-Meier curves of the current standard HCC biomarker (AFP) and candidate seven circulating biomarkers (C8A, MBL2, SERPINC1, HSD11B1, ADH6, UPB1, CYP2A6).
SurvExpress analysis showed the results from liver hepatocellular carcinoma dataset using TCGA RNASeq platform of SurvExpress. A shows the Kaplan-Meier curve for risk groups, concordance index, and P-value of the log-rank testing equality of survival curves for AFP. B shows the Kaplan-Meier curve for risk groups, concordance index, and P-value of the log-rank testing equality of survival curves for C8A, MBL2, SERPINC1, HSD11B1, ADH6, UPB1 and CYP2A6.
Fig 7Receiver operating characteristic (ROC) analysis of sensitivity and specificity by proposed seven candidate biomarkers and AFP in predicting disease-free survival (DFS).
The score performance was assessed by calculating the area under the ROC (AUROC) which was 0.861 (KM method) and 0.854 (NNE method), respectively for proposed candidate biomarkers while for AFP; AUROC was 0.354 (KM method) and 0.5 (NNE method) respectively.
Fig 8Relapse-free survival and ROC curve analysis.
Proposed candidate biomarkers better predicted relapse-free survival (p = 0.01191) (A) as compared to AFP (P = 0.1987) (B). With respect to the discriminating ability of proposed biomarkers, long rank equal curve showed statistically significant p-value (<0.05) for HCC p = 0.02361 while for cirrhotic liver p-value was 0.1985 (which not significant) (Fig 9A and 9B).
Fig 9ROC curve analysis of proposed candidate biomarkers in HCC and cirrhotic datasets.
With respect to HCC, the candidate biomarkers showed statistically significant relation (p = 0.02824) (A) while for cirrhosis there was no significant correlation (p = 0.1985) (B).