| Literature DB >> 31425520 |
Kassaporn Duangkumpha1,2, Thomas Stoll3, Jutarop Phetcharaburanin1,2, Puangrat Yongvanit2, Raynoo Thanan1, Anchalee Techasen2,4, Nisana Namwat1,2, Narong Khuntikeo2,5, Nittaya Chamadol2,6, Sittiruk Roytrakul7, Jason Mulvenna3, Ahmed Mohamed3, Alok K Shah3, Michelle M Hill3, Watcharin Loilome1,2.
Abstract
Cholangiocarcinoma (CCA) is a primary malignant tumor of the epithelial lining of biliary track associated with endemic Opisthorchis viverrini (Ov) infection in northeastern Thailand. Ov-associated periductal fibrosis (PDF) is the precancerous lesion for CCA, and can be detected by ultrasonography (US) to facilitate early detection. However, US cannot be used to distinguish PDF from cancer. Therefore, the objective of this study was to discover and qualify potential urine biomarkers for CCA detection in at-risk population. Biomarker discovery was conducted on pooled urine samples, 42 patients per group, with PDF or normal bile duct confirmed by ultrasound. After depletion of high abundance proteins, 338 urinary proteins were identified from the 3 samples (normal-US, PDF-US, CCA). Based on fold change and literature review, 70 candidate proteins were selected for qualification by multiple reaction monitoring mass spectrometry (MRM-MS) in 90 individual urine samples, 30 per group. An orthogonal signal correction projection to latent structures discriminant analysis (O-PLS-DA) multivariate model constructed from the 70 candidate biomarkers significantly discriminated CCA from normal and PDF groups (P = 0.003). As an independent validation, the expression of 3 candidate proteins was confirmed by immunohistochemistry in CCA tissues: Lysosome associated membrane glycoprotein 1 (LAMP1), lysosome associated membrane glycoprotein 2 (LAMP2) and cadherin-related family member 2 (CDHR2). Further evaluation of these candidate biomarkers in a larger cohort is needed to support their applicability in a clinical setting for screening and monitoring early CCA and for CCA surveillance.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31425520 PMCID: PMC6699711 DOI: 10.1371/journal.pone.0221024
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The workflow for urinary protein biomarker discovery.
(A) urine samples were collected from participants and stored at -8000b0c until analysis. (B) urine samples selected for study were pooled, concentrated and depleted for highly-abundant protein removal using an immunodepletion kit, followed by in-solution tryptic digestion and tandem mass spectrometry. Protein identification and database searching were conducted using MaxQuant software (version 1.6.1.0). (C) Biomarker qualification for selected candidates used a custom MRM-MS assay. Data processing was performed using the Skyline and R programs. The statistical analysis software “SIMCA” and “SPSS" were used to perform the univariate/multivariate statistical analysis. (D) Immunohistochemical staining techniques were used to validate potential biomarkers on TMA.
The clinical and demographic information on all subjects for the discovery and qualification groups.
| Discovery phase | Qualification phase | |||||||
|---|---|---|---|---|---|---|---|---|
| Normal US | PDF US | CCA | Normal US | PDF US | CCA | |||
| Sample size(n) | 42 | 42 | 42 | 30 | 30 | 30 | ||
| 1.000 | 1.000 | |||||||
| Male | 31 (74%) | 31 (74%) | 31 (74%) | 19 (63%) | 19 (63%) | 19 (63%) | ||
| Female | 11 (26%) | 11 (26%) | 11 (26%) | 11 (37%) | 11 (37%) | 11 (37%) | ||
| Median±SD | 63±5 | 62±6 | 65±6 | 63±5 | 62±4 | 64±7 | ||
| (Age range) | (55–73) | (55–77) | (43–76) | (55–73) | (55–69) | (43–76) | ||
| Positive | 8 (19%) | 7 (17%) | 1 (2%) | 5 (17%) | 6 (20%) | |||
| Negative | 34 (81%) | 35 (83%) | 41 (98%) | 24 (80%) | 23 (77%) | 0 | ||
| Unknown | 1 (3%) | 1 (3%) | 30 (100%) | |||||
| Positive | 14 (33%) | 12 (29%) | 27 (64%) | 9 (30%) | 7 (23%) | 17 (57%) | ||
| Negative | 23 (55%) | 26 (62%) | 13 (31%) | 17 (57%) | 19 (63%) | 11 (37%) | ||
| Unknown | 5 (12%) | 4 (9%) | 2 (5%) | 4 (13%) | 4 (14%) | 2 (6%) | ||
| Positive | 13 (31%) | 11 (26%) | 33 (79%) | 9 (30%) | 7 (23%) | 21 (70%) | ||
| Negative | 24 (57%) | 27 (64%) | 7 (17%) | 17 (57%) | 19 (63%) | 7 (23%) | ||
| Unknown | 5 (12%) | 4 (10%) | 2 (4%) | 4 (13%) | 4 (14%) | 2 (7%) | ||
| Non-papillary | 26 (70%) | 18 (60%) | ||||||
| Papillary | 16 (30%) | 12 (40%) | ||||||
| No | 22 (52%) | 16 (53%) | ||||||
| Yes | 20 (48%) | 14 (47%) | ||||||
NOTE
‡indicating there are missing data.
“” indicating P-value were calculated using Pearson’s chi-square test.
Fig 2Urinary biomarker discovery.
Venn diagram represents the overlap among protein sample types with 338 identified proteins.
Fig 3Biomarker qualification and multivariate analysis.
(A) and (B) PCA and O-PLS-DA score plots of MRM results of urinary candidates that show sample differentiation; normal pathology (green), PDF (blue) and CCA group (red).
The correlation coefficient values of multivariate analysis and the differences of univariate analysis of all qualified urine protein candidates using O-PLS-DA analysis.
| (+)PDF vs (-)N | (+)N vs (-)CCA | (+)PDF vs (-)CCA | |||||||
| R2X = 53.3% | R2X = 49.01% | R2X = 60.6% | |||||||
| Q2Y = 7.8% | Q2Y = 35.2% | Q2Y = 47.5% | |||||||
| CV-ANOVA | |||||||||
| 1 | QSOX1 | O00391 | Sulfhydryl oxidase 1 | +0.8262 | +0.4494 | +0.8618 | |||
| 2 | CUBN | O60494 | Cubilin | +0.8675 | +0.2862 | +0.7742 | |||
| 3 | ALDOB | P05062 | Fructose-bisphosphate aldolase B | +0.6070 | +0.6524 | ||||
| 4 | LDHB | P07195 | L-lactate dehydrogenase B chain | +0.5499 | +0.2669 | +0.5975 | |||
| 5 | CTSD | P07339 | Cathepsin D | +0.4565 | +0.6003 | +0.6639 | |||
| 6 | HEXB | P07686 | Beta-hexosaminidase subunit beta | +0.5276 | +0.4629 | ||||
| 7 | CTSB | P07858 | Cathepsin B | +0.4799 | |||||
| 8 | NGFR | P08138 | Tumor necrosis factor receptor superfamily member 16 | +0.7148 | +0.5093 | ||||
| 9 | CSF1 | P09603 | Macrophage colony-stimulating factor 1 | +0.7236 | +0.7864 | ||||
| 10 | LAMP1 | P11279 | Lysosome-associated membrane glycoprotein 1 | +0.7553 | -0.3105 | +0.4459 | |||
| 11 | CDH1 | P12830 | Cadherin-1 | +0.7765 | +0.4845 | ||||
| 12 | LAMP2 | P13473 | Lysosome-associated membrane glycoprotein 2 | +0.5142 | -0.2945 | +0.4170 | |||
| 13 | ANPEP | P15144 | Aminopeptidase N | +0.6536 | +0.4812 | +0.6203 | |||
| (+)PDF vs (-)N | (+)N vs (-)CCA | (+)PDF vs (-)CCA | |||||||
| R2X = 53.3% | R2X = 49.01% | R2X = 60.6% | |||||||
| Q2Y = 7.8% | Q2Y = 35.2% | Q2Y = 47.5% | |||||||
| CV-ANOVA | |||||||||
| 14 | ARSA | P15289 | Arylsulfatase A | +0.6561 | +0.3088 | +0.6857 | |||
| 15 | GNS | P15586 | N-acetylglucosamine-6-sulfatase | +0.6074 | +0.4606 | +0.6407 | |||
| 16 | CDH2 | P19022 | Cadherin-2 | +0.3771 | +0.3688 | +0.4775 | |||
| 17 | SIRPA | P78324 | Tyrosine-protein phosphatase non-receptor type substrate 1 | +0.7258 | +0.7130 | ||||
| 18 | DPT | Q07507 | Dermatopontin | +0.5355 | +0.7207 | ||||
| 19 | LGALS3BP | Q08380 | Galectin-3-binding protein | +0.5655 | +0.5785 | ||||
| 20 | ASAH1 | Q13510 | Acid ceramidase | +0.6289 | +0.6086 | ||||
| 21 | HAVCR2 | Q8TDQ0 | Hepatitis A virus cellular receptor 2 | +0.7361 | +0.6285 | ||||
| 22 | PVRL2 | Q92692 | Nectin-2 | +0.5745 | +0.5347 | +0.7271 | |||
| 23 | GGH | Q92820 | Gamma-glutamyl hydrolase | +0.6948 | +0.5754 | ||||
| +0.6854 | |||||||||
| 25 | WNK1 | Q9H4A3 | Serine/threonine-protein kinase WNK1 | +0.5186 | +0.6782 | +0.7182 | |||
| 26 | SIAE | Q9HAT2 | Sialate O-acetylesterase | +0.7384 | +0.6408 | ||||
| 27 | RETN | Q9HD89 | Resistin | +0.5125 | +0.3340 | ||||
NOTE: = P-values were calculated using a Mann-Whitney U -test for pairwise group comparison.
* P<0.05
** P<0.01
*** P<0.001
**** P<0.0001.
The p(corr) value is a correlation coefficient (ranging from -1.0 to 1.0) for each model.
“+” or “–”indicates higher correlation in either group of O-PLS-DA pairwise comparison models.
The P-value of all O-PLS-DA models was derived from permutation tests (n = 500).
Fig 4The protein interaction network analysis of 27 significant candidate proteins using the STRING database (http://string-db.org).
A; two main cellular component pathways were identified in this study: lysosome biogenesis (blue circle) and membrane part (red circle). Each node lists the gene name of the candidate according to protein ID from Table 2. The different intensity of lines represents the protein association of confidence.
Fig 5Immunohistochemical staining of three candidate proteins.
LAMP1, LAMP2 and CDHR2, was performed on cadaveric donor liver tissues (the first column) and human CCA microtissue arrays which demonstrated low and high expression (the second and third column). The red arrows indicate the positive of LAMP1 and LAMP2 expression at the luminal surface (red arrows).
Clinico-pathological data and urinary candidate expression.
| LAMP1 expression | LAMP2 expression | CDHR2 expression | |||||||
|---|---|---|---|---|---|---|---|---|---|
| L | H | L | H | L | H | ||||
| 89 | 121 | 85 | 124 | 99 | 114 | ||||
| 0.656 | 0.183 | 67 | 75 | 0.884 | |||||
| Male | 58 | 83 | 52 | 87 | 67 | 75 | |||
| Female | 31 | 38 | 33 | 37 | 32 | 39 | |||
| 0.405 | 0.674 | 0.585 | |||||||
| <61 | 45 | 54 | 41 | 56 | 44 | 55 | |||
| ≥61 | 44 | 67 | 44 | 68 | 55 | 59 | |||
| 0.125 | 0.054 | 0.714 | |||||||
| Normal | 66 | 80 | 63 | 83 | 72 | 77 | |||
| Abnormal | 4 | 1 | 0 | 5 | 2 | 3 | |||
| 0.882 | 0.492 | 0.937 | |||||||
| Normal | 68 | 79 | 62 | 85 | 72 | 78 | |||
| Abnormal | 4 | 2 | 1 | 3 | 2 | 2 | |||
| 0.868 | 0.868 | 1.000 | |||||||
| Normal | 27 | 33 | 26 | 35 | 29 | 32 | |||
| Abnormal | 43 | 48 | 37 | 53 | 45 | 48 | |||
| 0.510 | 0.101 | 0.333 | |||||||
| Normal | 29 | 47 | 27 | 50 | 34 | 44 | |||
| Abnormal | 41 | 34 | 36 | 38 | 40 | 36 | |||
| 1.000 | 0.485 | 0.217 | |||||||
| Papillary | 44 | 59 | 44 | 58 | 44 | 61 | |||
| Non-papillary | 45 | 62 | 41 | 66 | 55 | 53 | |||
| 0.161 | 0.066 | 0.782 | |||||||
| Extraductal CCA | 35 | 60 | 31 | 62 | 45 | 49 | |||
| Intraductal CCA | 54 | 61 | 54 | 62 | 54 | 65 | |||
| 0.569 | 0.575 | ||||||||
| Early stage | 38 | 46 | 42 | 41 | 42 | 43 | |||
| Late stage | 51 | 75 | 43 | 83 | 57 | 71 | |||
| 0.886 | 0.147 | 0.673 | |||||||
| Negative | 33 | 47 | 38 | 42 | 36 | 45 | |||
| Positive | 56 | 74 | 47 | 82 | 63 | 69 | |||
| 0.750 | 0.747 | 0.747 | |||||||
| Alive | 21 | 31 | 20 | 35 | 20 | 32 | |||
| Death | 68 | 90 | 65 | 92 | 65 | 92 | |||
NOTE: P-value were calculated using Pearson’s chi-square test.
‡indicating there are missing data.
L, Low and H, High expression.