| Literature DB >> 27167498 |
Agnieszka Latosinska1,2, Manousos Makridakis1, Maria Frantzi3, Daniel M Borràs4,5,6, Bart Janssen4, William Mullen7, Jerome Zoidakis1, Axel S Merseburger8,9, Vera Jankowski10, Harald Mischak3,7, Antonia Vlahou1.
Abstract
Characterization of disease-associated proteins improves our understanding of disease pathophysiology. Obtaining a comprehensive coverage of the proteome is challenging, mainly due to limited statistical power and an inability to verify hundreds of putative biomarkers. In an effort to address these issues, we investigated the value of parallel analysis of compartment-specific proteomes with an assessment of findings by cross-strategy and cross-omics (proteomics-transcriptomics) agreement. The validity of the individual datasets and of a "verified" dataset based on cross-strategy/omics agreement was defined following their comparison with published literature. The proteomic analysis of the cell extract, Endoplasmic Reticulum/Golgi apparatus and conditioned medium of T24 vs. its metastatic subclone T24M bladder cancer cells allowed the identification of 253, 217 and 256 significant changes, respectively. Integration of these findings with transcriptomics resulted in 253 "verified" proteins based on the agreement of at least 2 strategies. This approach revealed findings of higher validity, as supported by a higher level of agreement in the literature data than those of individual datasets. As an example, the coverage and shortlisting of targets in the IL-8 signalling pathway are discussed. Collectively, an integrative analysis appears a safer way to evaluate -omics datasets and ultimately generate models from valid observations.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27167498 PMCID: PMC4863247 DOI: 10.1038/srep25619
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Overview of the study workflow.
The main steps of data collection and analysis are presented.
Summary of the average number of peptides and proteins (including single peptide identifications) identified in individual samples (n = 5) in the T24 and T24M bladder cancer cells.
| Conditioned Medium | ER/Golgi Fraction | Total Cell Extract | |
|---|---|---|---|
| T24 | 5,814 ± 19,48 | 7,354 ± 295 | 9,887 ± 515 |
| T24M | 6,292 ± 716 | 7,241 ± 667 | 10,237 ± 382 |
| T24 and T24M | 6,053 ± 14,07 | 7,298 ± 490 | 10,062 ± 466 |
| T24 | 1,083 ± 224 | 1,530 ± 55 | 1,922 ± 108 |
| T24M | 1,150 ± 86 | 1,501 ± 96 | 1,965 ± 57 |
| T24 and T24M | 1,116 ± 164 | 1,515 ± 75 | 1,944 ± 85 |
Figure 2Overview of the numbers of proteins identified in total cell extract, ER/Golgi fraction and conditioned medium.
Venn diagram representing a comparative analysis of all proteins (≥2 peptides) identified following application of the three individual protein fractionation strategies.
Figure 3Functional analysis of differentially expressed proteins.
Comparison of biological functions for the differentially expressed proteins identified in total cell extract, ER/Golgi fraction and conditioned medium.
Differentially expressed proteins in T24M versus T24 cells supported by all three proteomics strategies.
| Accession | Protein Name | Cell-Extract | ER/Golgi | Conditioned Medium | |||
|---|---|---|---|---|---|---|---|
| Fold Change | p-value | Fold Change | p-value | Fold Change | p-value | ||
| P08582** | Melanotransferrin | only T24M | 0.01 | 2.52 | 0.01 | Only T24M | 0.01 |
| P05161** | Ubiquitin-like protein ISG15 | 4.99 | 0.01 | 6.06 | 0.01 | Only T24M | 0.01 |
| Q13642** | Four and a half LIM domains protein 1 | 2.31 | 0.01 | 1.86 | 0.01 | 10.8 | 0.01 |
| P61769** | Beta-2-microglobulin | 1.78 | 0.05 | 16.68 | 0.03 | 1.60 | 0.03 |
| Q14011** | Cold-inducible RNA-binding protein | 1.52 | 0.01 | Only T24M | 0.02 | 1.76 | 0.05 |
| Q09666** | Neuroblast differentiation-associated protein AHNAK | 0.64 | 0.01 | 0.52 | 0.01 | 0.54 | 0.01 |
| Q01813** | 6-phosphofructokinase type C | 0.54 | 0.01 | 0.30 | 0.01 | only T24 | 0.02 |
| P15559** | NAD(P)H dehydrogenase [quinone] 1 | 0.54 | 0.01 | 0.51 | 0.05 | 0.12 | 0.01 |
| P06396** | Gelsolin | 0.53 | 0.01 | 0.51 | 0.01 | 0.48 | 0.01 |
| Q9UBG0** | C-type mannose receptor 2 | 0.52 | 0.03 | 0.4 | 0.02 | 0.54 | 0.01 |
| P11413** | Glucose-6-phosphate 1-dehydrogenase | 0.52 | 0.01 | 0.51 | 0.01 | 0.56 | 0.01 |
| P08670 | Vimentin | only T24M | 0.01 | Only T24M | 0.01 | Only T24M | 0.01 |
| P28838 | Cytosol aminopeptidase | 2.22 | 0.01 | 3.57 | 0.01 | 2.46 | 0.01 |
| P13797 | Plastin-3 | 1.79 | 0.01 | 1.63 | 0.01 | 1.66 | 0.01 |
| P05362 | Intercellular adhesion molecule 1 | 1.64 | 0.01 | 2.3 | 0.01 | 3.05 | 0.01 |
| O95336 | 6-phosphogluconolactonase | 1.60 | 0.01 | 3.79 | 0.01 | 4.15 | 0.01 |
| P26639 | Threonine--tRNA ligase, cytoplasmic | 1.54 | 0.01 | 1.50 | 0.01 | 2.32 | 0.01 |
| P00492 | Hypoxanthine-guanine phosphoribosyltransferase | 1.51 | 0.01 | 2.12 | 0.01 | 1.77 | 0.01 |
| Q8IV08 | Phospholipase D3 | 1.86 | 0.01 | Only T24M | 0.02 | only T24M | 0.02 |
**Differential expression (fold change >1.5) was also supported by transcriptomics.
Assessment of the validity of proteomics findings based on literature.
| Cell line study (T24M vs. T24) | # overlapping molecules | |||
|---|---|---|---|---|
| Category | # molecules | BcCluster | Glad4U | |
| Individual proteomic analysis | Total cell extract | 253 | 19 (7.5%) | 27 (10.7%) |
| ER/Golgi | 217 | 22 (10.1%) | 28 (12.9%) | |
| Conditioned medium | 256 | 27 (10.5%) | 38(14.8%) | |
| Compilation from all proteomics methods | 614 | 54 (8.8%) | 71 (11.6%) | |
| “omics” verified findings | Agreement in all 4 strategies | 11 | 2 (18.2%) | 4 (36.4%) |
| Agreement in 2 or 3 out of 4 strategies | 242 | 31 (12.8%) | 36 (14.9%) | |
| All verified proteins | 253 | 33 (13.0%) | 40 (15.8%) | |
Findings from individual proteomics experiments (CE, ER/Golgi, CM) as well as from the “verified” dataset (established based on statistical significance and expression trend agreement between at least 2 strategies e.g. proteomics analysis of CE, ER/Golgi, CM, and transcriptomics) were evaluated. Proteins extracted from the bladder cancer database (BcCluster)20 and by using GLAD4U21 were utilized as reference.
Top 15 Ingenuity Canonical Pathways predicted to be enriched (p < 0.05) based on the integrated “verified” dataset.
| # | Ingenuity Canonical Pathways | Molecules | Rank, Cell extract | Rank, ER/Golgi | Rank, CM | Rank, Glad4U | Rank, BCCluster |
|---|---|---|---|---|---|---|---|
| 1 | Interferon Signaling | IFIT1, STAT1, IFIT3, ISG15, MX1, IFI35, BAX | Not predicted | ||||
| 2 | Superoxide Radicals Degradation | SOD2, SOD3, CAT, NQO1 | 92 (n.s.) | Not predicted | |||
| 3 | Caveolar-mediated Endocytosis Signaling | HLA-A, ITGAV, DNM2, ITGB1, ITGA6, B2M, HLA-B, FLOT2 | |||||
| 4 | Hepatic Fibrosis / Hepatic Stellate Cell Activation | NFKB2, IGFBP4, COL12A1, STAT1, CSF1, BAX, IL6, ICAM1, VEGFC, COL18A1, CXCL8, MYH9 | 200 (n.s.) | 197 (n.s.) | |||
| 5 | Role of Tissue Factor in Cancer | ITGAV, ITGB1, CSF1, ITGA6, CXCL1, YES1, VEGFC, CXCL8, ARRB1 | 100 (n.s.) | ||||
| 6 | Role of IL-17F in Allergic Inflammatory Airway Diseases | NFKB2, CXCL6, CXCL10, CXCL1, IL6, CXCL8 | 275 (n.s.) | Not predicted | |||
| 7 | Putrescine Degradation III | ALDH3A2, IL4I1, ALDH3A1, ALDH1A3 | 149 (n.s.) | 348 (n.s.) | 336 (n.s.) | ||
| 8 | Tryptophan Degradation X (Mammalian, via Tryptamine) | ALDH3A2, IL4I1, ALDH3A1, ALDH1A3 | 154 (n.s.) | 351 (n.s.) | 340 (n.s.) | ||
| 9 | Ethanol Degradation IV | ALDH3A2, CAT, ALDH3A1, ALDH1A3 | Not predicted | 357 (n.s.) | 420 (n.s.) | ||
| 10 | Complement System | CFB, C3, C1QBP, C1S, C1R | Not predicted | 195 (n.s.) | Not predicted | 339 (n.s.) | |
| 11 | Dopamine Degradation | ALDH3A2, IL4I1, ALDH3A1, ALDH1A3 | 168 (n.s.) | 362 (n.s.) | |||
| 13 | Virus Entry via Endocytic Pathways | HLA-A, DNM2, ITGB1, ITGA6, B2M, HLA-B, RAC2 | |||||
| 14 | Role of Pattern Recognition Receptors in Recognition of Bacteria and Viruses | EIF2AK2, NFKB2, C3, OAS3, OAS2, DDX58, IL6, CXCL8 | 128 (n.s.) | ||||
| 15 | Pentose Phosphate Pathway | TKT, G6PD, PGLS | 380 (n.s.) | 391 (n.s.) |
Pathways were ranked based on the significance level. Subsequently, the overlap between the top 15 pathways, as defined based on the integrated “verified” dataset, and pathways predicted based on the individual proteomics datasets (CE, ER/Golgi, CM) and literature mined dataset was established. The respective rank for the overlapping pathways is indicated. Significant pathways with the rank ≤ 15 are marked in bold, while significant pathways with rank > 15 are marked in italics. (n.s. not significant results).
Figure 4Graphical representation of the IL-8 signaling pathway based on multi-omics profiling.
Protein changes identified by each of the individual proteomics strategies (CE, ER/Golgi, CM) are shown, as well as those supported by at least two experimental strategies (CE, ER/Golgi, CM, transcriptomics). The expression trend of each molecule in T24M vs. T24 cells is indicated with arrows ( for up- and for down-regulated proteins in T24M vs. T24).