| Literature DB >> 23324392 |
Wilson Wen Bin Goh1, Mengyuan Fan, Hong Sang Low, Marek Sergot, Limsoon Wong.
Abstract
BACKGROUND: Proteomics Signature Profiling (PSP) is a novel hit-rate based method that proved useful in resolving consistency and coverage issues in proteomics. As a follow-up study, several points need to be addressed: 1/ PSP's generalisability to pathways, 2/ understanding the biological interplay between significant complexes and pathway subnets co-located on the same pathways on our liver cancer dataset, 3/ understanding PSP's false positive rate and 4/ demonstrating that PSP works on other suitable proteomics datasets as well as expanding PSP's analytical resolution via the use of specialised ontologies.Entities:
Mesh:
Year: 2013 PMID: 23324392 PMCID: PMC3636053 DOI: 10.1186/1471-2164-14-35
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Analytical pipeline, clustering results and GO term distributions. A: Detected proteins in mod- and poor-stage HCC patients are used to build PDSs (Pathway-Derived Subnets) from an integrated pathway database (PathwayAPI). These PDSs are used for calculating hit rates for each patient to generate a PSP. The set of PSPs are used for sample class analysis as well as significant feature identification. B: Sample class analysis PDSs have sufficient resolution to segregate mod- and poor-stage patients with high confidence. C: Significant GO term distribution A large number of significant GO terms are associated with metabolic functions. Although cancer-associated terms such as apoptosis, growth and immune responses are also uncovered. This is consistent with earlier observations based on this dataset
Figure 2Left graph (red) shows absolute count distribution of false positive features while right (purple) is the distribution of proportion (false positive features/total number of features). At 5% significance, the left shift of peaks is within expectation. However, the frequency distributions is still rather high, implying internal clustering among poor patients. This is expected given high variability of reported proteins between poor stage liver cancer patients
Figure 3False positive distribution for PSP (A) and PDS (B). Co-localisation and expression profile of PDSs and complexes (DNA synthesome and TNF-alpha/NF-kappa B signaling complex 5) on the purine metabolism pathway. A: The purine metabolic pathway is shown as an undirected graph. Significant PSP clusters are highlighted in green while significant PDS is shown in purple. B and C: Regression plots for expression scores of protein complexes and PDSs co-located on the same biological pathway in mod and poor stage respectively. Enveloped in a checked circle are two complex outliers that are low expressing in the mod stage relative to the co-locating PDS but swung to overexpression in the poor stage
List of potentially novel and novel lipid associated complexes implicated in liver cancer
| PN | 1096 | SNX and PDGF receptor complex | reported to be involved in transport and transmembrane signal transduction | lipid involvement yes |
| PN | 1104 | Transferrin receptor complex | reported involved in transport and receptor mediated endocytosis | lipid association yes |
| PN | 563 | Complex V; F1F0 ATPase | energy production, mitochondrial processes, particularly in heart muscle | lipid involvement yes |
| PN | 654 | BLOC1-BLOC2 complex | transport and targeting | lipid involvement potentially novel |
| PN | 142 | CD147-gamma-secretase complex (APH-1a, PS-1, PEN-2, NCT variant) | signaling, protein fate | lipid involvement yes |
| PN | 652 | AP3-Bloc1-complex | transport | lipid involvement yes |
| PN | 657 | Retromer complex (SNX1, SNX2, VPS35, VPS29, VPS26A) | protein targeting and transport | lipid involvement yes |
| PN | 2837 | Profilin 1 complex | cytoskeleton organisation, endocytosis, potential metastasis involvement | lipid involvement yes |
| PN | 1060 | Retromer complex (SNX1, SNX2, VPS35, VPS29, VPS26B) | transport | lipid association yes |
| N | 280 | HMGB1-HMGB2-HSC70-ERP60-GAPDH complex | DNA repair, nucleic acid binding, response to stress and DNA damage stimulus);high cancer association | lipid involvement not immediately decipherable |
| N | 312 | Cell cycle kinase complex CDK4l | cell cycle control, cancer association yes | lipid involvement not immediately decipherable |
| N | 247 | RalBP1-CCNB1-AP2A-NUMB-EPN1 complex | cell cycle control, endocytosis | lipid involvement not immediately decipherable |
| N | 5230 | CHUK-NFKB2-REL-IKBKG-SPAG9-NFKB1-NFKBIE-COPB2-TNIP1-NFKBIA-RELA-TNIP2 complex | Signaling, cancer association yes | lipid involvement not immediately decipherable |
| N | 311 | Cell cycle kinase complex CDK2 | cell cycle control, cancer association yes | lipid involvement not immediately decipherable |
| N | 5423 | HSP70-BAG5-PARK2 complex | protein folding and stabilisation | lipid involvement not immediately decipherable |
| N | 2390 | CD98-LAT2-ITGB1 complex | cell adhesion, may define cell polarity which in turn has to do with invasiveness | lipid involvement not immediately decipherable |
*PN – potentially novel, N – novel.
Figure 4Overlaps between significant complexes identified via lipid-associated GO BP, CC and MF terms. The overlaps between significant lipid-associated clusters identified via lipid-associated GO BP, CC and MF terms are expressed as a venn diagram. For all 3 categories, about at least half of the complexes are shared between two categories
Figure 5Generalisability tests using NSCLC dataset. A: The underlying patient subclasses can be recovered with high confidence using complexes. B: The left histogram shows the absolute (Abs) count of significant clusters per randomisation. The right histogram is the number of significant clusters normalised by the total number of randomisations (ratio sig clusters). C: Protein support over 30 samples. Most proteins are only supported by a subset of samples