| Literature DB >> 24833333 |
Wilson Wen Bin Goh1, Limsoon Wong2, Judy Chia Ghee Sng3.
Abstract
The integration of networks with genomics (network genomics) is a familiar field. Conventional network analysis takes advantage of the larger coverage and relative stability of gene expression measurements. Network proteomics on the other hand has to develop further on two critical factors: (1) expanded data coverage and consistency, and (2) suitable reference network libraries, and data mining from them. Concerning (1) we discuss several contemporary themes that can improve data quality, which in turn will boost the outcome of downstream network analysis. For (2), we focus on network analysis developments, specifically, the need for context-specific networks and essential considerations for localized network analysis.Entities:
Year: 2013 PMID: 24833333 PMCID: PMC4009760 DOI: 10.3390/biology3010022
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1A comparison of the different features compassing each acquisition strategy. The color coding represents the strength of the data acquisition, with warmer colors as strong and cooler colors as weak. Coverage is the extent of the underlying assayable proteome. Iterative Discovery is whether the spectra can be revisited for further validations. Reproducibility refers to consistency of identifications and measurements. Sensitivity is the ability to detect low abundance ions. Analytical ease refers to whether the data is readily analyzable with minimal computational resources. Throughput refers to the number of samples that can be handled and tested simultaneously. (Abbreviations: DDA, Data Dependent Acquisition; TDA, Target Dependent Acquisition; DIA, Data Independent Acquisition).
Figure 2An overview outlining a general workflow using networks in proteomics. Panel A is a simple depiction of sample preparation and proteomic processing, producing a list of identified/quantified proteins. This data set normally suffers from inconsistency and coverage issues (if performed via shot-gun methods) which is difficult to analyze effectively. Panel B depicts a simple subnet-based workflow where dysregulated proteins (red) are mapped onto a network, and used to predict novel clusters. These novel clusters would consist of undetected proteins which can be re-checked (Recovery) against the original mass spectra (peptide-spectra matches that did not meet the filtering requirements). Alternatively, recovery can also be achieved via the use of SRM/MRM proteomics (not indicated). The improved data set (with expanded set of proteins), in their respective clusters (biologically contextualized) can be deployed for functional analyses. Panel C shows an example of how proteomics can improve the reference network. Here, affinity purification coupled to proteomics-based direct monitoring methods can be used to identify state-specific rewiring events or novel interactions between proteins under various conditions. These in turn, can be used to build more accurate/higher quality reference networks for biological analysis.
Pros and Cons of the various Proteomics Pipelines.
| Pipeline | Pros | Cons |
|---|---|---|
| TPP | Very streamlined and comprehensive | Limited software options—no conversion or preprocessing; lack of expandability/flexibility or integration options |
| Proteomatic | Very user-friendly interface; Attempts at data integration from various platforms e.g., | Limited software options; lack of expandability/flexibility or integration options; Not as streamlined as TPP |
| OpenMS/TOPP | Large software options; Highly expandable/flexible and many integration options | Lack of annotation and examples; can be unstable and many software are not tested rigorously |