| Literature DB >> 23226516 |
Mathias Kuhring1, Bernhard Y Renard.
Abstract
Proteogenomic approaches have gained increasing popularity, however it is still difficult to integrate mass spectrometry identifications with genomic data due to differing data formats. To address this difficulty, we introduce iPiG as a tool for the integration of peptide identifications from mass spectrometry experiments into existing genome browser visualizations. Thereby, the concurrent analysis of proteomic and genomic data is simplified and proteomic results can directly be compared to genomic data. iPiG is freely available from https://sourceforge.net/projects/ipig/. It is implemented in Java and can be run as a stand-alone tool with a graphical user-interface or integrated into existing workflows. Supplementary data are available at PLOS ONE online.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23226516 PMCID: PMC3514238 DOI: 10.1371/journal.pone.0050246
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Overview of iPiG components and their graphical user interfaces.
iPiG consists of the main program (PSM mapping) and two optional tools. In likely order of use, there is a download tool to retrieve necessary resource data easily and automatically, a gene filter to check consistency of required annotations and finally the mapping procedure itself. Although not part of iPiG, the output files allow the direct visualization in a genome browser as a final step.
Figure 2PSMs in a genome browser.
Custom annotation tracks showing mapped peptides from an iPiG BED file imported into the UCSC Genome Browser. The peptide tracks only contain unique peptides that are mapped to referenced genes (annotation-filtered mapping). The peptides can be easily shown in context with other annotations such as the UCSC Genes (blue track). In particular, here PSMs are shown that mapped to the region of the gene CELA2A on chromosome one. The first PSM track (squish) shows all PSMs individually, where the different colors (grey, orange, green) code for different identification scores (low, mid, high). The second PSM track (dense) shows the PSMs overlapping and thus indicating a contiguous object. Both PSM tracks accurately represent the exon-intron structure of the corresponding gene, where thick blocks indicate exon parts and thin lines and arrows indicate intron parts resp. strand orientation.