| Literature DB >> 19210778 |
Sandra N Loevenich1, Erich Brunner, Nichole L King, Eric W Deutsch, Stephen E Stein, Ruedi Aebersold, Ernst Hafen.
Abstract
Entities:
Mesh:
Substances:
Year: 2009 PMID: 19210778 PMCID: PMC2648944 DOI: 10.1186/1471-2105-10-59
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The placement of PeptideAtlas. The placement of PeptideAtlas bridging global and targeted proteomics experiments is depicted. Protein extracts were prepared from diverse in vivo and in vitro samples. The proteins were digested and, after additional fractionation steps, analyzed via LC-MS/MS. The spectra were mapped to peptides that likely gave rise to them using sequence database searching. Subsequently, the mappings were statistically validated. Spectral centric data mining was facilitated by a relational database (SBEAMS). Then, re-organization of the data in a peptide-centric manner was the focus during PeptideAtlas construction: Data accumulated over many experiments, generated over the course of several months or years, was complexity-reduced and condensed concentrating on the actual peptide entities. This now allows for retrieval of high quality proteotypic peptides as well as for gene model validation based on expressed peptide sequences.
Databases used in this study.
| Ensembl | FlyBase | ||
| Eu- and Heterochromatin | Euchromatin | Heterochromatin | |
| Search against protein database | v27.3c | r3.2 | - |
| Search against 6-frame translation of DNA | v31e | r3.2.1 | - |
| Reference database for coordinate resolution | v42 | r4.3 | r3.2b2 |
Listed in the different columns are the Ensembl database releases used for searching and the FlyBase/BDGP releases they correspond to.
Figure 2GBrowse on the FlyBase website. The figure shows a screenshot of the genome browser GBrowse on the FlyBase website. The gene model of shibire (shi) on the forward strand of the X chromosome is currently known to have 9 different mRNAs encoding 2 different proteins. 29 peptides are displayed that map to only one location in the genome (shi) and have been observed at least twice. 23 of these peptides lie within exons. In addition, 6 peptides lie across 2 exons and cover 5 splice sites. The peptides are hyperlinked to the PeptideAtlas website where more information about each peptide is available.
Figure 3A 'lost' peptide in CG30084. Shown here is part of the gene CG30084 (it was called tun in release 4) of the FlyBase annotation. Within the FlyBase release 3.2 it was annotated to encode the 4 protein isoforms CG30084-PA, CG30084-PB, CG30084-PC, and CG30084-PD (shown in dark grey). The PeptideAtlas peptide PAp00061581 (PSIASITAPGSASAPAPVPSAAPTK) was part of the splice variant CG30084-PB (red frame, PAp00061581 highlighted in yellow). The annotations of the subsequent release 4.3 are shown in beige color. As one can see, none of the 4 isoforms in this newer release (CG30084-PA,-PC,-PE, and -PF) can account for the observed peptide.
Table listing the 8 "lost" peptides which are not in line with the latest annotation.
| PA accession | Peptide sequence | No. of observations |
| PAp00063021 | RIINFGSNHTANTATKALGAGSEAGAGAGVGMATATATATVGR | 1 |
| PAp00058607 | MELHKQYTTVGASMLTPPDAKAIIAGPTDLYVK | 1 |
| PAp00060480 | NNAPGLINAGIVELDSHNLILAR | 1 |
| PAp00053627 | ISMHSAAICPPGAR | 1 |
| PAp00068376 | TTTTDGAIRR | 3 |
| PAp00047403 | ERERTTIVR | 4 |
| PAp00051256 | HEVAVGAEQGGADNLR | 1 |
| PAp00071978 | YPLYYTVHSAPEQHHIHVYHLPVCK | 1 |
Figure 4A peptide highlights a missing splice form. Part of the gene model of the Na pump alpha subunit (Atpα, CG5670) is depicted. In front of the black background, different types of sequence data are displayed: several predictions (light pink, purple, and different shades of turquoise), conserved coding regions (bright yellow), cDNAs alignments (greens), and peptides from the PeptideAtlas (bright pink). In front of the light blue background, alternative splice forms annotated in release 5.12 are shown in dark blue. The PeptideAtlas peptide PAp00073066 was identified in a 6-frame search and maps within the Atpα gene region. Note that while prediction algorithms postulate an alternative exon in this region, there are no supporting cDNAs (nor ESTs; not shown). The splice variant Atpα-PI, added in FlyBase annotation release 5.11, now accounts for the identified peptide sequence, NPEIDNLVNER. The codon for the last residue of the peptide spans the adjacent intron, thus supporting the annotated splice sites.