| Literature DB >> 26811807 |
Justin W Walley1, Steven P Briggs1.
Abstract
One of the objectives of genome science is the discovery and accurate annotation of all protein-coding genes. Proteogenomics has emerged as a methodology that provides orthogonal information to traditional forms of evidence used for genome annotation. By this method, peptides that are identified via tandem mass spectrometry are used to refine protein-coding gene models. Namely, these peptides are used to confirm the translation of predicted protein-coding genes, as evidence of novel genes or for correction of current gene models. Proteogenomics requires deep and broad sampling of the proteome in order to generate sufficient numbers of unique peptides. Therefore, we propose that proteogenomic projects are designed so that the generated peptides can also be used to create a comprehensive protein atlas that quantitatively catalogues protein abundance changes during development and in response to environmental stimulus.Entities:
Keywords: Annotation; Atlas; Proteogenomics; Proteomics
Year: 2015 PMID: 26811807 PMCID: PMC4723421 DOI: 10.1016/j.cpb.2015.02.001
Source DB: PubMed Journal: Curr Plant Biol ISSN: 2214-6628
Fig. 1Examples of gene model revision. Currently annotated exons are shown in red. Gene model revision suggested by novel peptides is depicted in black. Proteogenomically identified peptides are shown in yellow.
Proteogenomic publications in plants. (If Novel Genes and Model Revision were not clearly identified all values went into the Model Revision Column.)
| Organism | Peptides | Proteins | Novel peptides | Novel genes | Model revision | Citation |
|---|---|---|---|---|---|---|
| 86,456 | 13,029 | 261 | 22 | 35 | [ | |
| 144,079 | 12,769 | 18,024 | 778 | 695 | [ | |
| 4943 | 56 | [ | ||||
| 9336 | 932 | 3 | 65 | [ | ||
| 15,121 | 5034 | 166 | 40 | [ | ||
| 78,647 | 9843 | 1568 | 32 | 293 | [ | |
| 225,166 | 14,615 | 24,782 | 165 | 1904 | [ | |
| 203 | 17 | 5 | 8 | [ |
Plant protein atlas publications. To be considered a protein atlas publication we required the quantification of protein abundance for at least several thousand proteins across three or more cell-types and/or plant anatomical structures.
| Organism | Proteome coverage | Samples | Phosphorylation | Citation |
|---|---|---|---|---|
| 13,029 | Multiple developmental stages from roots, leaves, flowers and seeds | No | [ | |
| 1995 | Six root cell types | No | [ | |
| 14,165 | Aleurone/pericarp as well as multiple developmental stages of endosperm and embryo | Yes | [ | |
| 7538 | Leaf, root and stem | No | [ |